Abstract
This article introduces an image classification approach in which the semantic context of images and multiple low-level visual features are jointly exploited. The context consists of a set of semantic terms defining the classes to be associated to unclassified images. Initially, a multiobjective optimization technique is used to define a multifeature fusion model for each semantic class. Then, a Bayesian learning procedure is applied to derive a context model representing relationships among semantic classes. Finally, this context model is used to infer object classes within images. Selected results from a comprehensive experimental evaluation are reported to show the effectiveness of the proposed approaches.
- Aksoy, S., Koperski, K., Tusk, C., Marchisio, G., and Tilton, J. 2005. Learning bayesian classifiers for scene classification. IEEE Trans. Geosci. Remote Sensing 43, 3, 581--589.Google Scholar
Cross Ref
- Athanasiadis, T., Mylonas, P., Avrithis, Y., and Kollias, S. 2007. Semantic image segmentation and object labeling. IEEE Trans. Circ. Syst. Video Technol. 17, 3, 298--312. Google Scholar
Digital Library
- Berman, A. and Shapiro, L. 1999. A flexible image database system for content-based retrieval. Comput. Vis. Image Understand. 75, 1--2, 175--195. Google Scholar
Digital Library
- Chang, S., Sikora, T., and Purl, A. 2001. Overview of the mpeg-7 standard. IEEE Trans. Circ. Syst. Video Technol. 11, 6, 688--695. Google Scholar
Digital Library
- Chun, Y., Kim, N., and Jang, I. 2008. Content-Based image retrieval using multiresolution color and texture features. IEEE Trans. Multimedia 10, 6, 1073--1084. Google Scholar
Digital Library
- Cooper, G. and Herskovits, E. 1992. A Bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9, 4, 309--347. Google Scholar
Digital Library
- Csurka, G., Dance, C., Fan, L., Willamowski, J., and Bray, C. 2004. Visual categorization with bags of keypoints. In Proceedings of the Workshop on Statistical Learning in Computer Vision. Vol. 1, 1--22.Google Scholar
- Dagli, C. and Huang, T. 2004. A framework for grid-based image retrieval. In Proceedings of the 17th IEEE International Conference on Pattern Recognition. Vol. 2, 1021--1024. Google Scholar
Digital Library
- Datta, R., Joshi, D., Li, J., and Wang, J. 2008. Image retrieval: Ideas, influences, and trends of the new age. ACM Comput. Surv. 40, 2, 5. Google Scholar
Digital Library
- de Jong, F., Westerveld, T., and de Vries, A. 2007. Multimedia search without visual analysis: the value of linguistic and contextual information. IEEE Trans. Circ. Syst. Video Technol. 17, 3, 365--371. Google Scholar
Digital Library
- Deselaers, T., Keysers, D., and Ney, H. 2008. Features for image retrieval: an experimental comparison. Inf. Retr. 11, 2, 77--107. Google Scholar
Digital Library
- Everingham, M., van Gool, L., Williams, C. K. I., Winn, J., and Zisserman, A. 2010. The pascal visual object classes challenge. Int. J. Comput. Vis. 88, 2, 303--338. Google Scholar
Digital Library
- Fan, J., Gao, Y., Luo, H., and Jain, R. 2008. Mining multilevel image semantics via hierarchical classification. IEEE Trans. Multimedia 10, 2, 167--187. Google Scholar
Digital Library
- Gehler, P. and Nowozin, S. 2009. On feature combination for multiclass object classification. In Proceedings of the 12th IEEE International Conference on Computer Vision. 221--228.Google Scholar
- Huiskes, M. and Lew, M. 2008. The mir flickr retrieval evaluation. In Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval. 39--43. Google Scholar
Digital Library
- Huiskes, M. J., Thomee, B., and Lew, M. S. 2010. New trends and ideas in visual concept detection: the mir flickr retrieval evaluation initiative. In Proceedings of the International Conference on Multimedia Information Retrieval. 527--536. Google Scholar
Digital Library
- Jain, A. and Vailaya, A. 1996. Image retrieval using color and shape. Pattern Recogn. 29, 8, 1233--1244.Google Scholar
Cross Ref
- Jiang, W., Chang, S.-F., and Loui, A. 2007. Context-Based concept fusion with boosted conditional random fields. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'07). Vol. 1. I--949--I--952.Google Scholar
- Jiang, Y.-G., Wang, J., Chang, S.-F., and Ngo, C.-W. 2009. Domain adaptive semantic diffusion for large scale context-based video annotation. In Proceedings of the 12th IEEE International Conference on Computer Vision. 1420--1427.Google Scholar
- Jing, F., Li, M., Zhang, H., and Zhang, B. 2004. Relevance feedback in region-based image retrieval. IEEE Trans. Circ. Syst. Video Technol. 14, 5, 672--681. Google Scholar
Digital Library
- Kherfi, M. and Ziou, D. 2007. Image collection organization and its application to indexing, browsing, summarization, and semantic retrieval. IEEE Trans. Multimedia 9, 4, 893--900. Google Scholar
Digital Library
- Knowles, J. and Corne, D. 2000. Approximating the nondominated front using the Pareto archived evolution strategy. Evol. Comput. 8, 2, 149--172. Google Scholar
Digital Library
- Koskela, M., Smeaton, A., and Laaksonen, J. 2007. Measuring concept similarities in multimedia ontologies: Analysis and evaluations. IEEE Trans. Multimedia 9, 5, 912--922. Google Scholar
Digital Library
- Lavrenko, V., Feng, S., and Manmatha, R. 2004. Statistical models for automatic video annotation and retrieval. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Vol. 3, IEEE, 1044--1047.Google Scholar
- Levinshtein, A., Stere, A., Kutulakos, K., Fleet, D., Dickinson, S., and Siddiqi, K. 2009. Turbopixels: Fast superpixels using geometric flows. IEEE Trans. Pattern Anal. Mach. Intell. 31, 12, 2290--2297. Google Scholar
Digital Library
- Li, F., Dai, Q., Xu, W., and Er, G. 2008. Multilabel neighborhood propagation for region-based image retrieval. IEEE Trans. Multimedia 10, 8, 1592--1604. Google Scholar
Digital Library
- Li, J. and Wang, J. 2003. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. Pattern Anal. Mach. Intell. 25, 9, 1075--1088. Google Scholar
Digital Library
- Lowe, D. 2004. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 2, 91--110. Google Scholar
Digital Library
- Mikolajczyk, K. and Schmid, C. 2005. A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27, 10, 1615--1630. Google Scholar
Digital Library
- Natsev, A., Rastogi, R., and Shim, K. 2004. Walrus: A similarity retrieval algorithm for image databases. IEEE Trans. Knowl. Data Engin. 16, 3, 301--316. Google Scholar
Digital Library
- Qi, G.-J., Hua, X.-S., Rui, Y., Tang, J., Mei, T., and Zhang, H.-J. 2007. Correlative multi-label video annotation. In Proceedings of the 15th International Conference on Multimedia. 17--26. Google Scholar
Digital Library
- Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., and Belongie, S. 2007. Objects in context. In Proceedings of the 11th IEEE International Conference on Computer Vision. 1--8.Google Scholar
- Rasiwasia, N. and Vasconcelos, N. 2009. Holistic context modeling using semantic co-occurrences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1889--1895.Google Scholar
- Rui, Y., Huang, T., Ortega, M., and Mehrotra, S. 1998. Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Trans. Circ. Syst. Video Technol. 8, 5, 644--655. Google Scholar
Digital Library
- Russell, B., Torralba, A., Murphy, K., and Freeman, W. 2008. Labelme: A database and web-based tool for image annotation. Int. J. Comput. Vis. 77, 1, 157--173. Google Scholar
Digital Library
- Steuer, R. 1986. Multiple Criteria Optimization. Theory, Computation, and Application. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics, Wiley.Google Scholar
- Swain, M. and Ballard, D. 1991. Color indexing. Int. J. Comput. Vis. 7, 1, 11--32. Google Scholar
Digital Library
- Tang, J., Hua, X.-S., Wang, M., Gu, Z., Qi, G.-J., and Wu, X. 2009a. Correlative linear neighborhood propagation for video annotation. IEEE Trans. Syst. Man Cybernet. 39, 2, 409--416. Google Scholar
Digital Library
- Tang, J., Yan, S., Hong, R., Qi, G.-J., and Chua, T.-S. 2009b. Inferring semantic concepts from community-contributed images and noisy tags. In Proceedings of the 17th ACM International Conference on Multimedia (MM'09). 223--232. Google Scholar
Digital Library
- Torres, R. D. S., Falcao, A. X., Goncalves, M. A., Papa, J. A. P., Zhang, B., Fan, W., and Fox, E. A. 2009. A genetic programming framework for content-based image retrieval. Pattern Recogn. 42, 2, 283--292. Google Scholar
Digital Library
- Tuceryan, M. and Jain, A. 1993. Texture analysis. In Handbook of Pattern Recognition and Computer Vision, 235--276. Google Scholar
Digital Library
- Vailaya, A., Jain, A., and Zhang, H. 1998. On image classification: City images vs. landscapes. Pattern Recogn. 31, 12, 1921--1935.Google Scholar
Cross Ref
- Vedaldi, A., Gulshan, V., Varma, M., and Zisserman, A. 2009. Multiple kernels for object detection. In Proceedings of the 12th IEEE International Conference on Computer Vision. 606--613.Google Scholar
- Vogel, J. and Schiele, B. 2007. Semantic modeling of natural scenes for content-based image retrieval. Int. J. Comput. Vis. 72, 2, 133--157. Google Scholar
Digital Library
- Wang, J., Li, J., and Wiederhold, G. 2001. Simplicity: Semantics-Sensitive integrated matching for picture libraries. IEEE Trans. Pattern Anal. Mach. Intell. 23, 9, 947--963. Google Scholar
Digital Library
- Zhang, J. and Ye, L. 2009. Content based image retrieval using unclean positive examples. IEEE Trans. Image Process. 18, 10, 2370--2375. Google Scholar
Digital Library
- Zhang, L., Zhang, K., and Dong, X. 2011. Online sparse learning utilizing multi-feature combination for image classification. In Proceedings of the 18th IEEE International Conference on Image Processing. 197--200.Google Scholar
- Zhang, Q. and Izquierdo, E. 2007. Combining low-level features for semantic inference in image retrieval. EURASIP J. Adv. Signal Process. 1, 1--12. Google Scholar
Digital Library
- Zhang, R. and Zhang, Z. 2007. Effective image retrieval based on hidden concept discovery in image database. IEEE Trans. Image Process. 16, 2, 562--572. Google Scholar
Digital Library
- Zhu, X., Wu, X., Elmagarmid, A., Feng, Z., and Wu, L. 2005. Video data mining: Semantic indexing and event detection from the association perspective. IEEE Trans. Knowl. Data Engin. 17, 5, 665--677. Google Scholar
Digital Library
Index Terms
Multifeature analysis and semantic context learning for image classification
Recommendations
Does Haze Removal Help CNN-Based Image Classification?
Computer Vision – ECCV 2018AbstractHazy images are common in real scenarios and many dehazing methods have been developed to automatically remove the haze from images. Typically, the goal of image dehazing is to produce clearer images from which human vision can better identify the ...
Feature context for image classification and object detection
CVPR '11: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern RecognitionIn this paper, we presents a new method to encode the spatial information of local image features, which is a natural extension of Shape Context (SC), so we call it Feature Context (FC). Given a position in a image, SC computes histogram of other points ...
Entropy based image semantic cycle for image classification
ICONIP'12: Proceedings of the 19th international conference on Neural Information Processing - Volume Part VThis paper proposes a novel framework for image classification with an entropy based image semantic cycle. Entropy minimization leads to an optimal image semantic cycle where images are connected in the semantic order. For classification, the training ...






Comments