Abstract
Graph-based semi-supervised image annotation has achieved great success in a variety of studies, yet it suffers from both irrelevant or noisy features (referred to as feature outliers) and unusual or corrupted samples (referred to as sample outliers). In this work, we investigate how to derive a robust sample affinity matrix via simultaneous feature and sample outlier pursuit. This task is formulated as a Dual-outlier and Prior-driven Low-Rank Representation (DP-LRR) problem, whose objective function is convex. In DP-LRR, the clean data are assumed to be self-reconstructible with a low-rank coefficient matrix, as in LRR, while the error matrix is decomposed into the sum of a row-wise sparse matrix and a column-wise sparse matrix, whose ℓ2,1-norm minimization encourages the pursuit of feature and sample outliers, respectively. DP-LRR is further regularized by priors from side information, namely the inhomogeneous data pairs. An efficient iterative procedure based on the linearized alternating direction method is presented to solve the DP-LRR problem, with closed-form solutions within each iteration. The derived low-rank reconstruction coefficient matrix is then fed into any graph-based semi-supervised label propagation algorithm for image annotation; as a by-product, the cleaned data from DP-LRR can also serve as a better image representation to boost image annotation performance. Extensive experiments on the MIRFlickr, Corel30K, NUS-WIDE-LITE, and NUS-WIDE databases demonstrate the effectiveness of the proposed formulation for robust image annotation.
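The closed-form steps mentioned in the abstract can be illustrated with the standard building blocks that such solvers usually rely on. The NumPy sketch below is not the authors' implementation: it assumes the data matrix stores features as rows and samples as columns, and every function name (`l21_norm`, `prox_l21`, `svt`, `propagate_labels`), the affinity construction from the coefficient matrix, and the parameter `alpha` are hypothetical choices made for illustration. The prior term from inhomogeneous pairs is not specified in the abstract and is therefore left out.

```python
import numpy as np

def l21_norm(E, axis=0):
    """Sum of l2 norms taken along `axis` (axis=0 gives one norm per column)."""
    return float(np.linalg.norm(E, axis=axis).sum())

def prox_l21(Q, tau, axis=0):
    """Closed-form minimizer of tau*||E||_{2,1} + 0.5*||E - Q||_F^2.

    axis=0 shrinks whole columns (assumed here to be samples);
    axis=1 shrinks whole rows (assumed here to be features).
    """
    norms = np.linalg.norm(Q, axis=axis, keepdims=True)
    scale = np.maximum(norms - tau, 0.0) / np.maximum(norms, 1e-12)
    return Q * scale

def svt(Q, tau):
    """Singular value thresholding: minimizer of tau*||Z||_* + 0.5*||Z - Q||_F^2."""
    U, s, Vt = np.linalg.svd(Q, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt

def propagate_labels(Z, Y, alpha=0.9):
    """Graph label propagation on an affinity built from a coefficient matrix Z.

    The symmetrization (|Z| + |Z^T|)/2 and the normalized closed-form
    propagation are assumptions for this sketch, not taken from the paper.
    """
    W = (np.abs(Z) + np.abs(Z.T)) / 2.0
    np.fill_diagonal(W, 0.0)
    d = np.maximum(W.sum(axis=1), 1e-12)
    S = W / np.sqrt(np.outer(d, d))          # D^{-1/2} W D^{-1/2}
    n = W.shape[0]
    return np.linalg.solve(np.eye(n) - alpha * S, Y)
```

In an alternating scheme of the kind the abstract describes, `svt` would update the low-rank coefficient matrix, `prox_l21(..., axis=1)` the row-sparse (feature outlier) error term, and `prox_l21(..., axis=0)` the column-sparse (sample outlier) error term. The resulting coefficient matrix could then be passed to `propagate_labels` or to any other graph-based propagation routine.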
Index Terms
Robust image annotation via simultaneous feature and sample outlier pursuit