Abstract
The increasing number of multimedia data collections available today evinces the pressing need for methods capable of indexing and retrieving this content. Despite continuous advances in multimedia features and representation models, establishing an effective measure for comparing different multimedia objects remains a challenging task. While supervised and semi-supervised techniques have made relevant advances on similarity learning tasks, scenarios in which labeled data are non-existent require different strategies. In such situations, unsupervised learning has been established as a promising solution, capable of considering contextual information and the dataset structure for computing new similarity/dissimilarity measures. This article extends a recent unsupervised learning algorithm that uses an iterative re-ranking strategy to take advantage of different k-Nearest Neighbors (kNN) sets and rank correlation measures. Two novel approaches are proposed for computing the kNN sets and their corresponding top-k lists. The proposed approaches were validated in conjunction with various rank correlation measures, yielding superior effectiveness results in comparison with previous works. In addition, we evaluate the ability of the method to handle different types of multimedia objects, conducting an extensive experimental evaluation on various image and video datasets.
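As a rough illustration of the general idea of iterative re-ranking driven by kNN sets and a rank correlation measure, the sketch below recomputes a distance matrix from the overlap of top-k lists. It is not the article's formulation: the function names (`top_k_lists`, `jaccard_at_k`, `rerank`), the use of the Jaccard index as the correlation measure, the fixed value of k across iterations, and the `1 - score` update rule are all assumptions chosen for brevity.

```python
from typing import List


def top_k_lists(dist: List[List[float]], k: int) -> List[List[int]]:
    """Return, for each object, the indices of its k nearest neighbors.

    Ties are broken by index so the result is deterministic. Each object
    is its own nearest neighbor when its self-distance is the smallest.
    """
    n = len(dist)
    return [
        sorted(range(n), key=lambda j: (dist[i][j], j))[:k]
        for i in range(n)
    ]


def jaccard_at_k(a: List[int], b: List[int]) -> float:
    """Jaccard index of two top-k lists (assumed stand-in for a rank correlation measure)."""
    set_a, set_b = set(a), set(b)
    return len(set_a & set_b) / len(set_a | set_b)


def rerank(dist: List[List[float]], k: int, iterations: int = 2) -> List[List[float]]:
    """Iteratively replace distances with 1 - (top-k list similarity).

    Hypothetical update rule: two objects become closer when their
    neighborhoods overlap more, regardless of their original distance.
    """
    n = len(dist)
    for _ in range(iterations):
        lists = top_k_lists(dist, k)
        dist = [
            [1.0 - jaccard_at_k(lists[i], lists[j]) for j in range(n)]
            for i in range(n)
        ]
    return dist


if __name__ == "__main__":
    # Toy data: two well-separated groups of points on a line.
    points = [0, 1, 2, 10, 11, 12]
    d = [[float(abs(a - b)) for b in points] for a in points]
    for row in rerank(d, k=3):
        print(row)
```

In this toy setting, objects that share the same neighborhood end up at distance 0 and objects with disjoint neighborhoods at distance 1, which is the contextual effect such methods aim to exploit. A practical implementation would use a proper rank correlation measure over the top-k lists and an efficient neighbor search rather than the quadratic loops shown here.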
Unsupervised Similarity Learning through Rank Correlation and kNN Sets