Abstract
This paper is a personal look back at twenty years of research in multimedia content analysis. It addresses the areas of audio, photo and video analysis for the purpose of indexing and retrieval from the perspective of a multimedia researcher. Whereas a general analysis of content is impossible due to the personal bias of the user, significant progress was made in the recognition of specific objects or events. The paper concludes with a brief outlook on the future.
- Cao, L., Chang, S.-F., Codella, N., Cotton, C., Ellis, D., Gong, L., Hill, M., Hua, G., Kender, J., Merler, M., Mu, Y., Natsev, A., and Smith, J. R. 2011. IBM Research and Columbia University TRECVID-2011 multimedia event detection (MED) system. In Proceedings of the NIST TRECVID Workshop.Google Scholar
- Chen, D., Odobez, J. M., and Bourlard, H. 2004. Text detection and recognition in images and video frames. J. Pattern Recog. Soc. 37, 3, 595--608.Google Scholar
Cross Ref
- Ghias, A., Logan, J., Chamberlin, D., and Smith, B. C. 1995. Query by humming: Musical information retrieval in an audio database. In Proceedings of the ACM Multimedia Conference. 231--236. Google Scholar
Digital Library
- Google. 2013. http://images.google.com. (Last accessed 7/13).Google Scholar
- Han, J., Farin, D., and de With, P. H. N. 2008. Broadcast court-net sports video analysis using fast 3-D camera modeling. IEEE Trans. Circuits Syst. Video Technol. 18, 11, 1628--1638. Google Scholar
Digital Library
- Lienhart, R., Kuhmünch, C. H., and Effelsberg, W. 1997a. On the detection and recognition of television commercials. In Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS'97). 509--516. Google Scholar
Digital Library
- Lienhart, R., Pfeiffer, S., and Effelsberg, W. 1997b. Video abstracting. Comm. ACM, 40, 12, 55--62. Google Scholar
Digital Library
- Moore, B. E., Ali, S., Mehran, R., and Shah, M. 2011. Visual crowd surveillance through a hydrodynamic lens. Comm. ACM, 54, 12, 64--73. Google Scholar
Digital Library
- Niblack, C. W., Barber, R., Equitz, W., Flickner, M. D., Glasman, E. H., Petkovic, D., Yanker, P., Faloutsos, C. H., and Taubin, G. 1993. QBIC project: Querying images by content using color, texture, and shape. In Proceedings of the SPIE 1908, Storage and Retrieval for Image and Video Databases.Google Scholar
- Rowley, H. A., Baluja, S., and Kanade, T. 1998. Neural network-based face detection. IEEE Trans. Pattern Anal. Machine Intell. 20, 1, 23--38. Google Scholar
Digital Library
- Rui, Y., Huang, T. H., Ortega, M., and Mehrotra, S. 1998. Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Trans. Circuits Syst. Video Technol. 8, 5, 644--655. Google Scholar
Digital Library
- Shah, M. 2010. Visual crowd surveillance is like hydrodynamics. In Proceedings of the ACM Multimedia Conference. 3--4. Google Scholar
Digital Library
- Uitdenbogerd, A., and Zobel, J. 1995. Melody matching techniques for large music databases. In Proceedings of the ACM Multimedia Conference. 57--66. Google Scholar
Digital Library
- Wactlar, H. D., Christel, M. G., Gong, Y., and Hauptmann, A. G. 1999. Lessons learned from building a terabyte digital video library. Computer 32, 2, 66--73. Google Scholar
Digital Library
- Wang, G., Hoiem, D., and Forsyth, D. 2012. Learning image similarity from Flickr groups using fast kernel machines. IEEE Trans. Pattern Anal. Machine Intell. 34, 11, 2177--2188. Google Scholar
Digital Library
- Zabih, R., Miller, J., and Mai, K. 1995. A feature-based algorithm for detecting and classifying scene breaks. In Proceedings of the ACM Multimedia Conference. 189--200. Google Scholar
Digital Library
- Zhang, H., Kankanhalli, A., and Smoliar, S. 1993. Automatic Partitioning of full-motion video. Multimedia Syst. 1, 10--28. Google Scholar
Digital Library
- Zhu, G., Huang, Q., Xu, C., Rui, Y., Jiang, S., Gao, W., and Yao, H. 2007. Trajectory based event tactics analysis in broadcast sports video. In Proceedings of the ACM Multimedia Conference. 58--67. Google Scholar
Digital Library
Index Terms
A personal look back at twenty years of research in multimedia content analysis
Recommendations
Analysing multimedia content in social networking environments
SAPMIA '10: Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and accessSocial and Peer-to-Peer (P2P) networks have received considerable interest in recent decades due to its focus on analysis and relationships among entities and on the patterns and implications of these relationships. In the meantime, with the rapid ...
Large scale content analysis engine
LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and miningThe evolution of IP video systems has resulted in unprecedented access to a wide range of video material for consumers via IPTV and Web delivery. Retrieval technologies help users find relevant content, but suffer from a paucity of reliable content ...
Affective content analysis of music video clips
MIRUM '11: Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategiesNowadays, the amount of multimedia contents is explosively increasing and it is often a challenging problem to find a content that will be appealing or matches users' current mood or affective state. In order to achieve this goal, an effcient indexing ...






Comments