ABSTRACT
Video is a very rich medium that is becoming increasingly dominant. A massive amount of video information is available, but very difficult to access if not adequately indexed: a challenging task to accomplish. We describe a Video Information Retrieval system, under development, that operates on a database composed of subtitled documents. The simultaneous analysis of video, subtitles and audio streams is performed in order to index, visualize and retrieve excerpts of video documents that share a certain emotional or semantic property.
References
- Bethard, S., Yu, H., Thornton, A., Hatzivassiloglou V., Jurafsky D. 2004. Automatic Extraction of Opinion Propositions and their Holders. In Proc. of the AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, Stanford University, USA.Google Scholar
- Carvalho, P., Sarmento, L., Silva, M. J., de Oliveira, E. 2009, Clues for detecting irony in user-generated contents: Oh...!! it's "so easy";). In Proceedings of TSA'09-1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion Measurement, Hong Kong, November 6. Google Scholar
Digital Library
- Chu, S., Narayanan, S. and Jay Kuo, C.-C. 2008. Environmental Sound Recognition using MP-based Features. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing.Google Scholar
- Clavel, C., Ehrette, T., Richard, G. 2005. Events Detection for an Audio-Based Surveillance System. Multimedia and Expo.Google Scholar
Cross Ref
- Cowling, M. and Sitte, R. 2003. Comparison of techniques for environmental sound recognition. Pat. Recogn. Lett. Vol 24 (15). Google Scholar
Digital Library
- Doherty J., Girgensohn A., Helfman J., Shipman F., and Wilcox L. 2003. Detail-on demand hypervideo. In Proceedings of ACM Multimedia '03. Google Scholar
Digital Library
- Ellis, D. 2001. Detecting Alarm Sounds. Consistent & Reliable Acoustic Cues for sound analysis (CRAC) workshop, pp. 59--62.Google Scholar
- Eronen, A. J., Peltonen, V. T., Tuomi, J. T., Klapuri, A. P., Fagerlund, S.; Sorsa, T.; Lorho, G.; Huopaniemi, J. 2006. Audio-based contextGoogle Scholar
- Esuli, A., Sebastiani, F. 2006. Determining term subjectivity and term orientation for opinion mining. In Proc. of the Conf. of the European Chapter of the Association for Computational Linguistics, ItalyGoogle Scholar
- Godbole, N., Srinivasaiah, M., Skiena S. 2007. Large-scale sentiment analysis for news and blogs. In Proc. of the Int. Conf. on WebLogs and Social Media (ICWSM 07), Colorado, USA.Google Scholar
- Harma, A., McKinney, M. F., Skowronek, J. 2005. Automatic surveillance of the acoustic activity in our living environment, ICME.Google Scholar
- Hatzivassiloglou, V., McKeown, K. 1997. Predicting the Semantic Orientation of Adjectives. In Proc. of the 35th Annual Meeting of the Association for Computational Linguistics, Spain. Google Scholar
Digital Library
- Jawahar, C. V., Chennupati, B., Paluri, B., Jammalamadaka, N. 2005. Video Retrieval Based on Textual Queries. Proc. of the 13th Int. Conf. on Advanced Computing and Communications.Google Scholar
- Jiachen X., Wichern, G., Thornburg, H., Spanias, A. 2008. Fast query by example of environmental sounds via robust and efficient cluster-based indexing. ICASSP 2008.Google Scholar
- Katsiouli P., Tsetsos V., Hadjiefthymiades S. 2007. Semantic video classification based on subtitles and domain terminologies Workshop on Knowledge Acquisition from Multimedia Content.Google Scholar
- Kennedy, L. and Ellis, D. 2004. Laughter Detection. In NIST Meeting Recognition Workshop ICASSP, pp. 118--121.Google Scholar
- Kim, S. M., Hovy, E. 2006. Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text. In Proc. of ACL/COLING Workshop on Sentiment and Subjectivity in Text. Google Scholar
Digital Library
- Langlois, T. and Marques, G. 2009. Music Classification Method Based on Timbral Features. International Symposium on Music Information Retrieval (ISMIR 2009), October, Kobe, Japan.Google Scholar
- Laskowski, K. and Schultz, T. 2008. Detection of Laughter-in-Interaction in Multichannel Close-Talk Microphone Recordings of Meetings. In Machine Learning for Multimodal Interaction. Google Scholar
Digital Library
- Oliveira, E., and Chambel, T. 2008. Emotional Video Album: getting emotions into the picture. The 4th Workshop on Emotion in Human-Computer Interaction: Designing for People, at HCI'2008, Liverpool, UK, September 1--5.Google Scholar
- Perttunen, M., Kleek, M., van Lassila, O.; Riekki, J. 2008. Mobile Ubiquitous Computing, Systems, Services and Technologies, Spain.Google Scholar
- Pradeep K. Atrey, Namunu C. Maddage and Mohan S. Kankanhalli. 2006. Audio Based Event Detection For Multimedia Surveillance, ICASSP.Google Scholar
- Sarmento, L., Carvalho, P., Silva, M. J., and Oliveira, E. de. 2009. Automatic creation of a reference corpus for political opinion mining in user-generated content. In Proceedings of TSA'09-1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion Measurement, Hong Kong, November 6. Google Scholar
Digital Library
- Yoo H. and Cho S., 2007. Video scene retrieval with interactive genetic algorithm. Multimedia Tools and Applications, 34, September. pp. 317--336. Google Scholar
Digital Library
- Yu, H., Hatzivassiloglou, V. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proc. of EMNLP'03, Japan. Google Scholar
Digital Library
- Zhi Z., Xin L., Xiaohong M., Qiang J. 2008. Adaptive Context Recognition Based on Audio Signal, ICPR08.Google Scholar
Index Terms
VIRUS




Comments