Abstract
To help users access news quickly and comprehensively, we design a news search and browsing system named GeoVisNews, in which the news elements “Where”, “Who”, “What” and “When” are enhanced via news geo-localization, image enrichment and joint ranking. For news geo-localization, we propose an Ordinal Correlation Consistent Matrix Factorization (OCCMF) model that maintains the relevance rankings of locations to a specific news document while simultaneously capturing intra-relations among locations and documents. To visualize news, we develop a novel method to enrich news documents with appropriate web images. Specifically, multiple queries are first generated from each news document for image search, and the appropriate images are then selected from the collected web images by an intelligent fusion approach based on multiple features. Given the geo-localized and image-enriched news resources, we further employ a joint ranking strategy to return relevant, timely and popular news items in response to user search queries. Extensive experiments on a large-scale news dataset collected from the web demonstrate the superior performance of the proposed approaches over related methods.
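As a rough illustration of the joint ranking idea, the sketch below combines relevance, timeliness and popularity into a single score and sorts news items by it. The abstract does not specify how the three signals are combined, so the weighted sum, the exponential time decay, the default weights and every name here (`NewsItem`, `joint_score`, `joint_rank`, `half_life_hours`) are assumptions made for illustration, not the method used in GeoVisNews.

```python
from dataclasses import dataclass


@dataclass
class NewsItem:
    title: str
    relevance: float   # query-document relevance in [0, 1] (assumed precomputed)
    age_hours: float   # hours since publication
    popularity: float  # normalized popularity in [0, 1] (assumed precomputed)


def joint_score(item, w_rel=0.5, w_time=0.3, w_pop=0.2, half_life_hours=24.0):
    """Hypothetical joint score: weighted sum of the three signals.

    Timeliness is modeled as an exponential decay that halves every
    `half_life_hours`; this decay is an assumption, not taken from the paper.
    """
    timeliness = 0.5 ** (item.age_hours / half_life_hours)
    return w_rel * item.relevance + w_time * timeliness + w_pop * item.popularity


def joint_rank(items, **weights):
    """Return the items sorted by descending joint score."""
    return sorted(items, key=lambda it: joint_score(it, **weights), reverse=True)


items = [
    NewsItem("older but highly relevant", relevance=0.9, age_hours=48.0, popularity=0.2),
    NewsItem("fresh and fairly relevant", relevance=0.7, age_hours=0.0, popularity=0.5),
    NewsItem("fresh but off-topic", relevance=0.2, age_hours=0.0, popularity=0.1),
]
ranked = joint_rank(items)
```

With these illustrative weights, a fresh and moderately popular item can outrank an older item that matches the query more closely; changing the weights or the half-life shifts that trade-off.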
Index Terms
Enhancing news organization for convenient retrieval and browsing