skip to main content
research-article

Enhancing news organization for convenient retrieval and browsing

Authors Info & Claims
Published:27 December 2013Publication History
Skip Abstract Section

Abstract

To facilitate users to access news quickly and comprehensively, we design a news search and browsing system named GeoVisNews, in which the news elements of “Where”, “Who”, “What” and “When” are enhanced via news geo-localization, image enrichment and joint ranking, respectively. For news geo-localization, an Ordinal Correlation Consistent Matrix Factorization (OCCMF) model is proposed to maintain the relevance rankings of locations to a specific news document and simultaneously capture intra-relations among locations and documents. To visualize news, we develop a novel method to enrich news documents with appropriate web images. Specifically, multiple queries are first generated from news documents for image search, and then the appropriate images are selected from the collected web images by an intelligent fusion approach based on multiple features. Obtaining the geo-localized and image enriched news resources, we further employ a joint ranking strategy to provide relevant, timely and popular news items as the answer of user searching queries. Extensive experiments on a large-scale news dataset collected from the web demonstrate the superior performance of the proposed approaches over related methods.

References

  1. Amitay, E., Sivan, R., and Soffer, A. 2004. Web-a-Where: Geotagging web content. In Proceedings of SIGIR. 273--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Andogah, G. 2010. Geographically constrained information retrieva. Ph.D. thesis, University of Groningen.Google ScholarGoogle Scholar
  3. Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual web search engine. In Proceedings of WWW. 107--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Candeias, R. and Martins, B. 2011. Associating relevant photos to georeferenced textual documents through rank aggregation. In Proceedings of the Terra Cognita Workshop.Google ScholarGoogle Scholar
  5. Christel, M. G., Hauptmann, A. G., Wactlar, H. D., and Ng, T. D. 2002. Collages as dynamic summaries for news video. In Proceedings of MM. 561--569. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cilibrasi, R. L. and Vitanyi, P. M. B. 2007. The google similarity distance. IEEE Trans. Knowl. Data Eng. 19, 3, 370--383. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Coyne, B. and Sproat, R. 2001. WordsEye: An automatic text-to-scene conversion system. In Proceedings of Computer Graphics and Interactive Techniques. 487--496. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Delgado, D., Magalhaes, J., and Correia, N. 2010. Assisted news reading with automated illustrations. In Proceedings of MM. 1647--1650. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Deschacht, K. and Moens, M. 2008. Finding the best picture: Cross-media retrieval of content. In Proceedings of ECIR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ding, J., Gravano, L., and Shivakumar, N. 2000. Computing geographical scopes of web sources. In Proceedings of the International Conference on Very Large Data Bases. 545--556. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gey, F., Larson, R., Sanderson, M., Joho, H., Clough, P., and Petras, V. 2005. GeoCLEF: The CLEF 2005 cross-language geographic information retrieval track overview. In Proceedings of CLEF'05. 908--919. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Gravier, G., Guinaudeau, C., Lecorvé, G., and Sébillot, P. 2011. Exploiting speech for automatic TV delinearization: From streams to cross-media semantic navigation. J. Image Video Proc.Google ScholarGoogle Scholar
  13. Huston, S. and Croft, W. B. 2010. Evaluating verbose query processing techniques. In Proceedings of SIGIR. 291--298. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Järvelin, K. and Kekäläinen, J. 2002. Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 4, 422--446. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jiao, B., Yang, L., Xu, J., and Wu, F. 2010. Visual summarization of web pages. In Proceedings of SIGIR. 499--506. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Joachims, T. 2002. Optimizing search engines using clickthrough data. In Proceedings of KDD. 31--43. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Jones, K. S., Walker, S., and Robertson, S. E. 2000. A probabilistic model of information retrieval: Development and comparative experiments. Inf. Proc. Manage. 36, 6, 779--808. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Joshi, D., Wang, J. Z., and Li, J. 2004. The story picturing engine: Finding elite images to illustrate a story using mutual reinforcement. In Proceedings of the ACM Workshop on Multimedia Information Retrieval. 119--126. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. King, B. M. and Minium, E. M. 1999. Statistical Reasoning in Psychology and Education. Wiley, New York.Google ScholarGoogle Scholar
  20. Kumaran, G. and Carvalho, V. R. 2009. Reducing long queries using query quality predictors. In Proceedings of SIGIR. 564--571. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Law-To, J., Grefenstete, G., Gauvain, J.-L., Gravier, G., Lamel, L., and Despres, J. 2009. VoxaleadNews: Robust automatic segmentation of video content into browsable and searchable subjects. In Proceedings of MM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Leidner, J. L. 2008. Toponym resolution in text: Annotation, evaluation and applications of spatialgrounding of place names.dissertation.com.Google ScholarGoogle Scholar
  23. Li, Z., Liu, J., Zhu, X., Liu, T., and Lu, H. 2010a. Image annotation using multi-correlation probabilistic matrix factorization. In Proceedings of MM. 1187--1190. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Li, Z., Liu, J., Zhu, X., and Lu, H. 2010b. Multi-modal multi-correlation person-centric news retrieval. In Proceedings of CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Li, Z., Wang, M., Liu, J., Xu, C., and Lu, H. 2011. News contextualization with geographic and visual information. In Proceedings of MM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Liu, S., Zhou, M. X., Pan, S., Song, Y., Qian, W., Cai, W., and Lian, X. 2012. Tiara: Interactive, topic-based visual text summarization and analysis. ACM Trans. Intell. Syst. Tech. 3, 2, 25--25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Lu, X., Pang, Y., Hao, Q., and Zhang, L. 2009. Visualizing textual travelogue with location relevant images. In Proceedings of International Workshop on Location Based Social Networks. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Martins, B. 2009. Geographically aware web textmining. Ph.D. thesis, University of Lisbon.Google ScholarGoogle Scholar
  29. McGurk, H. and MacDonald, J. 1976. Hearing lips and seeing voices. Nature 264, 5588, 746--748.Google ScholarGoogle Scholar
  30. Ohtsuki, K., Bessho, K., Matsuo, Y., Matsunaga, S., and Hayashi, Y. 2006. Automatic multimedia indexing: Combining audio, speech, and visual information to index broadcast news. IEEE Signal Process. Mag. 23, 2, 69--78.Google ScholarGoogle ScholarCross RefCross Ref
  31. Okuoka, T., Takahashi, T., Deguchi, D., Ide, I., and Murase, H. 2009. Labeling news topic threads with Wikipedia entries. In Proceedings of the IEEE International Symposium on Multimedia. 501--504. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Olivares, X., Ciaramita, M., and van Zwol, R. 2008. Boosting image retrieval through aggregating search results based on visual annotations. In Proceedings of MM. 189--198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Page, L., Brin, S., Motwani, R., and Winograd, T. 1999. The PageRank citation ranking: Bringing order to the web. Tech. rep. Stanford Digital Library Technologies Project.Google ScholarGoogle Scholar
  34. Rother, C., Bordeaux, L., Hamadi, Y., and Autocollage, A. B. 2006. AutoCollage. In Proceedings of SIGGRAPH. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Salakhutdinov, R. and Mnih, A. 2007. Probabilistic matrix factorization. In Proceedings of NIPS. 1257--1264.Google ScholarGoogle Scholar
  36. Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D. A., and Deussen, O. 2009. Document cards: A top trumps visualization for documents. IEEE Trans. Visual. Comput. Graph. 15, 6, 1145--1152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Sturm, J. F. 2009. Site matters: The value of local newspaper web sites. Tech. rep., NAA.Google ScholarGoogle Scholar
  38. Teevan, J., Cutrell, E., Fisher, D., Drucker, S. M., Ramos, G., Andre, P., and Hu, C. 2009. Visual snippets: Summarizing web pages for search and revisitation. In Proceedings of International Conference on Human Factors in Computing Systems. 2023--2032. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Wang, B., Li, Z., Li, M., and Ma, W.-Y. 2006a. Large-scale duplicate detection for web image search. In Proceedings of ICME. 353--356.Google ScholarGoogle Scholar
  40. Wang, J., Quan, L., Sun, J., Tang, X., and Shum, H.-Y. 2006b. Picture collage. In Proceedings of CVPR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Yan, R. and Hauptmann, A. G. 2003. The combination limit in multimedia retrieval. In Proceedings of MM. 339--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Zhang, L., Chen, L., Jing, F., Deng, K., and Ma, W.-Y. 2006. Enjoyphoto—A verticcal image search engine for enjoying high-quality photos. In Proceedings of MM. 367--376. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Zhao, R. and Grosky, W. I. 2002. Narrowing the semantic gap—Improved text-based web document retrieval using visual features. ACM Trans. on Multimedia 4, 2, 189--200. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Zong, W., Wu, D., Sun, A., Lim, E.-P., and Goh, D. H.-L. 2005. On assigning place names to geography related web pages. In Proceedings of ACM/IEEE-CS Joint Conference on Digital Libraries. ACM, New York, USA, 354--362. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enhancing news organization for convenient retrieval and browsing

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 10, Issue 1
        December 2013
        166 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/2559928
        Issue’s Table of Contents

        Copyright © 2013 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 27 December 2013
        • Accepted: 1 March 2013
        • Revised: 1 September 2012
        • Received: 1 June 2012
        Published in tomm Volume 10, Issue 1

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!