DOI: 10.1145/1367497.1367501
research-article

Improving relevance judgment of web search results with image excerpts

Published: 21 April 2008

ABSTRACT

Current web search engines return result pages that consist mostly of text summaries, even though the matched web pages may contain informative pictures. For each returned page, a text excerpt (i.e., a snippet) is generated by selecting keywords around the matched query terms, providing context for the user's relevance judgment. However, we found that in many scenarios the pictures in web pages, if selected properly, can be added to search result pages and provide a richer contextual description, because a picture is worth a thousand words. We call this new kind of summary an image excerpt. Through a well-designed user study, we demonstrate that image excerpts help users make much quicker relevance judgments of search results for a wide range of query types. To implement this idea, we propose a practical approach that automatically generates image excerpts for result pages by considering the dominance of each picture in its web page and the relevance of the picture to the query. We also outline an efficient way to incorporate image excerpts into web search engines: they can adopt our approach by slightly modifying their index and inserting a few low-cost operations into their workflow. Our experiments on a large web dataset indicate that the performance of the proposed approach is very promising.
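
The abstract names two signals for choosing an image excerpt, the dominance of a picture within its page and its relevance to the query, but does not say how either is computed or combined. The following is a minimal Python sketch under stated assumptions: dominance is approximated by a picture's share of the total image area on the page, relevance by query-term overlap with the text around the picture, and the two are multiplied. The `PageImage` fields, the `min_score` threshold, and all function names are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PageImage:
    url: str
    area_px: int           # rendered area in pixels (assumed dominance cue)
    surrounding_text: str  # alt text plus nearby text (assumed relevance cue)


def dominance(image: PageImage, page_images: List[PageImage]) -> float:
    """Assumed proxy: the image's share of all image area on the page."""
    total = sum(img.area_px for img in page_images)
    return image.area_px / total if total else 0.0


def relevance(image: PageImage, query: str) -> float:
    """Assumed proxy: fraction of query terms found in the surrounding text."""
    terms = set(query.lower().split())
    if not terms:
        return 0.0
    words = set(image.surrounding_text.lower().split())
    return len(terms & words) / len(terms)


def pick_image_excerpt(
    page_images: List[PageImage], query: str, min_score: float = 0.05
) -> Optional[PageImage]:
    """Return the highest-scoring image, or None if no score exceeds min_score."""
    best, best_score = None, min_score
    for img in page_images:
        # Multiplying the signals is an illustrative choice, not the paper's method.
        score = dominance(img, page_images) * relevance(img, query)
        if score > best_score:
            best, best_score = img, score
    return best


page = [
    PageImage("logo.png", 1000, "site logo"),
    PageImage("eiffel.jpg", 9000, "eiffel tower at night paris"),
    PageImage("ad.gif", 5000, "cheap flights"),
]
chosen = pick_image_excerpt(page, "eiffel tower")
print(chosen.url if chosen else "text snippet only")
```

Returning `None` when no picture clears the threshold lets a result fall back to the ordinary text snippet, which matches the abstract's condition that pictures help only "if selected properly".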

          Published in

          WWW '08: Proceedings of the 17th international conference on World Wide Web
          April 2008
          1326 pages
          ISBN: 9781605580852
          DOI: 10.1145/1367497

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Acceptance Rates

          Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
