10.1145/2396761.2398567acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedings
short-paper

GTE: a distributional second-order co-occurrence approach to improve the identification of top relevant dates in web snippets

ABSTRACT

In this paper, we present an approach to identify top relevant dates in Web snippets with respect to a given implicit temporal query. Our approach is two-fold. First, we propose a generic temporal similarity measure called GTE, which evaluates the temporal similarity between a query and a date. Second, we propose a classification model to accurately relate relevant dates to their corresponding query terms and withdraw irrelevant ones. We suggest two different solutions: a threshold-based classification strategy and a supervised classifier based on a combination of multiple similarity measures. We evaluate both strategies over a set of real-world text queries and compare the performance of our Web snippet approach with a query log approach over the same set of queries. Experiments show that determining the most relevant dates of any given implicit temporal query can be improved with GTE combined with the second order similarity measure InfoSimba, the Dice coefficient and the threshold-based strategy compared to (1) first-order similarity measures and (2) the query log based approach.

References

  1. Alonso, O., Baeza-Yates, R., and Gertz, M. (2009). Effectiveness of Temporal Snippets. In WSSP'09 - WWW'09. Madrid, Spain.Google ScholarGoogle Scholar
  2. Alonso, O., Gertz, M., and Baeza-Yates, R. (2011). Enhancing Document Snippets Using Temporal Information. In SPIRE'1 Italy. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Berberich, K., Bedathur, S., Alonso, O., and Weikum, G. (2010). A Language Modeling Approach for Temporal Information Needs. In ECIR'10. UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Campos, R., Dias, G., Jorge, A. M & Nunes, C. (2012). Enriching Temporal Query Understanding through Date Identification: How to Tag Implicit Temporal Queries? In WWW-TWAW'12, 41--48. Lyon. France. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Campos, R., Jorge, A., & Dias, G. (2011). Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries. In SIGIR-QRU11, pp. 13--16. Beijing, China. July 28.Google ScholarGoogle Scholar
  6. Campos, R. (2011). http://www.ccc.ipt.pt/~ricardo/software.Google ScholarGoogle Scholar
  7. Dias, G., Alves, E., and Lopes, J. (2007). Topic Segmentation Algorithms for Text Summarization and Passage Retrieval: An Exhaustive Evaluation. In AAAI'07, 1334--1340. Canada. July 22-26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Freitag, D., Blume, M., Byrnes, J., Chow, E., Kapadia, S., Rohwer, R., et al. (2005). New Experiments in Distributional Representations of Synonymy. In CoNLL'05. Michigan, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kawai, H., Jatowt, A., Tanaka, K., Kunieda, K., & Yamada, K. (2010). ChronoSeeker: Search Engine for Future and Past Events. In ICUIMC'10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Machado, D., Barbosa, T., Pais, S., Martins, B and Dias, G. (2009). Universal Mobile Information Retrieval. In HCII'09. USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Metzler, D., Jones, R., Peng, F., and Zhang, R. (2009). Improving Search Relevance for Implicitly Temporal Queries. In SIGIR'09. Boston, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Pecina, P., and Schlesinger, P. (2006). Combining Association Measures for Collocation Extraction. In COLING/ACL'06. Sydney. Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Strötgen, J., Alonso, O., & Gertz, M. (2012). Identification of Top Relevant Temporal Expressions in Documents. In WWW-TWAW'12, 33--40. France. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. GTE

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!