Abstract
Although query log analysis provides crucial insights about Web users’ search interests, conducting such analyses is almost impossible for some languages, as large-scale and public query logs are quite scarce. In this study, we first survey the existing query collections in Turkish and discuss their limitations. Next, we adopt a novel strategy to obtain a set of Turkish queries using the query autocompletion services from the four major search engines and provide the first large-scale analysis of Web queries and their results in Turkish.
- Ismail Sengor Altingovde, Roi Blanco, Berkant Barla Cambazoglu, Rifat Ozcan, Erdem Sarigil, and Özgür Ulusoy. 2012. Characterizing Web search queries that match very few or no results. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM’12). 2000--2004. Google Scholar
Digital Library
- Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, Ophir Frieder, and David A. Grossman. 2007. Temporal analysis of a very large topically categorized Web query log. Journal of the Association for Information Science and Technology 58, 2, 166--178. Google Scholar
Digital Library
- Yiltan Bitirim, Yasar Tonta, and Hayri Sever. 2002. Information retrieval effectiveness of Turkish search engines. In Proceedings of the 2nd International Conference on Advances in Information Systems. 93--103. Google Scholar
Digital Library
- Andrei Broder. 2002. A taxonomy of Web search. SIGIR Forum 36, 2, 3--10. Google Scholar
Digital Library
- Fazli Can, Seyit Kocberber, Erman Balcik, Cihan Kaynak, Huseyin Cagdas Ocalan, and Onur M. Vursavas. 2008. Information retrieval on turkish texts. Journal of the Association for Information Science and Technology 59, 3 (2008), 407--421. Google Scholar
Digital Library
- Sergiu Chelaru, Ismail Sengor Altingovde, Stefan Siersdorfer, and Wolfgang Nejdl. 2013. Analyzing, detecting, and exploiting sentiment in Web queries. ACM Transactions on the Web 8, 1, 6. Google Scholar
Digital Library
- W. Bruce Croft, Donald Metzler, and Trevor Strohman. 2009. Search Engines: Information Retrieval in Practice. Pearson Education. Google Scholar
Digital Library
- Rabia Gulcin Demirci, Vildan Kismir, and Yiltan Bitirim. 2007. An evaluation of popular search engines on finding Turkish documents. In Proceedings of the International Conference on Internet and Web Applications and Services. 61. Google Scholar
Digital Library
- Jiafeng Guo, Gu Xu, Xueqi Cheng, and Hang Li. 2009. Named entity recognition in query. In Proceedings of the 32nd Annual International SIGIR Conference on Research and Development in Information Retrieval. 267--274. Google Scholar
Digital Library
- Greg Pass, Abdur Chowdhury, and Cayley Torgeson. 2006. A picture of search. In Proceedings of the 1st International Conference on Scalable Information Systems. Google Scholar
Digital Library
- Daniel E. Rose and Danny Levinson. 2004. Understanding user goals in Web search. In Proceedings of the 13th International Conference on World Wide Web. 13--19. Google Scholar
Digital Library
- Fabrizio Silvestri. 2010. Mining query logs: Turning search usage data into knowledge. Foundations and Trends in Information Retrieval 4, 1--2, 1--174. Google Scholar
Digital Library
- B. Tokgoz, Z. N. Ozcilnak, C. Cinar, M. T. Yalun, and Y. Bitirim. 2013. An evaluation of Turkish retrieval performance of popular search engines for Internet and image search by using common lists. In Proceedings of the Conference on Digital Information and Communication Technology and Its Applications. 148--153.Google Scholar
- Seyda Turker and Yiltan Bitirim. 2015. A study on the effect of using the wrongly spelled and/or pronounced Turkish words on Seb search engines. International Journal of Scientific Knowledge 5, 8, 1--7.Google Scholar
Index Terms
A “Suggested” Picture of Web Search in Turkish
Recommendations
A Large-Scale Query Spelling Correction Corpus
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information RetrievalWe present a new large-scale collection of 54,772 queries with manually annotated spelling corrections. For 9,170 of the queries (16.74%), spelling variants that are different to the original query are proposed. With its size, our new corpus is an order ...
Turkish Language Resources: Morphological Parser, Morphological Disambiguator and Web Corpus
GoTAL '08: Proceedings of the 6th international conference on Advances in Natural Language ProcessingIn this paper, we propose a set of language resources for building Turkish language processing applications. Specifically, we present a finite-state implementation of a morphological parser, an averaged perceptron-based morphological disambiguator, and ...
Resources for Turkish morphological processing
We present a set of language resources and tools--a morphological parser, a morphological disambiguator, and a text corpus--for exploiting Turkish morphology in natural language processing applications. The morphological parser is a state-of-the-art ...






Comments