Abstract
Map search engines, such as Google Maps, Yahoo! Maps, and Microsoft Live Maps, allow users to explicitly specify a target geographic location, either in keywords or on the map, and to search businesses, people, and other information of that location. In this article, we report a first study on a million-entry map search log. We identify three key attributes of a map search record—the keyword query, the target location and the user location, and examine the characteristics of these three dimensions separately as well as the associations between them. Comparing our results with those previously reported on logs of general search engines and mobile search engines, including those for geographic queries, we discover the following unique features of map search: (1) People use longer queries and modify queries more frequently in a session than in general search and mobile search; People view fewer result pages per query than in general search; (2) The popular query topics in map search are different from those in general search and mobile search; (3) The target locations in a session change within 50 kilometers for almost 80% of the sessions; (4) Queries, search target locations and user locations (both at the city level) all follow the power law distribution; (5) One third of queries are issued for target locations within 50 kilometers from the user locations; (6) The distribution of a query over target locations appears to follow the geographic location of the queried entity.
- Andrade, L. and Silva, M. 2006. Relevance ranking for geographic IR. In Proceedings of the SIGIR Workshop on Geographical Information Retrieval.Google Scholar
- Arya, S. and Mount, D. 1993. Approximate nearest neighbor queries in fixed dimensions. In Proceedings of the 4th Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, Philadelphia, PA, 271--280. Google Scholar
Digital Library
- Blank, A. and Solomon, S. 2000. Power Laws and Cities Population. Arxiv preprint cond-mat/0003240.Google Scholar
- Broder, A. 2002. A taxonomy of web search. ACM Sigir Forum. 36, 10. Google Scholar
Digital Library
- Clauset, A., Shalizi, C., and Newman, M. 2007. Power-law distributions in empirical data. arxiv 706.Google Scholar
- Cohen, J., Cohen, P., West, S., and Aiken, L. 1983. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences. Erlbaum, Hillsdale, NJ.Google Scholar
- Corder, G. and Foreman, D. 2009. Nonparametric Statistics for Non-Statisticians: A Step-by-step Approach. Wiley-Blackwell.Google Scholar
- Delboni, T., Borges, K., and Laender, A. 2005. Geographic web search based on positioning expressions. In Proceedings of the Workshop on Geographic Information Retrieval. ACM, New York, 61--64. Google Scholar
Digital Library
- Henrich, A. and Luedecke, V. 2007. Characteristics of geographic information needs. In Proceedings of the 4th ACM Workshop on Geographical Information Retrieval. ACM, New York, 1--6. Google Scholar
Digital Library
- Jansen, B. and Spink, A. 2006. How are we searching the World Wide Web? A comparison of nine search engine transaction logs. Inform. Process. Manage. 42, 1, 248--263. Google Scholar
Digital Library
- Jansen, B., Spink, A., and Pederson, J. 2005. Trend analysis of AltaVista Web searching. J. Amer. Soc. Inform. Sci. Techn. 56, 6, 559--570. Google Scholar
Digital Library
- Jones, R., Zhang, V. W., Rey, B., Jhala, P., and Stipp, E. 2008. Geographic intention and modification in web search. Int. J. Geograph. Inform. Sci. 22, 3, 229--246. Google Scholar
Digital Library
- Kamvar, M. and Baluja, S. 2006. A large scale study of wireless search behavior: Google mobile search. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, New York, 701--709. Google Scholar
Digital Library
- Lee, U., Liu, Z., and Cho, J. 2005. Automatic identification of user goals in Web search. In Proceedings of the 14th International Conference on World Wide Web. ACM, New York, 391--400. Google Scholar
Digital Library
- MetaCarta. Geosearch news: http://geosearch.metacarta.com/.Google Scholar
- Mitzenmacher, M. 2004. A brief history of generative models for power law and lognormal distributions. Inter, Math. 1, 2, 226--251.Google Scholar
Cross Ref
- Mount, D. M. and Arya, S. Aug 4, 2006. Ann—approximate nearest neighbor library. http://www.cs.umd.edu/mount/ANN/.Google Scholar
- Nguyen, B. V. and Kan, M.-Y. 2007. Functional faceted web query analysis. In Proceedings of the WWW'07 Workshop on Query Log Analysis.Google Scholar
- Pasley, R., Clough, P., Purves, R. S., and Twaroch, F. A. 2008. Mapping geographic coverage of the web. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS'08). ACM, New York, 1--9. Google Scholar
Digital Library
- Pramudiono, I., Shintani, T., Takahashi, K., and Kitsuregawa, M. 2002. User behavior analysis of location aware search engine. In Proceedings of the 3rd International Conference on Mobile Data Management. 139--145. Google Scholar
Digital Library
- Rose, D. and Levinson, D. 2004. Understanding user goals in web search. In Proceedings of the 13th International Conference on the World Wide Web. ACM, Press, New York, 13--19. Google Scholar
Digital Library
- Sanderson, M. and Kohler, J. 2004. Analyzing geographic queries. In Proceedings of the SIGIR Workshop on Geographic Information Retrieval.Google Scholar
- Santos, D. and Chaves, M. 2006. The place of place in geographical IR. In Proceedings of the 3rd Workshop on Geographic Information Retrieval (SIGIR). 6, 5--8.Google Scholar
- Shen, D., Pan, R., Sun, J., Pan, J., Wu, K., Yin, J., and Yang, Q. 2005. Q2C@UST: Our winning solution to query classification in KDDCUP 2005. ACM SIGKDD Explor. Newsl., 100--110. Google Scholar
Digital Library
- Silverstein, C., Henzinger, M., Marais, H., and Moricz, M. 1999. Analysis of a very large Web search engine query log. SIGIR Forum 33, 1, 6--12. Google Scholar
Digital Library
- Spink, A. and Jansen, B. 2004. Web Search: Public Searching on the Web. Springer. Google Scholar
Digital Library
- Spink, A., Jansen, B., Wolfram, D., and Saracevic, T. 2002. From e-sex to e-commerce: Web search changes. Comput. 35, 3, 107--109. Google Scholar
Digital Library
- Spink, A., Wolfram, D., Jansen, M., and Saracevic, T. 2001. Searching the Web: The public and their queries. J. Amer. Soci. Inform. Sci. Techn. 52, 3, 226--234. Google Scholar
Digital Library
- Takahashi, K., Pramudiono, I., and Kitsuregawa, M. 2005. Geo-word centric association rule mining. In Proceedings of the 6th International Conference on Mobile Data Management. ACM, New York, 273--280. Google Scholar
Digital Library
- Teitler, B., Lieberman, M., Panozzo, D., Sankaranarayanan, J., Samet, H., and Sperling, J. 2008. NewsStand: A new view on news. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, New York. Google Scholar
Digital Library
- van Rijsbergen, C. 1979. Information Retrieval. Butterworth. Google Scholar
Digital Library
- Vapnik, V. 2000. The Nature of Statistical Learning Theory. Springer. Google Scholar
Digital Library
- Wang, C., Xie, X., Wang, L., Lu, Y., and Ma, W. 2005. Detecting geographic locations from web resources. In Proceedings of the Workshop on Geographic Information Retrieval. ACM, New York, 17--24. Google Scholar
Digital Library
- Xiao, X., Xie, X., Luo, Q., and Ma, W.-Y. 2008. Density based co-location pattern discovery. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS'08). Google Scholar
Digital Library
- Yang, Y. and Pedersen, J. 1997. A comparative study on feature selection in text categorization. In Proceedings of the 14th International Conference on Machine Learning. 412--420. Google Scholar
Digital Library
- Zhang, V., Rey, B., Stipp, E., and Jones, R. 2006. Geomodification in query rewriting. In Proceedings of the 3th ACM Workshop on Geographical Information Retrieval.Google Scholar
Index Terms
A large-scale study on map search logs
Recommendations
Mining query subtopics from search log data
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalMost queries in web search are ambiguous and multifaceted. Identifying the major senses and facets of queries from search log data, referred to as query subtopic mining in this paper, is a very important issue in web search. Through search log analysis, ...
A large scale study of wireless search behavior: Google mobile search
CHI '06: Proceedings of the SIGCHI Conference on Human Factors in Computing SystemsWe present a large scale study of search patterns on Google's mobile search interface. Our goal is to understand the current state of wireless search by analyzing over 1 Million hits to Google's mobile search sites. Our study also includes the ...
Analysis of geographic queries in a search engine log
LOCWEB '08: Proceedings of the first international workshop on Location and the webGeography is becoming increasingly important in web search. Search engines can often return better results to users by analyzing features such as user location or geographic terms in web pages and user queries. This is also of great commercial value as ...






Comments