skip to main content
article

Web site personalization based on link analysis and navigational patterns

Published:01 October 2007Publication History
Skip Abstract Section

Abstract

The continuous growth in the size and use of the World Wide Web imposes new methods of design and development of online information services. The need for predicting the users' needs in order to improve the usability and user retention of a Web site is more than evident and can be addressed by personalizing it. Recommendation algorithms aim at proposing “next” pages to users based on their current visit and past users' navigational patterns. In the vast majority of related algorithms, however, only the usage data is used to produce recommendations, disregarding the structural properties of the Web graph. Thus important—in terms of PageRank authority score—pages may be underrated. In this work, we present UPR, a PageRank-style algorithm which combines usage data and link analysis techniques for assigning probabilities to Web pages based on their importance in the Web site's navigational graph. We propose the application of a localized version of UPR (l-UPR) to personalized navigational subgraphs for online Web page ranking and recommendation. Moreover, we propose a hybrid probabilistic predictive model based on Markov models and link analysis for assigning prior probabilities in a hybrid probabilistic model. We prove, through experimentation, that this approach results in more objective and representative predictions than the ones produced from the pure usage-based approaches.

References

  1. Aktas, M. S., Nacar, M. A., and Menczer, F. 2004. Personalizing PageRank based on domain profiles. In Proceedings of the WEBKDD 2004 Workshop (Seattle, WA, Aug. 2004).Google ScholarGoogle Scholar
  2. Borges, J. and Levene, M. 2000. Data mining of user navigation patterns. In Revised Papers from the International Workshop on Web Usage Analysis and User Profiling. Lecture Notes in Computer Science, vol. 1836. Springer, Berlin, Germany, 92--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Borges, J. and Levene, M. 2004. A dynamic clustering-based Markov model for Web usage Mining. Technical Report. Available online at http://xxx.arxiv.org/abs/cs.IR/0406032.Google ScholarGoogle Scholar
  4. Borges, J. and Levene, M. 2006. Ranking pages by topology and popularity within Web sites. World Wide Web J. 9, 3 (Oct.), 301--316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. 30, 1--7, 107--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cadez, I. Heckerman, D., Meek, C., Smyth, P., and White, S. 2000. Visualization of navigation patterns on a Web site using model based clustering. In Proceedings of the ACM KDD2000 Conference (Boston, MA). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cadez, I., Gaffney, S., and Smyth, P. 2006. A general probabilistic framework for clustering individuals and objects. In Proceedings of the ACM KDD2000 Conference (Boston, MA). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Deshpande, M. and Karypis, G. 2001. Selective Markov models for predicting Web-page accesses. In Proceedings of the first SIAM International Conference on Data Mining.Google ScholarGoogle Scholar
  9. Eirinaki, M. 2004. Web mining: A roadmap. Technical Report. Available online at http://www.db-net.aueb.gr.Google ScholarGoogle Scholar
  10. Eirinaki, M. and Vazirgiannis, M. 2003. Web mining for Web personalization. ACM Trans. Internet Tech. 3, 1, 1--29. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Eirinaki, M., Vazirgiannis, M., and Kapogiannis, D. 2005. Web path recommendations based on page ranking and Markov models. In Proceedings of the Seventh ACM International Workshop on Web Information and Data Management (WIDM 2005, Bremen, Germany, November). Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Eirinaki, M., Vazirgiannis, M., and Varlamis, I. 2003. SEWeP: Using site semantics and a taxonomy to enhance the Web personalization process. In Proceedings of the ACM KDD2003 Conference (Washington, DC, August). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. El-Sayed, M., Ruiz, C., Rundesteiner, E. A. 2004. FS-Miner: Efficient and incremental mining of frequent sequence patterns in Web logs. In Proceedings of the Sixth ACM International Workshop on Web Information and Data Management (WIDM 2004, Washington, DC, November). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Haveliwala, T. 2002. Topic-sensitive PageRank. In Proceedings of the WWW2002 Conference (Hawaii, May). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Huang, Z., Li, X., and Chen, H. 2005. Link prediction approach to collaborative filtering. In Proceedings of ACM JCDL'05 (Colorado). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Kamvar, S. D., Haveliwala, T. H., and Golub, G. H. 2003a. Adaptive methods for the computation of PageRank. In Proceedings of the International Conference on the Numerical Solution of Markov Chains (September).Google ScholarGoogle Scholar
  17. Kamvar, S. D., Haveliwala, T. H., Manning, C. D., and Golub, G. H. 2003b. Extrapolation methods for accelerating PageRank computations. In Proceedings of the twelfth International World Wide Web Conference (WWW2003, May). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Kendall, M. and Gibbons, J. D. 1990. Rank correlation methods. Oxford University Press, Oxford, U.K.Google ScholarGoogle Scholar
  19. Levene, M. and Loizou, G. 2003. Computing the entropy of user navigation in the Web. Int. J. Inform. Tech. Decis. Mak. 2, 459--476.Google ScholarGoogle ScholarCross RefCross Ref
  20. Manavoglu, D., Pavlov, D., and Giles, C. L. 2003. Probabilistic user behaviour models. In Proceedings of ICDM 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Motwani, R. and Raghavan, P. 1995. Randomized algorithms. Cambridge University Press, Cambridge, U.K. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Nakagawa, M. and Mobasher, B. 2003. A hybrid Web personalization model based on site connectivity. In Proceedings of the Fifth WEBKDD Workshop (Washington, DC).Google ScholarGoogle Scholar
  23. Polyzotis, N. and Garofalakis, M. 2002. Structure and value synopses for XML data graphs. In Proceedings of the 28th VLDB Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Polyzotis, N., Garofalakis, M., and Ioannidis, Y. 2004. Approximate XML query answers. In Proceedings of SIGMOD 2004 (Paris, France, June). Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Richardson, M. and Domingos, P. 2002. The intelligent surfer: Probabilistic combination of link and content information in PageRank. Neur. Inform. Process. Syst. 14, 1441--1448.Google ScholarGoogle Scholar
  26. Sarukkai, R. R. 2000. Link prediction and path analysis using Markov chains. Comput. Netw. 33, 1--6, 337--386. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Sen, R. and Hansen, M. 2003. Predicting a Web user's next access based on log data. J. Comput. Graph. Stat. 12, 1, 143--155.Google ScholarGoogle ScholarCross RefCross Ref
  28. Spiliopoulou, M. and Faulstich, L. C. 1998. WUM: A Web utilization miner. In Proceedings of the First International Workshop on the Web and Databases (WebDB 1998 Spain, March).Google ScholarGoogle Scholar
  29. Zhao, Q. and Bhowmick, S. S. 2004. Mining history of changes to Web access patterns. In Proceedings of PKDD 2004 (Italy, September). Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Zhu, J., Hong, J., and Hughes, J. G. 2002. Using Markov models for Web site link prediction. In Proceedings of ACM HT'02 (Maryland). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Web site personalization based on link analysis and navigational patterns

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!