skip to main content
10.1145/383952.384003acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Stable algorithms for link analysis

Published:01 September 2001Publication History

ABSTRACT

The Kleinberg HITS and the Google PageRank algorithms are eigenvector methods for identifying ``authoritative'' or ``influential'' articles, given hyperlink or citation information. That such algorithms should give reliable or consistent answers is surely a desideratum, and in~\cite{ijcaiPaper}, we analyzed when they can be expected to give stable rankings under small perturbations to the linkage patterns. In this paper, we extend the analysis and show how it gives insight into ways of designing stable link analysis methods. This in turn motivates two new algorithms, whose performance we study empirically using citation data and web hyperlink data.

References

  1. 1.B. Amento, L. G. Terveen, and W. C. Hill. Does "authority" mean quality? Predicting expert quality ratings of web documents. In Proc. 23rd Annual Intl. ACM SIGIR Conference, pages 296-303. ACM, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hyperlinked environment. In Proc. 21st Annual Intl. ACM SIGIR Conf., pages 104-111. ACM, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.S. Brin and L. Page. The anatomy of a large-scale hypertextual (Web) search engine. In The Seventh International World Wide Web Conference, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.Fan R. K. Chung. Spectral Graph Theory. American Mathematical Society, 1994.Google ScholarGoogle Scholar
  5. 5.D. Cohn and H. Chang. Probabilistically identifying authoritative documents. In Proc. 17th International Conference on Machine Learning, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391-407, 1990.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.G. H. Golub and C. F. Van Loan. Matrix Computations. Johns Hopkins Univ. Press, 1996.Google ScholarGoogle Scholar
  8. 8.J. Kleinberg. Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.A. McCallum, K. Nigam, J. Rennie, and K. Seymore. Automating the contruction of Internet portals with machine learning. Information Retrieval Journal, 3:127-163, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.A. Y. Ng, A. X. Zheng, and M. I. Jordan. Link analysis, eigenvectors, and stability. In Proc. 17th International Joint Conference on Artificial Intelligence, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.F. Osareh. Bibliometrics, citation analysis and co-citation analysis: A review of literature I. Libri, 46:149-158, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  12. 12.L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Unpublished Manuscript, 1998.Google ScholarGoogle Scholar
  13. 13.C. Papadimitriou, P. Raghavan, H. Tamaki, and S. Vempala. Latent semantic indexing: A probabilistic analysis. In Proc. SIGMODS/PODS, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.Davood Rafiei and Alberto Mendelzon. What is this Page Known for? Computing Web Page Reputations. In Proc. WWW9 Conference, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.G. W. Stewart and Ji-Guang Sun. Matrix Perturbation Theory. Academic Press, 1990.Google ScholarGoogle Scholar

Index Terms

  1. Stable algorithms for link analysis

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
              September 2001
              454 pages
              ISBN:1581133316
              DOI:10.1145/383952

              Copyright © 2001 ACM

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 September 2001

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              SIGIR '01 Paper Acceptance Rate47of201submissions,23%Overall Acceptance Rate792of3,983submissions,20%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader
            About Cookies On This Site

            We use cookies to ensure that we give you the best experience on our website.

            Learn more

            Got it!