ABSTRACT
This paper describes a new paradigm for modeling traffic levels on the World Wide Web (WWW) using entropy maximization. The traffic is subject to the conservation conditions of a circulation flow over the entire WWW, an aggregation of the WWW, or a subgraph of the WWW (such as an intranet or extranet). We apply the primal and dual solutions of this model to the (static) ranking of web sites. The primal solution yields an imputed measure of total traffic through a web page; the dual solution provides an analogue of local "temperature", allowing us to quantify the "HOTness" of a page.
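The abstract does not give the model's equations, so the following is only a minimal sketch under stated assumptions: the flow y_ij on each link maximizes -sum(y_ij * log(y_ij)), inflow equals outflow at every page, and total flow is normalized to 1. Under those assumptions the optimum has the form y_ij proportional to x_i / x_j, which can be found by a simple balancing iteration. The function name `max_entropy_flow`, its parameters, and the toy graph are hypothetical, and the multipliers are returned as-is because the abstract does not say how they are converted into a HOTness score.

```python
import math

def max_entropy_flow(links, n, sweeps=500, tol=1e-12):
    """Sketch: maximum-entropy circulation on a strongly connected graph.

    links: list of directed edges (i, j), i != j, on pages 0..n-1.
    Returns (flow per link, total traffic per page, scaling factors x).
    Assumption: the optimal flow has the form y_ij = c * x_i / x_j.
    """
    out_nbrs = [[] for _ in range(n)]
    in_nbrs = [[] for _ in range(n)]
    for i, j in links:
        out_nbrs[i].append(j)
        in_nbrs[j].append(i)

    x = [1.0] * n
    for _ in range(sweeps):
        worst = 0.0
        for i in range(n):
            outflow = sum(x[i] / x[j] for j in out_nbrs[i])
            inflow = sum(x[k] / x[i] for k in in_nbrs[i])
            worst = max(worst, abs(outflow - inflow))
            # Rescale x_i so that page i's inflow and outflow both become
            # sqrt(inflow * outflow), i.e. page i is balanced.
            x[i] *= math.sqrt(inflow / outflow)
        if worst < tol:
            break

    # Normalize so that the total flow over all links is 1.
    flow = {(i, j): x[i] / x[j] for i, j in links}
    total = sum(flow.values())
    flow = {edge: f / total for edge, f in flow.items()}

    # Primal ranking quantity: total traffic leaving (equivalently entering) a page.
    traffic = [sum(flow[(i, j)] for j in out_nbrs[i]) for i in range(n)]
    return flow, traffic, x

# Toy three-page graph (hypothetical): 0 -> 1, 1 -> 2, 2 -> 0, 1 -> 0.
links = [(0, 1), (1, 2), (2, 0), (1, 0)]
flow, traffic, x = max_entropy_flow(links, 3)
# Multipliers of the conservation constraints, up to an additive constant.
multipliers = [-math.log(v) for v in x]
```

In this sketch `traffic` plays the role of the primal ranking quantity, while `multipliers` are the dual values attached to the conservation constraints. The iteration relies on the graph being strongly connected; a page with no inlinks or no outlinks would cause a division by zero.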
Index Terms
A new paradigm for ranking pages on the world wide web