ABSTRACT
The problem of measuring "similarity" of objects arises in many applications, and many domain-specific measures have been developed, e.g., matching text across documents or computing overlap among item-sets. We propose a complementary approach, applicable in any domain with object-to-object relationships, that measures similarity of the structural context in which objects occur, based on their relationships with other objects. Effectively, we compute a measure that says "two objects are similar if they are related to similar objects." This general similarity measure, called SimRank, is based on a simple and intuitive graph-theoretic model. For a given domain, SimRank can be combined with other domain-specific similarity measures. We suggest techniques for efficient computation of SimRank scores, and provide experimental results on two application domains showing the computational feasibility and effectiveness of our approach.
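The recursive intuition in the abstract can be illustrated with a small sketch. The code below is a naive fixed-point iteration of the standard SimRank recurrence, in which the score of a pair of distinct nodes is a decayed average of the scores of their in-neighbor pairs. It is not one of the efficient techniques the abstract refers to. The function name `simrank`, the `in_neighbors` input format, the decay value 0.8, and the iteration count are illustrative assumptions, not details taken from the abstract.

```python
from itertools import product


def simrank(in_neighbors, decay=0.8, iterations=5):
    """Naive iterative SimRank over a directed graph (illustrative sketch).

    in_neighbors maps every node to the list of nodes that point to it.
    Every node that appears in a list must also appear as a key.
    """
    nodes = list(in_neighbors)
    # Start from the identity: a node is fully similar to itself only.
    scores = {(a, b): 1.0 if a == b else 0.0 for a, b in product(nodes, nodes)}
    for _ in range(iterations):
        updated = {}
        for a, b in product(nodes, nodes):
            if a == b:
                updated[(a, b)] = 1.0
                continue
            ins_a, ins_b = in_neighbors[a], in_neighbors[b]
            if not ins_a or not ins_b:
                # A node with no in-neighbors has no structural context.
                updated[(a, b)] = 0.0
                continue
            # Average similarity over all pairs of in-neighbors, then decay.
            total = sum(scores[(x, y)] for x, y in product(ins_a, ins_b))
            updated[(a, b)] = decay * total / (len(ins_a) * len(ins_b))
        scores = updated
    return scores


# Hypothetical toy graph: "hub" points to both "a" and "b".
graph = {"hub": [], "a": ["hub"], "b": ["hub"]}
scores = simrank(graph)
print(scores[("a", "b")])
```

In this toy graph, "a" and "b" come out similar because they are pointed to by the same node, while "hub" has no in-neighbors and so scores zero against every other node. The sketch stores a score for every ordered pair, so it is only practical for small graphs.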
SimRank: a measure of structural-context similarity