
A peer-to-peer recommender system based on spontaneous affinities

Published: 23 February 2009

Abstract

Network analysis has proved very useful in many social and natural sciences, and Small World topologies in particular have been exploited in many application fields. In this article, we focus on P2P file sharing applications, where we study and analyze spontaneous communities of users. We define a family of structures, which we call “Affinity Networks” (or “Affinity Graphs”), that exhibit self-organized, interest-based clusters. Empirical evidence indicates that affinity networks are small worlds and show scale-free features. We build on this finding by introducing a proactive recommendation scheme, DeHinter, that exploits this natural feature. The intuition behind the scheme is that a user would trust her network of “elective affinities” more than anonymous, generic suggestions made by impersonal entities. The accuracy of the recommendations is evaluated by 10-fold cross-validation, and a prototype has been implemented to collect further feedback from users.
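
The small-world claim can be illustrated with a minimal sketch. A network is usually called a small world when its clustering coefficient is high while its average shortest-path length stays short. The abstract does not say how an affinity network is constructed, so the linking rule below (two users are connected when they share at least one file) is an assumption made purely for illustration, as are the function names and the toy data; none of this is taken from DeHinter.

```python
from collections import deque
from itertools import combinations


def build_affinity_graph(shared_files):
    """Link two users when their shared-file sets overlap.

    ASSUMPTION: this linking rule is illustrative only; the article's
    actual definition of an affinity network is not given in the abstract.
    """
    graph = {user: set() for user in shared_files}
    for u, v in combinations(shared_files, 2):
        if shared_files[u] & shared_files[v]:
            graph[u].add(v)
            graph[v].add(u)
    return graph


def average_clustering(graph):
    """Mean local clustering coefficient (nodes with < 2 neighbours count as 0)."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        # Count edges that exist among this node's neighbours.
        links = sum(1 for a, b in combinations(nbrs, 2) if b in graph[a])
        total += 2.0 * links / (k * (k - 1))
    return total / len(graph)


def average_path_length(graph):
    """Mean shortest-path length over all connected ordered pairs, using BFS."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for nbr in graph[node]:
                if nbr not in dist:
                    dist[nbr] = dist[node] + 1
                    queue.append(nbr)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


# Hypothetical toy data: user -> set of shared file identifiers.
shared = {
    "alice": {"a", "b"},
    "bob": {"b", "c"},
    "carol": {"a", "c"},
    "dave": {"c", "d"},
}
graph = build_affinity_graph(shared)
print(average_clustering(graph), average_path_length(graph))
```

In practice both values would be compared against a random graph with the same number of nodes and edges: markedly higher clustering at a similar path length is the usual small-world signature.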

