Abstract
Network analysis has proved to be very useful in many social and natural sciences, and in particular Small World topologies have been exploited in many application fields. In this article, we focus on P2P file sharing applications, where spontaneous communities of users are studied and analyzed. We define a family of structures that we call “Affinity Networks” (or even Graphs) that show self-organized interest-based clusters. Empirical evidence proves that affinity networks are small worlds and shows scale-free features. The relevance of this finding is augmented with the introduction of a proactive recommendation scheme, namely DeHinter, that exploits this natural feature. The intuition behind this scheme is that a user would trust her network of “elective affinities” more than anonymous and generic suggestions made by impersonal entities. The accuracy of the recommendation is evaluated by way of a 10-fold cross validation, and a prototype has been implemented for further feedbacks from the users.
- Abello, J., Buchsbaum, A. L., and Westbrook, J. 1998. A functional approach to external graph algorithms. In Proceedings of the European Symposium on Algorithms. 332--343. Google Scholar
Digital Library
- Achacoso, T. B. and Yamamoto, W. S. 1991. AYs Neuroanatomy of C Elegans for Computation. CRC-Press.Google Scholar
- Adamic, L. A. and Huberman, B. A. 2000. The nature of markets in the world wide web. Quarterly J. Electron. Commerce 1, 512.Google Scholar
- Albert, R. and Barabasi, A. L. 2002. Statistical mechanics of complex networks. Rev. Modern Physics 74, 1.Google Scholar
Cross Ref
- Balabanović, M. and Shoham, Y. 1997. Fab: content-based, collaborative recommendation. Comm. ACM 40, 3 66--72. Google Scholar
Digital Library
- Barabási, A.-L. 2003. Linked: How Everything Is Connected to Everything Else and What It Means for Business, Science, and Everyday Life. Plume Books. Google Scholar
Digital Library
- Barabási, A.-L. and Albert, R. 1999. Emergence of scaling in random networks. Science 286, 509.Google Scholar
Cross Ref
- Barford, P., Bestavros, A., Bradley, A., and Crovella, M. 1999. Changes in web client access patterns: Characteristics and caching implications. World Wide Web 2, 1-2, 15--28. Google Scholar
Digital Library
- Bhalla, U. S. and Iyengar, R. 1999. Emergent properties of networks of biological signaling pathways. Science 283, 381--387.Google Scholar
Cross Ref
- Breslau, L., Cao, P., Fan, L., Phillips, G., and Shenker, S. 1999. Web caching and zipf-like distributions: Evidence and implications. In Proceedings of INFOCOM. 126--134.Google Scholar
- Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., and Wiener, J. 2000. Graph structure in the web. Comput. Netw. 33, 309--320. Google Scholar
Digital Library
- Claypool, M., Gokhale, A., Miranda, T., Murnikov, P., Netes, D., and Sartin, M. 1999. Combining content-based and collaborative filters in an online newspaper. In Proceedings of the ACM SIGIR Workshop on Recommender Systems: Algorithms and Evaluation. ACM.Google Scholar
- Cohen, J. E., Briand, F., and Newman, C. M. 1986. A stochastic theory of community food webs III. Predicted and observed lengths of food. Royal Soc. London Proc. Series B 228, 317--353.Google Scholar
- Cox, R. A. K., Felton, J. M., and Chung, K. C. 1995. The concentration of commercial success in popular music: an analysis of the distribution of gold records. J. Cultural Economics 19, 333--340.Google Scholar
Cross Ref
- Crovella, M. E. and Bestavros, A. 1997. Self-similarity in World Wide Web traffic: Evidence and possible causes. IEEE/ACM Trans. Netw. 5, 6, 835--846. Google Scholar
Digital Library
- de Sola Pool, I. and Kochen, M. 1978. Contacts and influence. Social Netw. 1, 1--48.Google Scholar
Cross Ref
- DeRoure, D., Hall, W., Reich, S., Hill, G., Pikrakis, A., and Stairmand, M. 2001. MEMOIR—an open framework for enhanced navigation of distributed information. Inf. Process. Manage. 37, 1. Google Scholar
Digital Library
- deSolla Price, D. J. 1967. Networks of scientific papers. Science 155, 3767, 1213--1219.Google Scholar
- Erdős, P. and Rényi, A. 1959. On random graphs. Publicationes Mathematicae 6.Google Scholar
- Erdős, P. and Rényi, A. 1960. On the evolution of random graphs. Publications of the Mathematical Institute of the Hungarian Academy of Sciences 5.Google Scholar
- Erdős, P. and Rényi, A. 1961. On the strength of connectedness of a random graph. Acta Mathematica Scientia Hungary 12.Google Scholar
- Estoup, J. B. 1916. Les gammes stenographiques. Institut Stenographique de France.Google Scholar
- Faloutsos, M., Faloutsos, P., and Faloutsos, C. 1999. On power-law relationships of the internet topology. In Proceedings of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM'99). Vol. 29. ACM Press, New York, NY, 251--262. Google Scholar
Digital Library
- Goldberg, D., Nichols, D., Oki, B. M., and Terry, D. 1992. Using collaborative filtering to weave an information tapestry. Comm. ACM 35, 12, 61--70. Google Scholar
Digital Library
- Gutenberg, B. and Richter, R. F. 1944. Frequency of earthquakes in california. Bul. Seismological Soc. Amer. 34, 185--188.Google Scholar
- Han, P., Xie, B., Yang, F., and Shen, R. 2004. A scalable p2p recommender system based on distributed collaborative filtering. Expert Syst. Appl. 27, 2, 203--210.Google Scholar
Cross Ref
- Hartwell, L. H., Hopfield, J. J., Leibler, S., and Murray, A. W. 1999. From molecular to modular cell biology. Nature 402, 6761 Suppl.Google Scholar
- Herlocker, J. L., Konstan, J. A., and Riedl, J. 2000. Explaining collaborative filtering recommendations. In Proceedings of the ACM Conference on Computer Supported Cooperative Work (CSCW'00). ACM Press, 241--250. Google Scholar
Digital Library
- Howe, A. E. and Dreilinger, D. 1997. SAVVYSEARCH: A metasearch engine that learns which search engines to query. AI Mag. 18, 2, 19--25.Google Scholar
- Iamnitchi, A., Ripeanu, M., and Foster, I. 2004. Small-world file-sharing communities. In The 23rd Conference of the IEEE Communications Society (INFOCOM'04). Vol. 2. 952--963.Google Scholar
- Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N., and Barabási, A. L. 2000. The large-scale organization of metabolic networks. Nature 407, 6804, 651--654.Google Scholar
- Karinthy, F. 1929. Chains. Everything is Different. Atheneum Press.Google Scholar
- Kleinberg, J. M. 2000. The small-world phenomenon: An algorithmic perspective. In Proceedings of the 32nd ACM Symposium on Theory of Computing. Google Scholar
Digital Library
- Kleinfeld, J. S. 2001. Could it be a big world after all? the “six degrees of separation” myth. Society.Google Scholar
- Klingberg, T. and Manfredi, R. 2002. Gnutella protocol development. http://rfc-gnutella.sourceforge.net/src/rfc-0_6-draft.html (last access:1/13/09).Google Scholar
- Kohli, R. and Sah, R. 2003. Market shares: Some power law results and observations. Working paper 04.01, School of Public Policy, University of Chicago.Google Scholar
- Kohn, K. W. 1999. Molecular interaction map of the mammalian cell cycle control and DNA repair systems. Mol. Biol. Cell 10, 8, 2703--2734.Google Scholar
Cross Ref
- Konstan, J. A., Miller, B. N., Maltz, D., Herlocker, J. L., Gordon, L. R., and Riedl, J. 1997. Grouplens: applying collaborative filtering to usenet news. Comm. ACM 40, 3, 77--87. Google Scholar
Digital Library
- Krulwich, B. 1997. Lifestyle finder: Intelligent user profiling using large-scale demographic data. AI Maga. 18, 2, 37--45.Google Scholar
- Lang, K. 1995. NewsWeeder: learning to filter netnews. In Proceedings of the 12th International Conference on Machine Learning. Morgan Kaufmann Publishers Inc. 331--339.Google Scholar
Cross Ref
- Leibowitz, N., Ripeanu, M., and Wierzbicki, A. 2003. Deconstructing the kazaa network. In Proceedings of the 3rd IEEE Workshop on Internet Applications. IEEE Press. Google Scholar
Digital Library
- Lotka, A. J. 1926. The frequency distribution of scientific production. J. Wash. Acad. Sci. 16, 317--323.Google Scholar
- Lu, E. T. and Hamilton, R. J. 1991. Avalanches and the distribution of solar flares. Astrophysical J. 380, L89--L92.Google Scholar
Cross Ref
- Milgram, S. 1967. The small world problem. Psych. Today 2, 60--67.Google Scholar
- Miller, B. N., Konstan, J. A., and Riedl, J. 2004. Pocketlens: Toward a personal recommender system. ACM Trans. Inform. Syst. 22, 3, 437--476. Google Scholar
Digital Library
- Monasson, R. 1999. Diffusion, localization and dispersion relations on “small-world” lattices. European Physical J. B 12, 4, 555--567.Google Scholar
Cross Ref
- Montaner, M., López, B., and De La Rosa, J. L. 2003. A taxonomy of recommender agents on theinternet. AI. Rev. 19, 4, 285--330. Google Scholar
Digital Library
- Neukum, G. and Ivanov, B. A. 1994. Crater size distributions and impact probabilities on earth from lunar, terrestrial-planet, and asteroid cratering Data. In Hazards Due to Comets and Asteroids, T. Gehrels, M. S. Matthews, and A. M. Schumann, Eds. The University of Arizona Press, 359--416.Google Scholar
- Newman, M. E. 2001. The structure of scientific collaboration networks. Proc. Nat. Acad. Sci. 98, 2, 404--409.Google Scholar
Cross Ref
- Newman, M. E. J. 2003. The structure and function of complex networks. SIAM Rev. 45, 167.Google Scholar
Digital Library
- Newman, M. E. J. 2005. Power laws, pareto distributions and zipf's law. Contemp. Physics 46, 323.Google Scholar
Cross Ref
- Newman, M. E. J. and Watts, D. J. 1999. Renormalization group analysis of the small-world network model. Physics Lett. A 263, 341--346.Google Scholar
Cross Ref
- Oka, T., Morikawa, H., and Aoayama, T. 2004. Vineyard: A collaborative filtering service platform in distributed environment. In Proceedings of the IEEE/IPSJ Symposium on Applications and the Internet Workshops. Google Scholar
Digital Library
- Pagallo, U. 2006. Teoria giundica della Complessità. Dalla “Polis primitiva” di Socrate ai “mondi piccoli” dellinformatica—Un approccio evolutivo. Giapichelli, Torino, Italy.Google Scholar
- Pazzani, M. J. 1999. A framework for collaborative, content-based and demographic filtering. AI. Rev. 13, 5-6, 393--408. Google Scholar
Digital Library
- Pazzani, M. J., Muramatsu, J., and Billsus, D. 1996. Syskill webert: Identifying interesting web sites. In Proceedings of AAAI/IAAI, Vol. 1. 54--61.Google Scholar
- Phex Team. 2003. Phex file-sharing gnutella client. http://www.phex.org/mambo/(last access: 1/13/09).Google Scholar
- Pinkerton, B. 2000. Webcrawler: Finding what people want. Ph.D. thesis, University of Washington. Google Scholar
Digital Library
- Popescul, A., Ungar, L. H., Pennock, D. M., and Lawrence, S. 2001. Probabilistic models for unified collaborative and content-based recommendation in sparse-data environments. In Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence (UAI'01). Morgan Kaufmann Publishers Inc., San Francisco, CA, 437--444. Google Scholar
Digital Library
- Redner, S. 1998. How popular is your paper? An empirical study of the citation distribution. European Physical J. B 4, 131.Google Scholar
Cross Ref
- Rekhter, Y., Moskowitz, B., Karrenberg, D., de Groot, G. J., and Lear, E. 1996. Address allocation for private internets. RFC 1918, Internet Engineering Task Force. Google Scholar
Digital Library
- Resnick, P. and Varian, H. R. 1997. Recommender systems—introduction to the special section. Comm. ACM 40, 3, 56--58. Google Scholar
Digital Library
- Rich, E. 1979. User modeling via stereotypes. Cognitive Sci. 3, 329--354.Google Scholar
Cross Ref
- Roberts, D. C. and Turcotte, D. L. 1998. Fractality and selforganized criticality of wars. Fractals 6, 351--357.Google Scholar
Cross Ref
- Ruffo, G. and Schifanella, R. 2007. Evaluating peer-to-peer recommender systems that exploit spontaneous affinities. In Proceedings of the ACM Symposium on Applied Computing (SAC'07). ACM, New York, NY 1574--1578. Google Scholar
Digital Library
- Ruffo, G., Schifanella, R., and Ghiringhello, E. 2006. A decentralized recommendation system based on self-organizing partnerships. Lecture Notes in Computer Science, vol. 3976. Springer, 618--629.Google Scholar
- Sarwar, B. M., Konstan, J. A., Borchers, A., Herlocker, J., Miller, B., and Riedl, J. 1998. Using filtering agents to improve prediction quality in the grouplens research collaborative filtering system. In Proceedings of the ACM Conference on Computer Supported Cooperative Work (CSCW'98). ACM Press, New York, NY, 345--354. Google Scholar
Digital Library
- Schifanella, R., Panisson, A., Gena, C., and Ruffo, G. 2008. Mobhinter: epidemic collaborative filtering and self-organization in mobile ad-hoc networks. In Proceedings of the ACM Conference on Recommender Systems (RecSys'08). ACM, New York, NY, 27--34. Google Scholar
Digital Library
- Seglen, P. O. 1992. The skewness of science. J. Amer. Soc. Inform. Sci. 43, 9, 628--638.Google Scholar
Cross Ref
- Shardanand, U. and Maes, P. 1995. Social information filtering: Algorithms for automating “word of mouth”. In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI'95). ACM, 210--217. Google Scholar
Digital Library
- Sripanidkulchai, K., Maggs, B., and Zhang, H. 2003. Efficient content location using interest-based locality in peer-topeer systems. In Proceedings of the InfoCom.Google Scholar
- Stutzbach, D., Rejaie, R., and Sen, S. 2005. Characterizing unstructured overlay topologies in modern p2p file-sharing systems. In Proceedings of the ACM SIGCOMM Internet Measurement Conference. Google Scholar
Digital Library
- Terveen, L. and Hill, W. 2001. Beyond recommender systems: Helping people help each other. In HCI in the New Millennium. Addison-Wesley, 487--509.Google Scholar
- Tveit, A. 2001. Peer-to-peer based recommendations for mobile commerce. In Proceedings of the 1st International Workshop on Mobile Commerce (WMC'01). ACM Press, New York, NY, 26--29. Google Scholar
Digital Library
- von Goethe, J. W. 1809. Die Wahlverwandtschaften. http://en.wikipedia.org/wiki/Elective_Affinities.Google Scholar
- Wang, J., Reinders, M. J. T., Lagendijk, R. L., and Pouwelse, J. 2005. Self-organizing distributed collaborative filtering. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'05). ACM Press, New York, NY, 659--660. Google Scholar
Digital Library
- Wasserman, S. and Faust, K. 1994. Social Network Analysis. Cambridge University Press, Cambridge, U.K.Google Scholar
- Watts, D. J. 1999. Small Worlds: The Dynamics of Networks Between Order and Randomness. Princeton University Press, Princeton, NJ. Google Scholar
Digital Library
- Watts, D. J. and Strogatz, S. H. 1998. Collective dynamics of ‘small-world’ networks. Nature 393, 6684, 440--442.Google Scholar
- Wei, Y. Z., Moreau, L., and Jennings, N. R. 2005. A market-based approach to recommender systems. ACM Trans. Inform. Syst. 23, 3, 227--266. Google Scholar
Digital Library
- Williams, R. J. and Martinez, N. D. 2000. Simple rules yield complex food webs. Nature 404, 6774, 180--183.Google Scholar
- Willis, J. C. and Yule, G. U. 1922. Some statistics of evolution and geographical distribution in plants and animals, and their significance. Nature 109, 177--179.Google Scholar
Cross Ref
- Xie, B., Han, P., and Shen, R. 2004. Pipecf: a scalable dht-based collaborative filtering recommendation system. In Proceedings of the 13th International World Wide Web Conference on Alternate Track Papers and Posters (WWW Alt.'04). ACM Press, New York, NY, 224--225. Google Scholar
Digital Library
- Yan, T. and Garcia-Molina, H. 1995. SIFT—A tool for wide-area information dissemination. In Proceedings of the USENIX Technical Conference. 177--186. Google Scholar
Digital Library
- Zanette, D. H. and Manrubia, S. C. 2001. Vertical transmission of culture and the distribution of family names. Physica A: Statist. Mechanics Appl. 295, 1-2, 1--8.Google Scholar
Cross Ref
Index Terms
A peer-to-peer recommender system based on spontaneous affinities
Recommendations
Evaluating peer-to-peer recommender systems that exploit spontaneous affinities
SAC '07: Proceedings of the 2007 ACM symposium on Applied computingThe validation of a recommender system is always a quite hazardous task, because of the difficulty of modeling the tastes of a given user. Novel (decentralized) recommender systems are proposed and evaluated by way of well known logs of user profiles ...
Peer Interest-based Discovery for Decentralized Peer-to-Peer Systems
3PGCIC '10: Proceedings of the 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet ComputingThe success of content distribution oriented peer-to-peer systems heavily depends on the resource discovery mechanism. In case of large-scale distributed systems, this mechanism must be scalable and robust. The paper proposes a structured solution for ...
Indexing through Querying in Unstructured Peer-to-Peer Overlay Networks
APNOMS '08: Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service ManagementThe efficiency of a Peer-to-Peer file sharing overlay is measured in terms of the scalability and versatility of its object lookup strategy. In these networks peers carry out distributed query relaying to discover the service providers. Existing lookup ...






Comments