Abstract
The technique of collaborative filtering is especially successful in generating personalized recommendations. More than a decade of research has resulted in numerous algorithms, although no comparison of the different strategies has been made. In fact, a universally accepted way of evaluating a collaborative filtering algorithm does not exist yet. In this work, we compare different techniques found in the literature, and we study the characteristics of each one, highlighting their principal strengths and weaknesses. Several experiments have been performed, using the most popular metrics and algorithms. Moreover, two new metrics designed to measure the precision on good items have been proposed.
The results have revealed the weaknesses of many algorithms in extracting information from user profiles especially under sparsity conditions. We have also confirmed the good results of SVD-based techniques already reported by other authors. As an alternative, we present a new approach based on the interpretation of the tendencies or differences between users and items. Despite its extraordinary simplicity, in our experiments, it obtained noticeably better results than more complex algorithms. In fact, in the cases analyzed, its results are at least equivalent to those of the best approaches studied. Under sparsity conditions, there is more than a 20% improvement in accuracy over the traditional user-based algorithms, while maintaining over 90% coverage. Moreover, it is much more efficient computationally than any other algorithm, making it especially adequate for large amounts of data.
- Aggarwal, C. C., Wolf, J. L., Wu, K.-L., and Yu, P. S. 1999. Horting hatches an egg: a new graph-theoretic approach to collaborative filtering. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'99). ACM, New York, NY, 201--212. Google Scholar
Digital Library
- Balabanović, M. and Shoham, Y. 1997. Fab: content-based, collaborative recommendation. Comm. ACM 40, 3, 66--72. Google Scholar
Digital Library
- Basilico, J. and Hofmann, T. 2004. Unifying collaborative and content-based filtering. In Proceedings of the 21st International Conference on Machine Learning (ICML'04). ACM, New York, NY, 9. Google Scholar
Digital Library
- Bennett, J. and Lanning, S. 2007. The netflix prixe. In Proceedings of KDD Cup and Workshop (KDDCup'07). ACM, 4.Google Scholar
- Billsus, D. and Pazzani, M. J. 1998. Learning collaborative information filters. In Proceedings of the 15th International Conference on Machine Learning. Morgan Kaufmann, San Francisco, CA, 46--54. Google Scholar
Digital Library
- Breese, J. S., Heckerman, D., and Kadie, C. 1998. Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence. 43--52. Google Scholar
Digital Library
- Calderón-Benavides, M. L., González-Caro, C. N., de J. Pérez-Alcázar, J., García-Díaz, J. C., and Delgado, J. 2004. A comparison of several predictive algorithms for collaborative filtering on multi-valued ratings. In Proceedings of the ACM Symposium on Applied Computing (SAC'04). ACM, New York, NY, 1033--1039. Google Scholar
Digital Library
- Canny, J. 2002. Collaborative filtering with privacy via factor analysis. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02). ACM, New York, NY, 238--245. Google Scholar
Digital Library
- Chirita, P.-A., Nejdl, W., and Zamfir, C. 2005. Preventing shilling attacks in online recommender systems. In Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management (WIDM'05). ACM, New York, NY, 67--74. Google Scholar
Digital Library
- Ellenberg, J. 2008. This psychologist might outsmart the math brains competing for the Netflix prize. Wired Maga. 16, 3.Google Scholar
- Fisher, D., Hildrum, K., Hong, J., Newman, M., Thomas, M., and Vuduc, R. 2000. Swami (poster session): A framework for collaborative filtering algorithm development and evaluation. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'00). ACM, New York, NY, 366--368. Google Scholar
Digital Library
- Foltz, P. W. and Dumais, S. T. 1992. Personalized information delivery: an analysis of information filtering methods. Comm. ACM 35, 12, 51--60. Google Scholar
Digital Library
- Funk, S. 2006. Netflix update: Try this at home. http://sifter.org/simon/journal/20061211.html.Google Scholar
- George, T. and Merugu, S. 2005. A scalable collaborative filtering framework based on co-clustering. In Proceedings of the 5th IEEE International Conference on Data Mining (ICDM'05). IEEE Computer Society, Los Alamitos, CA, 625--628. Google Scholar
Digital Library
- Goldberg, D., Nichols, D., Oki, B. M., and Terry, D. 1992. Using collaborative filtering to weave an information tapestry. Comm. ACM 35, 12, 61--70. Google Scholar
Digital Library
- Goldberg, K., Roeder, T., Gupta, D., and Perkins, C. 2001. Eigentaste: A constant time collaborative filtering algorithm. Inform. Retr. 4, 2, 133--151. Google Scholar
Digital Library
- Herlocker, J., Konstan, J. A., and Riedl, J. 2002. An empirical analysis of design choices in neighborhood-based collaborative filtering algorithms. Inform. Retr. 5, 4, 287--310. Google Scholar
Digital Library
- Herlocker, J. L., Konstan, J. A., Borchers, A., and Riedl, J. 1999. An algorithmic framework for performing collaborative filtering. In Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'99). ACM, New York, NY, 230--237. Google Scholar
Digital Library
- Herlocker, J. L., Konstan, J. A., Terveen, L. G., and Riedl, J. T. 2004. Evaluating collaborative filtering recommender systems. ACM Trans. Inform. Syst. 22, 1, 5--53. Google Scholar
Digital Library
- Hofmann, T. 2004. Latent semantic models for collaborative filtering. ACM Trans. Inform. Syst. 22, 1, 89--115. Google Scholar
Digital Library
- Huang, Z., Chen, H., and Zeng, D. 2004. Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering. ACM Trans. Inform. Syst. 22, 1, 116--142. Google Scholar
Digital Library
- Huang, Z., Zeng, D., and Chen, H. 2007. A comparison of collaborative-filtering recommendation algorithms for e-commerce. IEEE Intell. Syst. 22, 5, 68--78. Google Scholar
Digital Library
- Jin, R., Si, L., and Zhai, C. 2003. Preference-based graphic models for collaborative filtering. In Proceedings of the 19th Conference in Uncertainty in Artificial Intelligence. 329--336. Google Scholar
Digital Library
- Karypis, G. 2001. Evaluation of item-based top-n recommendation algorithms. In Proceedings of the 10th International Conference on Information and Knowledge Management (CIKM'01). ACM, New York, NY, 247--254. Google Scholar
Digital Library
- Kohrs, A. and Mérialdo, B. 1999. Clustering for collaborative filtering applications. In Proceedings of the International Conference on Computational Intelligence for Modeling, Control and Automation (CIMCA'99).Google Scholar
- Koren, Y. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In Proceeding of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08). ACM, New York, NY, 426--434. Google Scholar
Digital Library
- Lam, S. K. and Riedl, J. 2004. Shilling recommender systems for fun and profit. In Proceedings of the 13th International Conference on World Wide Web (WWW'04). ACM, New York, NY, 393--402. Google Scholar
Digital Library
- Lemire, D. and Maclachlan, A. 2005. Slope one predictors for online rating-based collaborative filtering. In Proceedings of SIAM Data Mining (SDM'05).Google Scholar
- Marlin, B. 2004. Collaborative filtering: A machine learning perspective. M.S. thesis, University of Toronto.Google Scholar
- Marlin, B. M. and Zemel, R. S. 2009. Collaborative prediction and ranking with non-random missing data. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys'09). ACM, New York, NY, 5--12. Google Scholar
Digital Library
- McNee, S. M., Riedl, J., and Konstan, J. A. 2006. Being accurate is not enough: how accuracy metrics have hurt recommender systems. In Extended Abstracts on Human Factors in Computing Systems (CHI'06). ACM, New York, NY, 1097--1101. Google Scholar
Digital Library
- Melville, P., Mooney, R., and Nagarajan, R. 2001. Content-boosted collaborative filtering. In Proceedings of the ACM SIGIR Workshop on Recommender Systems.Google Scholar
- Mobasher, B., Burke, R., Bhaumik, R., and Williams, C. 2007. Toward trustworthy recommender systems: An analysis of attack models and algorithm robustness. ACM Trans. Internet Technol. 7, 4, 23. Google Scholar
Digital Library
- Paterek, A. 2007. Improving regularized singular value decomposition for collaborative filtering. In Proceedings of the KDD Cup Workshop at the 13th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD'07). 39--42.Google Scholar
- Pennock, D., Horvitz, E., Lawrence, S., and Giles, C. L. 2000. Collaborative filtering by personality diagnosis: A hybrid memory- and model-based approach. In Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI'2000). 473--480. Google Scholar
Digital Library
- Rennie, J. D. M. and Srebro, N. 2005. Fast maximum margin matrix factorization for collaborative prediction. In Proceedings of the 22nd International Conference on Machine Learning (ICML'05). ACM, New York, NY, 713--719. Google Scholar
Digital Library
- Resnick, P., Iacovou, N., Suchak, M., Bergstrom, P., and Riedl, J. 1994. Grouplens: an open architecture for collaborative filtering of netnews. In Proceedings of the ACM Conference on Computer Supported Cooperative Work (CSCW'94). ACM, New York, NY, 175--186. Google Scholar
Digital Library
- Sandvig, J. J., Mobasher, B., and Burke, R. 2007. Robustness of collaborative recommendation based on association rule mining. In Proceedings of the ACM Conference on Recommender Systems (RecSys'07). ACM, New York, NY, 105--112. Google Scholar
Digital Library
- Sarwar, B., Karypis, G., Konstan, J., and Reidl, J. 2001. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web (WWW'01). ACM, New York, NY, 285--295. Google Scholar
Digital Library
- Sarwar, B., Karypis, G., Konstan, J., and Riedl, J. 2000. Application of dimensionality reduction in recommender systems--a case study. In Proceedings of the ACM WebKDD Workshop.Google Scholar
- Schein, A. I., Popescul, A., Ungar, L. H., and Pennock, D. M. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'02). ACM, New York, NY, 253--260. Google Scholar
Digital Library
- Shafiei, M. and Milios, E. 2006. Model-based Overlapping Co-Clustering. In Proceedings of the 4th Workshop on Text Mining at the 6th SIAM International Conference on Data Mining.Google Scholar
- Shardanand, U. 1994. Social information filtering for music recommendation. M.S. thesis, Massachussets Institute of Technology.Google Scholar
- Shardanand, U. and Maes, P. 1995. Social information filtering: algorithms for automating “word of mouth”. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI'95). ACM Press/Addison-Wesley Publishing Co., New York, NY, 210--217. Google Scholar
Digital Library
- Si, L. and Jin, R. 2003. A flexible mixture model for collaborative filtering. In Proceedings of the 20th International Conference on Machine Learning.Google Scholar
- Ungar, L. and Foster, D. 1998. Clustering methods for collaborative filtering. In Proceedings of the Workshop on Recommendation Systems. AAAI Press, Menlo Park, CA.Google Scholar
- Vucetic, S. and Obradovic, Z. 2000. A regression-based approach for scaling-up personalized recommender systems in e-commerce. In Proceedings of the ACM WebKDD Workshop.Google Scholar
- Wang, J., de Vries, A. P., and Reinders, M. J. T. 2006. Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'06). ACM, New York, NY, 501--508. Google Scholar
Digital Library
- Xue, G.-R., Lin, C., Yang, Q., Xi, W., Zeng, H.-J., Yu, Y., and Chen, Z. 2005. Scalable collaborative filtering using cluster-based smoothing. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'05). ACM, New York, NY, 114--121. Google Scholar
Digital Library
Index Terms
Comparison of collaborative filtering algorithms: Limitations of current techniques and proposals for scalable, high-performance recommender systems
Recommendations
Trust-based collaborative filtering: tackling the cold start problem using regular equivalence
RecSys '18: Proceedings of the 12th ACM Conference on Recommender SystemsUser-based Collaborative Filtering (CF) is one of the most popular approaches to create recommender systems. This approach is based on finding the most relevant k users from whose rating history we can extract items to recommend. CF, however, suffers ...
Performance Comparison of Combined Collaborative Filtering Algorithms for Recommender Systems
COMPSACW '12: Proceedings of the 2012 IEEE 36th Annual Computer Software and Applications Conference WorkshopsRecommender systems have a goal to make personalized recommendations by using filtering algorithms. Collaborative filtering (CF) is one of the most popular techniques for recommender systems. As usual, huge number of the datasets on the Internet ...
Merging trust in collaborative filtering to alleviate data sparsity and cold start
Providing high quality recommendations is important for e-commerce systems to assist users in making effective selection decisions from a plethora of choices. Collaborative filtering is a widely accepted technique to generate recommendations based on ...






Comments