ABSTRACT
We examine the effect of modeling a researcher's past works in recommending scholarly papers to the researcher. Our hypothesis is that an author's published works constitute a clean signal of the latent interests of a researcher. A key part of our model is to enhance the profile derived directly from past works with information coming from the past works' referenced papers as well as papers that cite the work. In our experiments, we differentiate between junior researchers that have only published one paper and senior researchers that have multiple publications. We show that filtering these sources of information is advantageous -- when we additionally prune noisy citations, referenced papers and publication history, we achieve statistically significant higher levels of recommendation accuracy.
References
- M. Balabanovic and Y. Shoham. Fab: Content-Based, Collaborative Recommendation. Communications of the ACM, 40(3):66--72, 1997. Google Scholar
Digital Library
- C. Basu, H. Hirsh, and W. Cohen. Recommendation as Classification: Using Social and Content-Based Information in Recommendation. In Proc. of the 15th National Conference on Artificial Intelligence (AAAI '98), pages 714--720, 1998. Google Scholar
Digital Library
- S. Bird, R. Dale, B. J. Dorr, B. Gibson, M. T. Joseph, M.-Y. Kan, D. Lee, B. Powley, D. R. Radev, and Y. F. Tan. The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics. In Proc. of the 6th International Conference on Language Resources and Evaluation Conference (LREC'08), pages 1755--1759, 2008.Google Scholar
- J. Bollen, M. A. Rodriguez, and H. V. D. Sompel. Journal Status. Scientometrics, 69(3):669--687, 2006.Google Scholar
Cross Ref
- J. S. Breese, D. Heckerman, and C. Kadie. Empirical Analysis of Predictive Algorithms for Collaborative Filtering. In Proc. of the 14th Conference on Uncertanity in Artificial Intelligence (UAI '98), pages 43--52, 1998. Google Scholar
Digital Library
- C. Castillo, D. Donato, A. Gionis, V. Murdock, and F. Silvestri. Know Your Neighbors: Web Spam Detection Using Web Topology. In Proc. of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 423--430, 2007. Google Scholar
Digital Library
- P. Chen, H. Xie, S. Maslov, and S. Redner. Finding Scientific Gems with Google's PageRank Algorithm. Journal of Informetrics, 1(1):8--15, 2007.Google Scholar
Cross Ref
- W. Chu and S.-T. Park. Personalized Recommendation on Dynamic Content Using Predictive Bilinear Models. In Proc. of the 18th International World Wide Web Conference (WWW2009), 2009. 691--700. Google Scholar
Digital Library
- E. Garfield. Citation Indexing: Its Theory and Application in Science, Technology, and Humanities. New York: John Wiley and Sons, 1979.Google Scholar
- D. Goldberg, D. Nichols, B. M. Oki, and D. B. Terry. Using Collaborative Filtering to Weave an Information Tapestry. Communications of the ACM, 35(12):61--70, 1992. Google Scholar
Digital Library
- M. Gori and A. Pucci. Research Paper Recommender Systems: A Random-Walk Based Approach. In Proc. of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006), pages 778--781, 2006. Google Scholar
Digital Library
- K. Jarvelin and J. Kekalainen. IR Evaluation Methods for Retrieving Highly Relevant Documents. In Proc. of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2000), pages 41--48, 2000. Google Scholar
Digital Library
- H. Kautz, B. Selman, and M. Shah. Referral Web: Combining Social Networks and Collaborative Filtering. Communications of the ACM, 40(3):63--65, 1997. Google Scholar
Digital Library
- H.-N. Kim, I. Ha, S.-H. Lee, and G.-S. Jo. A Collaborative Approach to User Modeling for Personalized Content Recommendations. In Proc. of the 11th International Conference on Asian Digital Libraries (ICADL2008), Lecture Notes in Computer Science (LNCS), Vol. 5362, pages 215--224. Springer-Verlag, 2008. Google Scholar
Digital Library
- J. A. Konstan, B. N. Miller, D. Maltz, J. L. Herlocker, L. R. Gordon, and J. Riedl. GroupLens: Applying Collaborative Filtering to Usenet News. Communications of the ACM, 40(3):77--87, 1997. Google Scholar
Digital Library
- M. Krapivin and M. Marchese. Focused PageRank in Scientific Papers Ranking. In Proc. of the 11th International Conference on Asian Digital Libraries (ICADL 2008), Lecture Notes in Computer Science (LNCS), Vol. 5362, pages 144--153, 2008. Google Scholar
Digital Library
- S. M. McNee, I. Albert, D. Cosley, S. L. P. Gopalkrishnan, A. M. Rashid, J. S. Konstan, and J. Riedl. Predicting User Interests from Contextual Information. In Proc. of the 2002 ACM Conference on Computer Supported Cooperative Work (CSCW '02), pages 116--125, 2002. Google Scholar
Digital Library
- P. Melville, R. J. Mooney, and R. Nagarajan. Content-Boosted Collaborative Filtering for Improved Recommendations. In Proc. of the 18th National Conference on Artificial Intelligence (AAAI2002), pages 187--192, 2002. Google Scholar
Digital Library
- F. Narin. Evaluative Bibliometrics: The Use of Publication and Citation Analysis in the Evaluation of Scientific Activity. Cherry Hill, N.J.: Computer Horizons, 1976.Google Scholar
- L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report SIDL-WP-1999-0120, Stanford Digital Library Technologies Project, 1998.Google Scholar
- S.-T. Park, D. Pennock, O. Madani, N. Good, and D. DeCoste. Naïve Filterbots for Robust Cold-Start Recommendations. In Proc. of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data mining (KDD'06), pages 699--705, 2006. Google Scholar
Digital Library
- M. F. Porter. An Algorithm for Suffix Stripping. Program, 14(3):pages 130--137, 1980.Google Scholar
- X. Qi and B. D. Davison. Classifiers Without Borders: Incorporating Fielded Text From Neighboring Web Pages. In Proc. of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pages 643--650, 2008. Google Scholar
Digital Library
- P. Resnick, N. Iacovou, M. Suchak, and J. R. P. Bergstorm. GroupLens: An Open Architecture for Collaborative Filtering of Netnews. In Proc. of the ACM 1994 Conference on Computer Supported Cooperative Work (CSCW '94), pages 175--186, 1994. Google Scholar
Digital Library
- G. Salton and M. J. McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983. Google Scholar
Digital Library
- B. M. Sarwar, G. Karypis, and J. A. Konstan. Analysis of Recommendation Algorithms for E-commerce. In Proc. of the 2nd ACM Conference on Electronic Commerce (EC '00), pages 158--167, 2000. Google Scholar
Digital Library
- H. Sayyadi and L. Getoor. FutureRank: Ranking Scientific Articles by Predicting their Future PageRank. In Proc. of the 9th SIAM International Conference on Data Mining, pages 533--544, 2009.Google Scholar
Cross Ref
- C. Shahabi and Y.-S. Chen. An Adaptive Recommendation System without Explicit Acquisition of User Relevance Feedback. Distributed and Parallel Databases, 14(3):173--192, 2003. Google Scholar
Digital Library
- X. Shen, B. Tan, and C. Zhai. Context-Sensitive Information Retrieval Using Implicit Feedback. In Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pages 43--50, 2005. Google Scholar
Digital Library
- M. D. Smucker, J. Allan, and B. Carterette. A Comparison of Statistical Significance Tests for Information Retrieval. In Proc. of the 16th International Conference on Information and Knowledge Management (CIKM'07), 2007. 623-632. Google Scholar
Digital Library
- K. Sugiyama, K. Hatano, and M. Yoshikawa. Adaptive Web Search Based on User Profile Constructed without Any Effort from Users. In Proc. of the 13th International World Wide Web Conference (WWW2004), pages 675--684, 2004. Google Scholar
Digital Library
- K. Sugiyama, K. Hatano, M. Yoshikawa, and S. Uemura. Refinement of TF-IDF Schemes for Web Pages Using their Hyperlinked Neighboring Pages. In Proc. of the 14th ACM Conference on Hypertext and Hypermedia (HT '03), pages 198--207, 2003. Google Scholar
Digital Library
- Y. Sun and C. Giles. Popularity Weighted Ranking for Academic Digital Libraries. In Proc. of the 29th European Conference on Information Retrieval (ECIR 2007), pages 605--612, 2007. Google Scholar
Digital Library
- B. Tan, X. Shen, and C. Zhai. Mining Long-Term Search History to Improve Search History. In Proc. of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data mining (KDD'06), pages 718--723, 2006. Google Scholar
Digital Library
- J. Teevan, S. T. Dumais, and E. Horvitz. Personalizing Search via Automated Analysis of Interests and Activities. In Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pages 449--456, 2005. Google Scholar
Digital Library
- R. Torres, S. M. McNee, M. Abel, J. A. Konstan, and J. Riedl. Enhancing Digital Libraries with TechLens. In Proc. of the 4th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2004), pages 228--236, 2004. Google Scholar
Digital Library
- E. M. Voorhees. The TREC-8 Question Answering Track Report. In Proc. of the 8th Text REtrieval Conference (TREC-8), pages 77--82, 1999.Google Scholar
- R. W. White, P. Bailey, and L. Chen. Predicting User Interests from Contextual Information. In Proc. of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), pages 363--370, 2009. Google Scholar
Digital Library
- X. Su and T. M. Khoshgoftaar and R. Greiner. Imputed Neighborhood Based Collaborative Filtering. In Proc. of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2008), pages 633--639, 2008. Google Scholar
Digital Library
- D. Yang, B. Wei, J. Wu, Y. Zhang, and L. Zhang. CARES: A Ranking-Oriented CADAL Recommender System. In Proc. of the 9th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2009), pages 203--211, 2009. Google Scholar
Digital Library
Index Terms
Scholarly paper recommendation via user's recent research interests





Comments