ABSTRACT
When you write papers, how many times do you want to make some citations at a place but you are not sure which papers to cite? Do you wish to have a recommendation system which can recommend a small number of good candidates for every place that you want to make some citations? In this paper, we present our initiative of building a context-aware citation recommendation system. High quality citation recommendation is challenging: not only should the citations recommended be relevant to the paper under composition, but also should match the local contexts of the places citations are made. Moreover, it is far from trivial to model how the topic of the whole paper and the contexts of the citation places should affect the selection and ranking of citations. To tackle the problem, we develop a context-aware approach. The core idea is to design a novel non-parametric probabilistic model which can measure the context-based relevance between a citation context and a document. Our approach can recommend citations for a context effectively. Moreover, it can recommend a set of citations for a paper with high quality. We implement a prototype system in CiteSeerX. An extensive empirical evaluation in the CiteSeerX digital library against many baselines demonstrates the effectiveness and the scalability of our approach.
References
- R. Agrawal, T. Imielinski and A. Swami. Mining Association Rules Between Sets of Items in Large Databases. SIGMOD, 1993. Google Scholar
Digital Library
- S. Aya, C. Lagoze and T. Joachims. Citation Classification and its Applications. ICKM'05.Google Scholar
- K. Banaszek, G. D'Ariano, M. Paris and M. Sacchi. Maximum-likelihood estimation of the density matrix. Physical Review A, 1999.Google Scholar
- C. Basu, H. Hirsh, W. Cohen and C. Nevill-Manning. Technical Paper Recommendation: A Study in Combining Multiple Information Sources. J. of Artificial Intelligence Research, 2001. Google Scholar
Digital Library
- D. Blei, A. Ng and M. Jordan. Latent dirichlet allocation. J. Machine Learning Research 2003. Google Scholar
Digital Library
- K. Chandrasekaran, S. Gauch, P. Lakkaraju and H. Luong. Concept-Based Document Recommendations for CiteSeer Authors. Adaptive Hypermedia and Adaptive Web-Based Systems, Springer, 2008. Google Scholar
Digital Library
- D. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. NIPS'01.Google Scholar
- F. Diaz. Regularizing Ad Hoc Retrieval Scores. CIKM'05. Google Scholar
Digital Library
- E. Erosheva, S. Fienberg and J. Lafferty. Mixed membership models of scientific publications. PNAS 2004.Google Scholar
Cross Ref
- A. Gleason. Measures on the Closed Subspaces of a Hilbert Space. J. of Mathematics and Mechanics, 1957.Google Scholar
- S. Huang, G. Xue, B. Zhang, Z. Chen, Y. Yu and W. Ma. TSSP: A Reinforcement Algorithm to Find Related Papers. WI'04. Google Scholar
Digital Library
- A. Jeffrey and H. Dai. Handbook of Mathematical Formulas and Integrals. Academic Press, 2008.Google Scholar
- J. Kleinberg. Authoritative sources in a hyperlinked environment. J. of the ACM, 1999. Google Scholar
Digital Library
- J. Lafferty and G. Lebanon. Diffusion Kernels on Statistical Manifolds. J. of Machine Learning Research, 2005. Google Scholar
Digital Library
- D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. CIKM'03. Google Scholar
Digital Library
- S. McNee, I. Albert, D. Cosley, P. Gopalkrishnan, S. Lam, A. Rashid, J. Konstan and J. Riedl. On the Recommending of Citations for Research Papers. CSCW'02. Google Scholar
Digital Library
- M. Melucci. A basis for information retrieval in context. TOIS, 2008. Google Scholar
Digital Library
- R. Nallapati, A. Ahmed, E. Xing and W. Cohen. Joint latent topic models for text and citations. SIGKDD'08. Google Scholar
Digital Library
- Z. Nie, Y. Zhang, J. Wen and W. Ma. Object-Level Ranking: Bringing Order to Web Objects. WWW'05. Google Scholar
Digital Library
- C. Rijsbergen. The Geometry of Information Retrieval. Cambridge University Press, 2004. Google Scholar
Digital Library
- A. Ritchie. Citation context analysis for information retrieval. PhD thesis, University of Cambridge, 2008.Google Scholar
- B. Shaparenko and T. Joachims. Identifying the Original Contribution of a Document via Language Modeling. ECML, 2009.Google Scholar
Digital Library
- T. Strohman, W. Croft and D. Jensen. Recommending Citations for Academic Papers. SIGIR'07 and Technical Report, http://ciir-publications.cs.umass.edu/getpdf.php?id=610. Google Scholar
Digital Library
- J. Tang and J. Zhang. A Discriminative Approach to Topic-Based Citation Recommendations PAKDD'09. Google Scholar
Digital Library
- R. Torres, S. McNee, M. Abel, J. Konstan and J. Riedl. Enhancing Digitial Libraries with Techlens. JCDL'04. Google Scholar
Digital Library
- F. Wang, B. Chen and Z. Miao. A Survey on Reviewer Assignment Problem. IEA/AIE'08. Google Scholar
Digital Library
- D. Zhou, S. Zhu, K. Yu, X. Song, B. Tseng, H. Zha and L. Giles. Learning Multiple Graphs for Document Recommendations. WWW'08. Google Scholar
Digital Library
Index Terms
Context-aware citation recommendation





Comments