ABSTRACT
When we write or prepare to write a research paper, we always have appropriate references in mind. However, there are most likely references that we have missed and that should have been read and cited. As such, a good citation recommendation system would improve not only our paper but also the overall efficiency and quality of literature search.
Usually, a citation's context contains explicit words explaining the citation. Building on this observation, we propose a method that "translates" research papers into references. By treating the citations and their contexts from existing papers as parallel data written in two different "languages", we adopt a translation model to relate these two "vocabularies".
Experiments on both the CiteSeer and CiteULike datasets show that our approach outperforms other baseline methods and increases the precision, recall and F-measure by at least 5% to 10%, respectively. In addition, our approach runs much faster in both the training and recommending stages, which demonstrates the effectiveness and the scalability of our work.
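To make the "translation" idea concrete, the following is a minimal sketch, not the method as implemented in the paper. It assumes an IBM Model 1 style word-to-reference translation table estimated with EM, which the abstract itself does not specify. The function names, the toy `pairs` data, and the reference identifiers are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def train_translation_table(pairs, iterations=10):
    """Estimate t[word][ref] = P(ref | word) with an IBM Model 1 style EM loop.

    pairs: list of (context_words, cited_refs) tuples, i.e. the "parallel data".
    This is an illustrative assumption, not the paper's exact model.
    """
    words = {w for ctx, _ in pairs for w in ctx}
    refs = {r for _, cited in pairs for r in cited}
    # Start from a uniform distribution over references for every word.
    t = {w: {r: 1.0 / len(refs) for r in refs} for w in words}
    for _ in range(iterations):
        count = defaultdict(float)
        total = defaultdict(float)
        # E-step: share each cited reference among the words of its context.
        for ctx, cited in pairs:
            for r in cited:
                norm = sum(t[w][r] for w in ctx)
                for w in ctx:
                    frac = t[w][r] / norm
                    count[(w, r)] += frac
                    total[w] += frac
        # M-step: renormalise the expected counts per word.
        for w in words:
            for r in refs:
                t[w][r] = count[(w, r)] / total[w] if total[w] > 0 else 0.0
    return t


def recommend(t, query_words, top_k=3):
    """Rank references by the average P(ref | word) over known query words."""
    known = [w for w in query_words if w in t]
    scores = defaultdict(float)
    for w in known:
        for r, p in t[w].items():
            scores[r] += p / len(known)
    return sorted(scores, key=lambda r: (-scores[r], r))[:top_k]


# Hypothetical toy corpus: citation-context words paired with the cited work.
pairs = [
    (["topic", "model", "latent"], ["ref_topic_models"]),
    (["translation", "alignment", "model"], ["ref_translation"]),
    (["topic", "latent"], ["ref_topic_models"]),
    (["translation", "alignment"], ["ref_translation"]),
]
table = train_translation_table(pairs)
print(recommend(table, ["latent", "topic"], top_k=1))
```

In this sketch, words that only ever co-occur with one reference end up translating to it with high probability, while an ambiguous word such as "model" keeps its probability mass split across both references.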
Recommending citations: translating papers into references