DOI: 10.1145/280765.280786

CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications

Online: 01 May 1998

ABSTRACT

Research papers available on the World Wide Web (WWW or Web) are often poorly organized, often exist in formats opaque to search engines (e.g., PostScript), and increase in quantity daily. Finding interesting and relevant publications on the Web typically takes significant time and effort. We have developed a Web-based information agent that assists the user in performing a scientific literature search. Given a set of keywords, the agent uses Web search engines and heuristics to locate and download papers. The papers are parsed to extract information features such as the abstract and individually identified citations. The agent's Web interface can be used to find relevant papers in the database through keyword searches or by navigating the links between papers formed by the citations. Links to both citing and cited publications can be followed. In addition to simple browsing and keyword searches, the agent can find papers that are similar to a given paper, using word information and by analyzing the citations the papers have in common.
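The following Python sketch illustrates the kind of citation navigation and similarity ranking the abstract describes. It is only an illustration under stated assumptions: the abstract does not say which measures the agent uses, so Jaccard overlap of citation sets and cosine similarity of word counts are stand-ins chosen here, and the toy database, function names, and the `weight` parameter are all hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt

# Hypothetical toy database: paper id -> (text, ids of cited papers).
# In the agent described above, these would come from parsing downloaded papers.
PAPERS = {
    "A": ("citation indexing for web agents", ["X", "Y", "Z"]),
    "B": ("web agents for citation retrieval", ["X", "Y"]),
    "C": ("protein folding simulation", ["W"]),
}

def build_links(papers):
    """Return (cited, citing): forward and reverse citation links."""
    cited = {pid: set(refs) for pid, (_, refs) in papers.items()}
    citing = defaultdict(set)
    for pid, refs in cited.items():
        for ref in refs:
            citing[ref].add(pid)
    return cited, citing

def citation_similarity(cited, a, b):
    """Jaccard overlap of two papers' citation sets (an assumed measure)."""
    union = cited[a] | cited[b]
    return len(cited[a] & cited[b]) / len(union) if union else 0.0

def word_similarity(papers, a, b):
    """Cosine similarity of raw word counts (an assumed measure)."""
    va = Counter(papers[a][0].split())
    vb = Counter(papers[b][0].split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = sqrt(sum(c * c for c in va.values())) * sqrt(sum(c * c for c in vb.values()))
    return dot / norm if norm else 0.0

def most_similar(papers, target, weight=0.5):
    """Rank the other papers by a weighted mix of both similarity scores."""
    cited, _ = build_links(papers)
    scores = {
        pid: weight * citation_similarity(cited, target, pid)
        + (1 - weight) * word_similarity(papers, target, pid)
        for pid in papers
        if pid != target
    }
    return sorted(scores, key=scores.get, reverse=True)
```

Calling `build_links(PAPERS)` gives both directions of the citation graph, so a reader can move from a paper to what it cites or to the papers that cite it. `most_similar(PAPERS, "A")` ranks "B" ahead of "C", because "A" and "B" share both vocabulary and cited papers.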

Published in

AGENTS '98: Proceedings of the second international conference on Autonomous agents
May 1998
484 pages
ISBN: 0897919831
DOI: 10.1145/280765

Copyright © 1998 ACM

Publisher

Association for Computing Machinery, New York, NY, United States

Acceptance Rates

AGENTS '98 Paper Acceptance Rate 57 of 180 submissions, 32%
Overall Acceptance Rate 182 of 599 submissions, 30%
