ABSTRACT
Research papers available on the World Wide Web (WWW or Web) are often poorly organized, often exist in formats opaque to search engines (e.g. PostScript), and increase in quantity daily. Finding interesting and relevant publications on the Web typically takes significant time and effort. We have developed a Web-based information agent that assists the user in performing a scientific literature search. Given a set of keywords, the agent uses Web search engines and heuristics to locate and download papers. The papers are parsed to extract features such as the abstract and individually identified citations. The agent's Web interface can be used to find relevant papers in the database by keyword search or by navigating the links between papers formed by the citations. Links to both citing and cited publications can be followed. In addition to simple browsing and keyword searches, the agent can find papers that are similar to a given paper, using word information and by analyzing the citations the papers have in common.
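The idea of finding similar papers through common citations can be illustrated with a minimal sketch. The abstract does not state the exact measure, so the weighting below (a citation shared by fewer papers counts for more, analogous to inverse document frequency) is an assumption, and the names `build_cited_by` and `similar_by_citations` are hypothetical.

```python
import math
from collections import defaultdict


def build_cited_by(citations):
    """Invert a paper -> cited-works map into cited-work -> citing-papers."""
    cited_by = defaultdict(set)
    for paper, cited in citations.items():
        for work in cited:
            cited_by[work].add(paper)
    return cited_by


def similar_by_citations(target, citations):
    """Rank other papers by the weighted overlap of their citations with target.

    Assumption (not from the source): each shared citation contributes
    log(N / n_citing), where N is the number of papers in the database and
    n_citing is the number of papers that cite that work.
    """
    cited_by = build_cited_by(citations)
    total_papers = len(citations)
    scores = defaultdict(float)
    for work in citations[target]:
        weight = math.log(total_papers / len(cited_by[work]))
        for other in cited_by[work]:
            if other != target:
                scores[other] += weight
    # Highest score first; ties broken by paper identifier for stable output.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Toy database: paper identifier -> set of identified citations.
    database = {
        "A": {"x", "y"},
        "B": {"x", "y"},
        "C": {"x"},
        "D": {"z"},
    }
    for paper, score in similar_by_citations("A", database):
        print(paper, round(score, 3))
```

The inverted map built by `build_cited_by` is also what would let a reader follow links from a cited work back to the papers citing it, while the original map gives the links from a paper to the works it cites.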
Index Terms
CiteSeer: an autonomous Web agent for automatic retrieval and identification of interesting publications