ABSTRACT
CiteSeer is currently a very large source of meta-data on the World Wide Web (WWW). Such meta-data is the key material for the Semantic Web. However, CiteSeer is not yet a Semantic-enabled service, so its meta-data, although potentially usable by Semantic Web agents, cannot yet be reached through Semantic Web mechanisms. The complexity of CiteSeer, that is, the range of tasks it supports, makes the transition to a Semantic-enabled service non-trivial. While human users tend to perceive CiteSeer as a single well-integrated service, we believe that from a machine perspective it is best seen as a collection of services, each performing a specific task. In this paper we present our approach to enabling CiteSeer on the Semantic Web so that its meta-data can be used through Semantic Web mechanisms. We first introduce an intuitive Application Programming Interface (API) to the CiteSeer software, then show that an efficient integration of CiteSeer into the Semantic Web is best achieved by independently integrating the services that comprise it. We believe the effort presented here towards the semantic integration of a complex Information Retrieval system could serve as an integration model for arbitrary systems.
Index Terms
A service-oriented architecture for digital libraries