ABSTRACT
Although recording of usage data is common in scholarly information services, its exploitation for the creation of value-added services remains limited due to concerns regarding, among others, user privacy, data validity, and the lack of accepted standards for the representation, sharing and aggregation of usage data. This paper presents a technical, standards-based architecture for sharing usage information, which we have designed and implemented. In this architecture, OpenURL-compliant linking servers aggregate usage information of a specific user community as it navigates the distributed information environment that it has access to. This usage information is made OAI-PMH harvestable so that usage information exposed by many linking servers can be aggregated to facilitate the creation of value-added services with a reach beyond that of a single community or a single information service. This paper also discusses issues that were encountered when implementing the proposed approach, and it presents preliminary results obtained from analyzing a usage data set containing about 3,500,000 requests aggregated by a federation of linking servers at the California State University system over a 20 month period.
References
- Chris Anderson. The long tail. Wired, 12(10), 2005.Google Scholar
- Monica Bianchini, Marco Gori, and Franco Scarselli. Inside pagerank. ACM Trans. Inter. Tech., 5(1):92--128, 2005. Google Scholar
Digital Library
- Johan Bollen, Herbert Van de Sompel, Joan Smith, and Rick Luce. Toward alternative metrics of journal impact: a comparison of download and citation data. Information Processing and Management, 41:1419--1440, 2005. Google Scholar
Digital Library
- Johan Bollen and Rick Luce. Evaluation of digital library impact and user communities by analysis of usage patterns. D-Lib Magazine, 8(6), 2002.Google Scholar
Cross Ref
- Johan Bollen, Rick Luce, Somesekhar Vemulapalli, and Weining Xu. Detecting research trends in digital library readership. In Proceedings of the 7th European Conference on Digital Libraries (LNCS 2769), pages 24--28, Trondheim, Norway, August 18 2003. Springer-Verlag.Google Scholar
Cross Ref
- Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1-7):107--117, 1998. Google Scholar
Digital Library
- E. H. Chi, J. Pitkow, J. Mackinlay, P. Pirolli, R. Gosslweiler, and S. K. Card. Visualizing the evolution of web ecologies. In Conference on Human Factors in Computing Systems (CHI 98), volume 1998, April 1998. Google Scholar
Digital Library
- S. J. Darmoni, F. Roussel, J. Benichou, B. Thirion, and N. Pinhas. Reading factor: a new bibliometric criterion for managing digital libraries. Journal of the Medical Library Association, 90(3):323--327, 2002.Google Scholar
- Herbert Van de Sompel and Oren Beit Arie. Open linking in the scholarly information environment using the OpenURL framework. D-Lib Magazine, 7(3), 2001.Google Scholar
- Herbert Van de Sompel, Jeffrey A. Young, and Thomas B. Hickey. Using the OAI-PMH .. Differently. D-Lib Magazine, 9(7-8), July 2003.Google Scholar
- Marcos Andre Goncalves, Ming Luo, Rao Shen, Mir Farooq Ali, and Edward A. Fox. An XML log standard and tool for digital library logging analysis. In M. Agosti and C. Thanos, editors, ECDL 2002: LNCS 2458, pages 129--143, Berlin, September 2002. Springer-Verlag. Google Scholar
Digital Library
- San Yih Hwang, Wen Chiang Hsiung, and Wan Shiou Yang. A prototype WWW literature recommendation system for digital libraries. Online Information Review, 27(3):169--182, 2003.Google Scholar
Cross Ref
- I. T. Jolliffe. Principal Component Analysis. Springer Verlag, New York, 2002.Google Scholar
- Jon Kleinberg. Bursty and hierarchical structure in streams. In KDD '02: Proc. of the 8th ACM SIGKDD, pages 91--101. ACM Press, 2002. Google Scholar
Digital Library
- R. Kothari, P. Mittal, V. Jain, and M. Mohania. On using page co-occurrences for computing clickstream similarity. In Third SIAM International Conference on Data Mining, pages 154--165, San Francisco, CA, May 2003.Google Scholar
- Bamshad Mobasher, Honghua Dai, Tao Luo, and Miki Nakagawa. Effective personalization based on association rule discovery from web usage data. In WIDM '01: Proceedings of the 3rd international Workshop on Web Information and Data Management, pages 9--15. ACM Press, 2001. Google Scholar
Digital Library
- Badrul Sarwar, George Karypis, Joseph Konstan, and John Reidl. Item-based collaborative filtering recommendation algorithms. In Proc. of the 10th Int. Conf. on WWW, pages 285--295. ACM Press, 2001. Google Scholar
Digital Library
Index Terms
An architecture for the aggregation and analysis of scholarly usage data





Comments