10.1145/1141753.1141821acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

An architecture for the aggregation and analysis of scholarly usage data

Online:11 June 2006Publication History

ABSTRACT

Although recording of usage data is common in scholarly information services, its exploitation for the creation of value-added services remains limited due to concerns regarding, among others, user privacy, data validity, and the lack of accepted standards for the representation, sharing and aggregation of usage data. This paper presents a technical, standards-based architecture for sharing usage information, which we have designed and implemented. In this architecture, OpenURL-compliant linking servers aggregate usage information of a specific user community as it navigates the distributed information environment that it has access to. This usage information is made OAI-PMH harvestable so that usage information exposed by many linking servers can be aggregated to facilitate the creation of value-added services with a reach beyond that of a single community or a single information service. This paper also discusses issues that were encountered when implementing the proposed approach, and it presents preliminary results obtained from analyzing a usage data set containing about 3,500,000 requests aggregated by a federation of linking servers at the California State University system over a 20 month period.

References

  1. Chris Anderson. The long tail. Wired, 12(10), 2005.Google ScholarGoogle Scholar
  2. Monica Bianchini, Marco Gori, and Franco Scarselli. Inside pagerank. ACM Trans. Inter. Tech., 5(1):92--128, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Johan Bollen, Herbert Van de Sompel, Joan Smith, and Rick Luce. Toward alternative metrics of journal impact: a comparison of download and citation data. Information Processing and Management, 41:1419--1440, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Johan Bollen and Rick Luce. Evaluation of digital library impact and user communities by analysis of usage patterns. D-Lib Magazine, 8(6), 2002.Google ScholarGoogle ScholarCross RefCross Ref
  5. Johan Bollen, Rick Luce, Somesekhar Vemulapalli, and Weining Xu. Detecting research trends in digital library readership. In Proceedings of the 7th European Conference on Digital Libraries (LNCS 2769), pages 24--28, Trondheim, Norway, August 18 2003. Springer-Verlag.Google ScholarGoogle ScholarCross RefCross Ref
  6. Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1-7):107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. E. H. Chi, J. Pitkow, J. Mackinlay, P. Pirolli, R. Gosslweiler, and S. K. Card. Visualizing the evolution of web ecologies. In Conference on Human Factors in Computing Systems (CHI 98), volume 1998, April 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. J. Darmoni, F. Roussel, J. Benichou, B. Thirion, and N. Pinhas. Reading factor: a new bibliometric criterion for managing digital libraries. Journal of the Medical Library Association, 90(3):323--327, 2002.Google ScholarGoogle Scholar
  9. Herbert Van de Sompel and Oren Beit Arie. Open linking in the scholarly information environment using the OpenURL framework. D-Lib Magazine, 7(3), 2001.Google ScholarGoogle Scholar
  10. Herbert Van de Sompel, Jeffrey A. Young, and Thomas B. Hickey. Using the OAI-PMH .. Differently. D-Lib Magazine, 9(7-8), July 2003.Google ScholarGoogle Scholar
  11. Marcos Andre Goncalves, Ming Luo, Rao Shen, Mir Farooq Ali, and Edward A. Fox. An XML log standard and tool for digital library logging analysis. In M. Agosti and C. Thanos, editors, ECDL 2002: LNCS 2458, pages 129--143, Berlin, September 2002. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. San Yih Hwang, Wen Chiang Hsiung, and Wan Shiou Yang. A prototype WWW literature recommendation system for digital libraries. Online Information Review, 27(3):169--182, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  13. I. T. Jolliffe. Principal Component Analysis. Springer Verlag, New York, 2002.Google ScholarGoogle Scholar
  14. Jon Kleinberg. Bursty and hierarchical structure in streams. In KDD '02: Proc. of the 8th ACM SIGKDD, pages 91--101. ACM Press, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Kothari, P. Mittal, V. Jain, and M. Mohania. On using page co-occurrences for computing clickstream similarity. In Third SIAM International Conference on Data Mining, pages 154--165, San Francisco, CA, May 2003.Google ScholarGoogle Scholar
  16. Bamshad Mobasher, Honghua Dai, Tao Luo, and Miki Nakagawa. Effective personalization based on association rule discovery from web usage data. In WIDM '01: Proceedings of the 3rd international Workshop on Web Information and Data Management, pages 9--15. ACM Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Badrul Sarwar, George Karypis, Joseph Konstan, and John Reidl. Item-based collaborative filtering recommendation algorithms. In Proc. of the 10th Int. Conf. on WWW, pages 285--295. ACM Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An architecture for the aggregation and analysis of scholarly usage data

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!