DOI: 10.1145/1378889.1378912
research-article

A data model and architecture for long-term preservation

Online: 16 June 2008

ABSTRACT

The National Geospatial Digital Archive, one of eight initial projects funded under the Library of Congress's NDIIPP program, has been researching how geospatial data can be preserved on a national scale and be made available to future generations. In this paper we describe an archive architecture that provides a minimal approach to the long-term preservation of digital objects based on co-archiving of object semantics, uniform representation of objects and semantics, explicit storage of all objects and semantics as files, and abstraction of the underlying storage system. This architecture ensures that digital objects can be easily migrated from archive to archive over time and that the objects can, in principle, be made usable again at any point in the future; its primary benefit is that it serves as a fallback strategy against, and as a foundation for, more sophisticated (and costly) preservation strategies. We describe an implementation of this architecture in a prototype archive running at UCSB that also incorporates a suite of ingest and access components.
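
The four principles named in the abstract can be illustrated with a small Python sketch. It is not the NGDA implementation: the names `Storage`, `DirectoryStorage`, `store_object`, and `migrate`, as well as the `manifest.json` layout, are hypothetical and chosen only for illustration. The sketch assumes that a format description is archived as an ordinary object, that every object is a set of plain files plus a manifest, and that the archive reaches its storage only through a narrow interface.

```python
import json
import tempfile
from pathlib import Path
from typing import Dict, Iterator, List, Optional, Protocol


class Storage(Protocol):
    """Hypothetical storage abstraction: named byte strings and nothing more."""

    def put(self, key: str, data: bytes) -> None: ...
    def get(self, key: str) -> bytes: ...
    def keys(self) -> Iterator[str]: ...


class DirectoryStorage:
    """One possible backend: each key is a plain file under a root directory."""

    def __init__(self, root: Path) -> None:
        self.root = root

    def put(self, key: str, data: bytes) -> None:
        path = self.root / key
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(data)

    def get(self, key: str) -> bytes:
        return (self.root / key).read_bytes()

    def keys(self) -> Iterator[str]:
        for path in self.root.rglob("*"):
            if path.is_file():
                yield path.relative_to(self.root).as_posix()


def store_object(storage: Storage, object_id: str, files: Dict[str, bytes],
                 semantics: Optional[List[str]] = None) -> None:
    """Write an object's files plus a manifest that names the archived
    objects describing its semantics (for example, a format description)."""
    for name, data in files.items():
        storage.put(f"{object_id}/{name}", data)
    manifest = {"id": object_id, "files": sorted(files),
                "semantics": semantics or []}
    storage.put(f"{object_id}/manifest.json",
                json.dumps(manifest, indent=2).encode("utf-8"))


def migrate(source: Storage, target: Storage) -> None:
    """Migration is a file-by-file copy, because nothing lives outside files."""
    for key in source.keys():
        target.put(key, source.get(key))


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as a, tempfile.TemporaryDirectory() as b:
        old, new = DirectoryStorage(Path(a)), DirectoryStorage(Path(b))
        # The format description is stored in the same way as the data it explains.
        store_object(old, "format-geotiff", {"spec.txt": b"(format description)"})
        store_object(old, "map-001", {"map.tif": b"(raster bytes)"},
                     semantics=["format-geotiff"])
        migrate(old, new)
        print(json.loads(new.get("map-001/manifest.json"))["semantics"])
```

Because the data object and its format description share one representation and both live entirely in files, the copy in `migrate` carries the semantics along with the data, and a different backend can be substituted without touching `store_object`.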


Published in

JCDL '08: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
June 2008
490 pages
ISBN: 9781595939982
DOI: 10.1145/1378889

Copyright © 2008 ACM

Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

JCDL '08 Paper Acceptance Rate: 33 of 117 submissions, 28%
Overall Acceptance Rate: 334 of 1,195 submissions, 28%
