skip to main content
research-article

Cumulus: Filesystem backup to the cloud

Published:14 December 2009Publication History
Skip Abstract Section

Abstract

Cumulus is a system for efficiently implementing filesystem backups over the Internet, specifically designed under a thin cloud assumption—that the remote datacenter storing the backups does not provide any special backup services, but only a least-common-denominator storage interface. Cumulus aggregates data from small files for storage and uses LFS-inspired segment cleaning to maintain storage efficiency. While Cumulus can use virtually any storage service, we show its efficiency is comparable to integrated approaches.

References

  1. Agrawal, N., Bolosky, W. J., Douceur, J. R., and Lorch, J. R. 2007. A five-year study of file-system metadata. ACM Trans. Stor. 3, 3, 9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amazon Web Services. 2009. Amazon Simple Storage Service. http://aws.amazon.com/s3/.Google ScholarGoogle Scholar
  3. boto 2009. boto: Python interface to Amazon Web Services. http://code.google.com/p/boto/.Google ScholarGoogle Scholar
  4. Cox, L. P., Murray, C. D., and Noble, B. D. 2002. Pastiche: Making backup cheap and easy. In Proceedings of the 5th Symposium on Operating Systems Design and Implementation (OSDI). USENIX, 285--298. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Douceur, J. R., Adya, A., Bolosky, W. J., Simon, D., and Theimer, M. 2002. Reclaiming space from duplicate files in a serverless distributed file system. Tech. rep. MSR-TR-2002-30.Google ScholarGoogle Scholar
  6. Escoto, B. 2009. rdiff-backup. http://www.nongnu.org/rdiff-backup/.Google ScholarGoogle Scholar
  7. Escoto, B. and Loafman, K. 2009. Duplicity. http://duplicity.nongnu.org/.Google ScholarGoogle Scholar
  8. Fitzpatrick, B. 2009. Brackup. http://code.google.com/p/brackup/, http://brad.livejournal.com/tag/brackup.Google ScholarGoogle Scholar
  9. fuse 2009. FUSE: Filesystem in userspace. http://fuse.sourceforge.net/.Google ScholarGoogle Scholar
  10. Garnaat, M. 2009. Sqlite. http://www.sqlite.org/.Google ScholarGoogle Scholar
  11. Henson, V. 2003. An analysis of compare-by-hash. Proceedings of the 9th Workshop on Hot Topics in Operating Systems (HotOS IX). Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Henson, V. 2007. The code monkey's guide to cryptographic hashes for content-based addressing. http://www.linuxworld.com/news/2007/111207-hash.html.Google ScholarGoogle Scholar
  13. jungledisk 2009. Jungle disk. http://www.jungledisk.com/.Google ScholarGoogle Scholar
  14. librsync 2009. librsync. http://librsync.sourceforge.net/.Google ScholarGoogle Scholar
  15. Muthitacharoen, A., Chen, B., and Mazières, D. 2001. A low-bandwidth network file system. In Proceedings of the 18th ACM Symposium on Operating Systems Principles (SOSP). ACM, 174--187. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Preston, W. C. 2006. Backup&Recovery. O'Reilly. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Quinlan, S. and Dorward, S. 2002. Venti: a new approach to archival storage. In Proceedings of the 1st USENIX Conference on File and Storage Technologies (FAST). USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Rosenblum, M. and Ousterhout, J. K. 1992. The design and implementation of a log-structured file system. ACM Trans. Comput. Syst. 10, 1, 26--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Summers, B. and Wilson, C. 2009. Box backup. http://www.boxbackup.org/.Google ScholarGoogle Scholar
  20. Tridgell, A. 1999. Efficient algorithms for sorting and synchronization. Ph.D. thesis, Australian National University.Google ScholarGoogle Scholar
  21. Wang, J. and Hu, Y. 2002. WOLF--A novel reordering write buffer to boost the performance of log-structured file systems. In Proceedings of the 1st USENIX Conference on File and Storage Technologies (FAST). USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Weatherspoon, H., Eaton, P., Chun, B.-G., and Kubiatowicz, J. 2007. Antiquity: Exploiting a secure log for wide-area distributed storage. In Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems (EuroSys'07). ACM, New York, NY, 371--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Wheeler, D. A. 2009. SLOCCount. http://www.dwheeler.com/sloccount/.Google ScholarGoogle Scholar
  24. Zhu, B., Li, K., and Patterson, H. 2008. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST). USENIX Association, 269--282. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Cumulus: Filesystem backup to the cloud

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Storage
          ACM Transactions on Storage  Volume 5, Issue 4
          December 2009
          155 pages
          ISSN:1553-3077
          EISSN:1553-3093
          DOI:10.1145/1629080
          Issue’s Table of Contents

          Copyright © 2009 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 14 December 2009
          • Received: 1 August 2009
          • Accepted: 1 August 2009
          Published in tos Volume 5, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!