skip to main content
research-article

HPDA: A hybrid parity-based disk array for enhanced performance and reliability

Authors Info & Claims
Published:24 February 2012Publication History
Skip Abstract Section

Abstract

Flash-based Solid State Drive (SSD) has been productively shipped and deployed in large scale storage systems. However, a single flash-based SSD cannot satisfy the capacity, performance and reliability requirements of the modern storage systems that support increasingly demanding data-intensive computing applications. Applying RAID schemes to SSDs to meet these requirements, while a logical and viable solution, faces many challenges. In this article, we propose a Hybrid Parity-based Disk Array architecture (short for HPDA), which combines a group of SSDs and two hard disk drives (HDDs) to improve the performance and reliability of SSD-based storage systems. In HPDA, the SSDs (data disks) and part of one HDD (parity disk) compose a RAID4 disk array. Meanwhile, a second HDD and the free space of the parity disk are mirrored to form a RAID1-style write buffer that temporarily absorbs the small write requests and acts as a surrogate set during recovery when a disk fails. The write data is reclaimed to the data disks during the lightly loaded or idle periods of the system. Reliability analysis shows that the reliability of HPDA, in terms of MTTDL (Mean Time To Data Loss), is better than that of either pure HDD-based or SSD-based disk array. Our prototype implementation of HPDA and the performance evaluations show that HPDA significantly outperforms either HDD-based or SSD-based disk array.

References

  1. Agrawal, N., Bolosky, W. J., Douceur, J. R., and Lorch, J. R. 2007. A five-year study of file-system metadata. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 31--45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J., Manasse, M., and Panigrahy, R. 2008. Design tradeoffs for SSD performance. In Proceedings of the USENIX Annual Technical Conference (USENIX'08). USENIX, Berkeley, CA, 57--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Andersen, D. G., Franklin, J., Kaminsky, M., Phanishayee, A., Tan, L., and Vasudevan, V. 2009. FAWN: A Fast Array of Wimpy Nodes. In Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP'09). 1--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Balakrishnan, M., Kadav, A., Prabhakaran, V., and Malkhi, D. 2010. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the 5th ACM European Conference on Computer Systems (EuroSys'10). ACM, New York, 15--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chen, F., Koufaty, D. A., and Zhang, X. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS/Performance'09). 181--192. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chen, F., Luo, T., and Zhang, X. 2011. CAFTL: A Content-Aware Flash Translation Layer enhancing the lifespan of flash memory based solid state drives. In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST'11). USENIX, Berkeley, CA, 77--90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Chen, Y., Hsu, W., and Young, H. 2000. Logging RAID---An approach to fast, reliable, and low-cost disk arrays. In Proceedings of the 6th International Euro-Par Conference on Parallel Processing (Euro-Par'00). 1302--1311. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Dirik, C. and Jacob, B. 2009. The performance of PC solid-state disks as a function of bandwidth, concurrency, device architecture, and system organization. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA'09). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Elerath, J. G. and Pecht, M. 2007. Enhanced reliability modeling of raid storage systems. In Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07). 175--184. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Golding, R., Bosch, P., and Staelin, C. 1995. Idleness is not sloth. In Proceedings of the USENIX Technical Conference (USENIX'95). USENIX, Berkeley, CA, 201--212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Greenan, K. M., Long, D. D. E., Miller, E. L., Schwarz, T. J. E., and Wildani, A. 2009. Building flexible, fault-tolerant flash-based storage systems. In Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep'09).Google ScholarGoogle Scholar
  12. Greenan, K. M., Plank, J. S., and Wylie, J. J. 2010. Mean time to meaningless: MTTDL, Markov models, and storage system reliability. In Proceedings of the Workshop on Hot Topics in Storage and File Systems (HotStorage'10). USENIX, Berkeley, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Gupta, A., Kim, Y., and Urgaonkar, B. 2009. DFTL: A flash translation layer employing demand-based selective caching of page-level address mappings. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'09). 229--240. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Gupta, A., Pisolkar, R., Urgaonkar, B., and Sivasubramaniam, A. 2011. Leveraging value locality in optimizing NAND flash-based SSDs. In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST'11). USENIX, Berkeley, CA, 91--104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Hu, Y. and Yang, Q. 1996. DCD---Disk caching disk: A new approach for boosting I/O performance. In Proceedings of the 23rd Annual International Symposium on Computer Architecture (ISCA'96). 169--178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Hu, Y., Yang, Q., and Nightingale, T. 1999. RAPID-Cache---A Reliable and Inexpensive Write Cache for Disk I/O Systems. In Proceedings of the 5th International Symposium on High Performance Computer Architecture (HPCA'99). 204--213. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Im, S. and Shin, D. 2011. Flash-Aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Trans. Comput. 60, 1, 80--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Intel Technical Report. 1998. Understanding the flash translation layer (FTL) specification.Google ScholarGoogle Scholar
  19. IOmeter. http://sourceforge.net/projects/iometer.Google ScholarGoogle Scholar
  20. Kadav, A., Balakrishnan, M., Prabhakaran, V., and Malkhi, D. 2009. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the Workshop on Hot Topics in Storage and File Systems (HotStorage'09).Google ScholarGoogle Scholar
  21. Kim, H. and Ahn, S. 2008. BPLRU: A buffer management scheme for improving random writes in flash storage. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST'08). USENIX, Berkeley, CA, 239--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Kim, Y., Gupta, A., and Urgaonkar, B. 2008. MixedStore: An enterprise-scale storage system combining solid-state and hard disk drives. Tech. rep., Department of Computer Science and Engineering, The Pennsylvania State University.Google ScholarGoogle Scholar
  23. Kim, Y., Oral, S., Shipman, G., Lee, J., Dillow, D., and Wang, F. 2011. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. In Proceedings of the IEEE Symposium on Massive Storage Systems and Technologies (MSST'11). IEEE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Koltsidas, I. and Viglas, S. D. 2008. Flashing up the storage layer. In Proceedings of the 34th International Conference on Very Large Data Bases (VLDB'08). 514--525. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mao, B., Feng, D., Jiang, H., Wu, S., Chen, J., and Zeng, L. 2008. GRAID: A green RAID storage architecture with improved energy efficiency and reliability. In Proceedings of the 16th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'08). 113--120.Google ScholarGoogle Scholar
  26. Menon, J. 1995. A performance comparison of RAID-5 and log-structured arrays. In Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC'95). ACM, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Mi, N., Casale, G., Cherkasova, L., and Smirni, E. 2008. Burstiness in multi-tier applications: Symptoms, causes, and new models. In Proceedings of the 9th ACM/IFIP/USENIX International Middleware Conference (Middleware'08). ACM, New York, 265--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Mogi, K. and Kitsuregawa, M. 1996. Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'96). ACM, New York, 183--194. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Narayanan, D., Thereska, E., Donnelly, A., Elnikety, S., and Rowstron, A. 2009. Migrating server storage to SSDs: Analysis of tradeoffs. In Proceedings of the 4th European Conference on Computer Systems (EuroSys'09). Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. OLTP Trace. http://traces.cs.umass.edu/index.php/Storage/Storage.Google ScholarGoogle Scholar
  31. Patterson, D., Gibson, G., and Katz, R. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'88). ACM, New York, 109--116. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Pâris, J., Amer, A., and Long, D. D. E. 2009. Using storage class memories to increase the reliability of two-dimensional RAID arrays. In Proceedings of the 17th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'09). 1--8.Google ScholarGoogle Scholar
  33. Riska, A. and Riedel, E. 2006. Disk drive level workload characterization. In Proceedings of the USENIX Annual Technical Conference (USENIX'06). USENIX, Berkeley, CA, 97--102. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Rosenblum, M. and Ousterhout, J. K. 1992. The design and implementation of a log-structured file system. ACM Trans. Comput. Syst. 10, 1, 26--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Ruemmler, C. and Wilkes, J. 1993. UNIX disk access patterns. In Proceedings of the USENIX Winter Technical Conference (USENIX'93). USENIX, Berkeley, CA, 405--420.Google ScholarGoogle Scholar
  36. Samsung Report. http://news.cnet.com/8301-13924_3-9876557-64.html.Google ScholarGoogle Scholar
  37. SanDisk SSD. http://www.sandisk.com/.Google ScholarGoogle Scholar
  38. Savage, S. and Wilkes, J. 1996. AFRAID: A Frequently Redundant Array of Independent Disks. In Proceedings of the USENIX Annual Technical Conference (USENIX'96). USENIX, Berkeley, CA, 27--39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Schroeder, B. and Gibson, G. A. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 1--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Soundararajan, G., Prabhakaran, V., Balakrishnan, M., and Wobber, T. 2010. Extending SSD lifetimes with disk-based write caches. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST'10). USENIX, Berkeley, CA, 101--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. SSSI, S. http://www.snia.org/forums/sssi/.Google ScholarGoogle Scholar
  42. Stodolsky, D., Gibson, G., and Holland, M. 1993. Parity logging overcoming the small write problem in redundant disk arrays. In Proceedings of the 20th Annual International Symposium on Computer Architecture (ISCA'93). 64--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Storer, M. W., Greenan, K. M., Miller, E. L., and Voruganti, K. 2008. Pergamum: Replacing tape with energy efficient, reliable, disk-based archival storage. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST'08). USENIX, Berkeley, CA, 1--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Tian, L., Feng, D., Jiang, H., Zhou, K., Zeng, L., Chen, J., Wang, Z., and Song, Z. 2007. PRO: A popularity-based multi-threaded reconstruction optimization for RAID-structured storage systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 277--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Uysal, M., Merchantand, A., and Alvarez, G. A. 2003. Using MEMS-based storage in disk arrays. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST'03). USENIX, Berkeley, CA, 89--102. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Wilkes, J., Golding, R., Staelin, C., and Sullivan, T. 1995. The HP AutoRAID hierarchical storage system. Oper. Syst. Rev. 29, 5, 96--108. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Wu, S., Jiang, H., Feng, D., Tian, L., and Mao, B. 2009. WorkOut: I/O workload outsourcing for boosting RAID reconstruction performance. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST'09). USENIX, Berkeley, CA, 239--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Xie, T. and Sun, Y. 2010. Dynamic data reallocation in hybrid disk arrays. IEEE Trans. Parall. Distrib. Syst. 21, 9, 1330--1341. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Zhu, Q., Chen, Z., Tan, L., Zhou, Y., Keeton, K., and Wilkes, J. 2005. Hibernator: Helping disk arrays sleep through the winter. In Proceedings of the ACM SIGOPS 20th Symposium on Operating Systems Principles (SOSP'05). ACM, New York, 177--190. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. HPDA: A hybrid parity-based disk array for enhanced performance and reliability

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in

                Full Access

                • Published in

                  cover image ACM Transactions on Storage
                  ACM Transactions on Storage  Volume 8, Issue 1
                  February 2012
                  92 pages
                  ISSN:1553-3077
                  EISSN:1553-3093
                  DOI:10.1145/2093139
                  Issue’s Table of Contents

                  Copyright © 2012 ACM

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 24 February 2012
                  • Accepted: 1 August 2011
                  • Revised: 1 July 2011
                  • Received: 1 December 2010
                  Published in tos Volume 8, Issue 1

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article
                  • Research
                  • Refereed

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader
                About Cookies On This Site

                We use cookies to ensure that we give you the best experience on our website.

                Learn more

                Got it!