Abstract
Flash-based Solid State Drive (SSD) has been productively shipped and deployed in large scale storage systems. However, a single flash-based SSD cannot satisfy the capacity, performance and reliability requirements of the modern storage systems that support increasingly demanding data-intensive computing applications. Applying RAID schemes to SSDs to meet these requirements, while a logical and viable solution, faces many challenges. In this article, we propose a Hybrid Parity-based Disk Array architecture (short for HPDA), which combines a group of SSDs and two hard disk drives (HDDs) to improve the performance and reliability of SSD-based storage systems. In HPDA, the SSDs (data disks) and part of one HDD (parity disk) compose a RAID4 disk array. Meanwhile, a second HDD and the free space of the parity disk are mirrored to form a RAID1-style write buffer that temporarily absorbs the small write requests and acts as a surrogate set during recovery when a disk fails. The write data is reclaimed to the data disks during the lightly loaded or idle periods of the system. Reliability analysis shows that the reliability of HPDA, in terms of MTTDL (Mean Time To Data Loss), is better than that of either pure HDD-based or SSD-based disk array. Our prototype implementation of HPDA and the performance evaluations show that HPDA significantly outperforms either HDD-based or SSD-based disk array.
- Agrawal, N., Bolosky, W. J., Douceur, J. R., and Lorch, J. R. 2007. A five-year study of file-system metadata. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 31--45. Google Scholar
Digital Library
- Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J., Manasse, M., and Panigrahy, R. 2008. Design tradeoffs for SSD performance. In Proceedings of the USENIX Annual Technical Conference (USENIX'08). USENIX, Berkeley, CA, 57--70. Google Scholar
Digital Library
- Andersen, D. G., Franklin, J., Kaminsky, M., Phanishayee, A., Tan, L., and Vasudevan, V. 2009. FAWN: A Fast Array of Wimpy Nodes. In Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP'09). 1--14. Google Scholar
Digital Library
- Balakrishnan, M., Kadav, A., Prabhakaran, V., and Malkhi, D. 2010. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the 5th ACM European Conference on Computer Systems (EuroSys'10). ACM, New York, 15--26. Google Scholar
Digital Library
- Chen, F., Koufaty, D. A., and Zhang, X. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS/Performance'09). 181--192. Google Scholar
Digital Library
- Chen, F., Luo, T., and Zhang, X. 2011. CAFTL: A Content-Aware Flash Translation Layer enhancing the lifespan of flash memory based solid state drives. In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST'11). USENIX, Berkeley, CA, 77--90. Google Scholar
Digital Library
- Chen, Y., Hsu, W., and Young, H. 2000. Logging RAID---An approach to fast, reliable, and low-cost disk arrays. In Proceedings of the 6th International Euro-Par Conference on Parallel Processing (Euro-Par'00). 1302--1311. Google Scholar
Digital Library
- Dirik, C. and Jacob, B. 2009. The performance of PC solid-state disks as a function of bandwidth, concurrency, device architecture, and system organization. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA'09). Google Scholar
Digital Library
- Elerath, J. G. and Pecht, M. 2007. Enhanced reliability modeling of raid storage systems. In Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07). 175--184. Google Scholar
Digital Library
- Golding, R., Bosch, P., and Staelin, C. 1995. Idleness is not sloth. In Proceedings of the USENIX Technical Conference (USENIX'95). USENIX, Berkeley, CA, 201--212. Google Scholar
Digital Library
- Greenan, K. M., Long, D. D. E., Miller, E. L., Schwarz, T. J. E., and Wildani, A. 2009. Building flexible, fault-tolerant flash-based storage systems. In Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep'09).Google Scholar
- Greenan, K. M., Plank, J. S., and Wylie, J. J. 2010. Mean time to meaningless: MTTDL, Markov models, and storage system reliability. In Proceedings of the Workshop on Hot Topics in Storage and File Systems (HotStorage'10). USENIX, Berkeley, CA. Google Scholar
Digital Library
- Gupta, A., Kim, Y., and Urgaonkar, B. 2009. DFTL: A flash translation layer employing demand-based selective caching of page-level address mappings. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'09). 229--240. Google Scholar
Digital Library
- Gupta, A., Pisolkar, R., Urgaonkar, B., and Sivasubramaniam, A. 2011. Leveraging value locality in optimizing NAND flash-based SSDs. In Proceedings of the 9th USENIX Conference on File and Storage Technologies (FAST'11). USENIX, Berkeley, CA, 91--104. Google Scholar
Digital Library
- Hu, Y. and Yang, Q. 1996. DCD---Disk caching disk: A new approach for boosting I/O performance. In Proceedings of the 23rd Annual International Symposium on Computer Architecture (ISCA'96). 169--178. Google Scholar
Digital Library
- Hu, Y., Yang, Q., and Nightingale, T. 1999. RAPID-Cache---A Reliable and Inexpensive Write Cache for Disk I/O Systems. In Proceedings of the 5th International Symposium on High Performance Computer Architecture (HPCA'99). 204--213. Google Scholar
Digital Library
- Im, S. and Shin, D. 2011. Flash-Aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Trans. Comput. 60, 1, 80--92. Google Scholar
Digital Library
- Intel Technical Report. 1998. Understanding the flash translation layer (FTL) specification.Google Scholar
- IOmeter. http://sourceforge.net/projects/iometer.Google Scholar
- Kadav, A., Balakrishnan, M., Prabhakaran, V., and Malkhi, D. 2009. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the Workshop on Hot Topics in Storage and File Systems (HotStorage'09).Google Scholar
- Kim, H. and Ahn, S. 2008. BPLRU: A buffer management scheme for improving random writes in flash storage. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST'08). USENIX, Berkeley, CA, 239--252. Google Scholar
Digital Library
- Kim, Y., Gupta, A., and Urgaonkar, B. 2008. MixedStore: An enterprise-scale storage system combining solid-state and hard disk drives. Tech. rep., Department of Computer Science and Engineering, The Pennsylvania State University.Google Scholar
- Kim, Y., Oral, S., Shipman, G., Lee, J., Dillow, D., and Wang, F. 2011. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. In Proceedings of the IEEE Symposium on Massive Storage Systems and Technologies (MSST'11). IEEE. Google Scholar
Digital Library
- Koltsidas, I. and Viglas, S. D. 2008. Flashing up the storage layer. In Proceedings of the 34th International Conference on Very Large Data Bases (VLDB'08). 514--525. Google Scholar
Digital Library
- Mao, B., Feng, D., Jiang, H., Wu, S., Chen, J., and Zeng, L. 2008. GRAID: A green RAID storage architecture with improved energy efficiency and reliability. In Proceedings of the 16th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'08). 113--120.Google Scholar
- Menon, J. 1995. A performance comparison of RAID-5 and log-structured arrays. In Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC'95). ACM, New York. Google Scholar
Digital Library
- Mi, N., Casale, G., Cherkasova, L., and Smirni, E. 2008. Burstiness in multi-tier applications: Symptoms, causes, and new models. In Proceedings of the 9th ACM/IFIP/USENIX International Middleware Conference (Middleware'08). ACM, New York, 265--286. Google Scholar
Digital Library
- Mogi, K. and Kitsuregawa, M. 1996. Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'96). ACM, New York, 183--194. Google Scholar
Digital Library
- Narayanan, D., Thereska, E., Donnelly, A., Elnikety, S., and Rowstron, A. 2009. Migrating server storage to SSDs: Analysis of tradeoffs. In Proceedings of the 4th European Conference on Computer Systems (EuroSys'09). Google Scholar
Digital Library
- OLTP Trace. http://traces.cs.umass.edu/index.php/Storage/Storage.Google Scholar
- Patterson, D., Gibson, G., and Katz, R. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'88). ACM, New York, 109--116. Google Scholar
Digital Library
- Pâris, J., Amer, A., and Long, D. D. E. 2009. Using storage class memories to increase the reliability of two-dimensional RAID arrays. In Proceedings of the 17th Annual Meeting of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'09). 1--8.Google Scholar
- Riska, A. and Riedel, E. 2006. Disk drive level workload characterization. In Proceedings of the USENIX Annual Technical Conference (USENIX'06). USENIX, Berkeley, CA, 97--102. Google Scholar
Digital Library
- Rosenblum, M. and Ousterhout, J. K. 1992. The design and implementation of a log-structured file system. ACM Trans. Comput. Syst. 10, 1, 26--52. Google Scholar
Digital Library
- Ruemmler, C. and Wilkes, J. 1993. UNIX disk access patterns. In Proceedings of the USENIX Winter Technical Conference (USENIX'93). USENIX, Berkeley, CA, 405--420.Google Scholar
- Samsung Report. http://news.cnet.com/8301-13924_3-9876557-64.html.Google Scholar
- SanDisk SSD. http://www.sandisk.com/.Google Scholar
- Savage, S. and Wilkes, J. 1996. AFRAID: A Frequently Redundant Array of Independent Disks. In Proceedings of the USENIX Annual Technical Conference (USENIX'96). USENIX, Berkeley, CA, 27--39. Google Scholar
Digital Library
- Schroeder, B. and Gibson, G. A. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 1--16. Google Scholar
Digital Library
- Soundararajan, G., Prabhakaran, V., Balakrishnan, M., and Wobber, T. 2010. Extending SSD lifetimes with disk-based write caches. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST'10). USENIX, Berkeley, CA, 101--114. Google Scholar
Digital Library
- SSSI, S. http://www.snia.org/forums/sssi/.Google Scholar
- Stodolsky, D., Gibson, G., and Holland, M. 1993. Parity logging overcoming the small write problem in redundant disk arrays. In Proceedings of the 20th Annual International Symposium on Computer Architecture (ISCA'93). 64--75. Google Scholar
Digital Library
- Storer, M. W., Greenan, K. M., Miller, E. L., and Voruganti, K. 2008. Pergamum: Replacing tape with energy efficient, reliable, disk-based archival storage. In Proceedings of the 6th USENIX Conference on File and Storage Technologies (FAST'08). USENIX, Berkeley, CA, 1--16. Google Scholar
Digital Library
- Tian, L., Feng, D., Jiang, H., Zhou, K., Zeng, L., Chen, J., Wang, Z., and Song, Z. 2007. PRO: A popularity-based multi-threaded reconstruction optimization for RAID-structured storage systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST'07). USENIX, Berkeley, CA, 277--290. Google Scholar
Digital Library
- Uysal, M., Merchantand, A., and Alvarez, G. A. 2003. Using MEMS-based storage in disk arrays. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST'03). USENIX, Berkeley, CA, 89--102. Google Scholar
Digital Library
- Wilkes, J., Golding, R., Staelin, C., and Sullivan, T. 1995. The HP AutoRAID hierarchical storage system. Oper. Syst. Rev. 29, 5, 96--108. Google Scholar
Digital Library
- Wu, S., Jiang, H., Feng, D., Tian, L., and Mao, B. 2009. WorkOut: I/O workload outsourcing for boosting RAID reconstruction performance. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST'09). USENIX, Berkeley, CA, 239--252. Google Scholar
Digital Library
- Xie, T. and Sun, Y. 2010. Dynamic data reallocation in hybrid disk arrays. IEEE Trans. Parall. Distrib. Syst. 21, 9, 1330--1341. Google Scholar
Digital Library
- Zhu, Q., Chen, Z., Tan, L., Zhou, Y., Keeton, K., and Wilkes, J. 2005. Hibernator: Helping disk arrays sleep through the winter. In Proceedings of the ACM SIGOPS 20th Symposium on Operating Systems Principles (SOSP'05). ACM, New York, 177--190. Google Scholar
Digital Library
Index Terms
HPDA: A hybrid parity-based disk array for enhanced performance and reliability
Recommendations
Differential RAID: Rethinking RAID for SSD reliability
SSDs exhibit very different failure characteristics compared to hard drives. In particular, the bit error rate (BER) of an SSD climbs as it receives more writes. As a result, RAID arrays composed from SSDs are subject to correlated failures. By ...
An empirical study of redundant array of independent solid-state drives (RAIS)
Solid-state drives (SSD) are popular storage media devices alongside magnetic hard disk drives (HDD). SSD flash chips are packaged in HDD form factors and SSDs are compatible with regular HDD device drivers and I/O buses. This compatibility allows easy ...
WeLe-RAID: a SSD-based RAID for system endurance and performance
NPC'11: Proceedings of the 8th IFIP international conference on Network and parallel computingDue to the limited erasure/program cycles of flash memory, flashbased SSDs need to prolong their life time using wear-leveling mechanism to meet their advertised capacity all the time. However, there is no wear-leveling mechanism among SSDs in RAID ...






Comments