skip to main content
research-article

Reliability of SSDs in Enterprise Storage Systems: A Large-Scale Field Study

Published:13 January 2021Publication History
Skip Abstract Section

Abstract

This article presents the first large-scale field study of NAND-based SSDs in enterprise storage systems (in contrast to drives in distributed data center storage systems). The study is based on a very comprehensive set of field data, covering 1.6 million SSDs of a major storage vendor (NetApp). The drives comprise three different manufacturers, 18 different models, 12 different capacities, and all major flash technologies (SLC, cMLC, eMLC, 3D-TLC). The data allows us to study a large number of factors that were not studied in prior works, including the effect of firmware versions, the reliability of TLC NAND, and the correlations between drives within a RAID system. This article presents our analysis, along with a number of practical implications derived from it.

References

  1. 2020. NVM Express Specification. https://nvmexpress.org/. [Online; accessed 23-Oct-2020].Google ScholarGoogle Scholar
  2. Rajiv Agarwal and Marcus Marrow. 2010. A closed-form expression for write amplification in NAND flash. In Proceedings of the 2010 IEEE Globecom Workshops. IEEE, 1846–1850.Google ScholarGoogle ScholarCross RefCross Ref
  3. Nitin Agrawal, Vijayan Prabhakaran, Ted Wobber, John D. Davis, Mark S. Manasse, and Rina Panigrahy. 2008. Design tradeoffs for SSD performance. In Proceedings of the USENIX Annual Technical Conference (ATC’08), Vol. 57. USENIX Association.Google ScholarGoogle Scholar
  4. Jacob Alter, Ji Xue, Alma Dimnaku, and Evgenia Smirni. 2019. SSD failures in the field: Symptoms, causes, and prediction models. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1–14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Alan T. Arnholt and Ben Evans. 2017. BSDA: Basic Statistics and Data Analysis. https://www.rdocumentation.org/packages/BSDA/versions/1.2.0 R package.Google ScholarGoogle Scholar
  6. Lakshmi N. Bairavasundaram, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Garth R. Goodson, and Bianca Schroeder. 2008. An analysis of data corruption in the storage stack. ACM Transactions on Storage (TOS) 4, 3 (2008), 1–28.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Lakshmi N. Bairavasundaram, Garth R. Goodson, Shankar Pasupathy, and Jiri Schindler. 2007. An analysis of latent sector errors in disk drives. In Proceedings of the 2007 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’07). 289–300.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hanmant P. Belgal, Nick Righos, Ivan Kalastirsky, Jeff J. Peterson, Robert Shiner, and Neal Mielke. 2002. A new reliability model for post-cycling charge retention of flash memories. In Proceedings of the 40th Annual International Reliability Physics Symposium. IEEE, 7–20.Google ScholarGoogle ScholarCross RefCross Ref
  9. Matias Bjørling. 2019. From open-channel SSDs to zoned namespaces. In Proceedings of the Linux Storage and Filesystems Conference (Vault 19).Google ScholarGoogle Scholar
  10. Simona Boboila and Peter Desnoyers. 2010. Write endurance in flash drives: Measurements and analysis. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST’10). USENIX Association, 115–128.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Adam Brand, Ken Wu, Sam Pan, and David Chin. 1993. Novel read disturb failure mechanism induced by FLASH cycling. In Proceedings of the 31st Annual International Reliability Physics Symposium. IEEE, 127–132.Google ScholarGoogle ScholarCross RefCross Ref
  12. Yu Cai, Erich F. Haratsch, Onur Mutlu, and Ken Mai. 2012. Error patterns in MLC NAND flash memory: Measurement, Characterization, and Analysis. In Proceedings of the Conference on Design, Automation and Test in Europe. EDA Consortium, 521–526.Google ScholarGoogle Scholar
  13. Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, and Onur Mutlu. 2015. Data retention in MLC NAND flash memory: Characterization, optimization, and recovery. In Proceedings of the 21st International Symposium on High Performance Computer Architecture (HPCA). IEEE, 551–563.Google ScholarGoogle ScholarCross RefCross Ref
  14. Yu Cai, Onur Mutlu, Erich F. Haratsch, and Ken Mai. 2013. Program interference in MLC NAND flash memory: Characterization, modeling, and mitigation. In Proceedings of the 31st International Conference on Computer Design (ICCD). IEEE, 123–130.Google ScholarGoogle ScholarCross RefCross Ref
  15. Yu Cai, Gulay Yalcin, Onur Mutlu, Erich F. Haratsch, Adrian Cristal, Osman S. Unsal, and Ken Mai. 2012. Flash correct-and-refresh: Retention-aware error management for increased Flash memory lifetime. In Proceedings of the 30th International Conference on Computer Design (ICCD). IEEE, 94–101.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Paolo Cappelletti, Roberto Bez, Daniele Cantarelli, and Lorenzo Fratin. 1994. Failure mechanisms of Flash cell in program/erase cycling. In Proceedings of the IEEE International Electron Devices Meeting. IEEE, 291–294.Google ScholarGoogle ScholarCross RefCross Ref
  17. Chandranil Chakraborttii and Heiner Litz. 2020. Improving the accuracy, adaptability, and interpretability of SSD failure prediction models. In Proceedings of the 11th ACM Symposium on Cloud Computing. 120–133.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Feng Chen, David A. Koufaty, and Xiaodong Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. ACM SIGMETRICS Performance Evaluation Review 37, 1 (2009), 181–192.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Feng Chen, Rubao Lee, and Xiaodong Zhang. 2011. Essential roles of exploiting internal parallelism of Flash memory based solid state drives in high-speed data processing. In Proceedings of the 2011 IEEE 17th International Symposium on High Performance Computer Architecture. IEEE, 266–277.Google ScholarGoogle ScholarCross RefCross Ref
  20. Peter Corbett, Bob English, Atul Goel, Tomislav Grcanac, Steven Kleiman, James Leong, and Sunitha Sankar. 2004. Row-diagonal parity for double disk failure correction. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST’05). USENIX Association, 1–14.Google ScholarGoogle Scholar
  21. Jeffrey Dean and Luiz André Barroso. 2013. The tail at scale. Commun. ACM 56, 2 (2013), 74–80.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Robin Degraeve, F. Schuler, Ben Kaczer, Martino Lorenzini, Dirk Wellekens, Paul Hendrickx, Michiel van Duuren, G. J. M. Dormans, Jan Van Houdt, L. Haspeslagh, Guido Groeseneken, and Georg Tempel. 2004. Analytical percolation model for predicting anomalous charge loss in Flash memories. IEEE Transactions on Electron Devices 51, 9 (2004), 1392–1400.Google ScholarGoogle ScholarCross RefCross Ref
  23. Peter Desnoyers. 2014. Analytic models of SSD write performance. ACM Transactions on Storage (TOS) 10, 2 (2014), 1–25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. John K. Edwards, Daniel Ellard, Craig Everhart, Robert Fair, Eric Hamilton, Andy Kahn, Arkady Kanevsky, James Lentini, Ashish Prakash, Keith A. Smith, and Edward Zayas. 2008. FlexVol: Flexible, efficient file volume virtualization in WAFL. In Proceedings of the USENIX 2008 Annual Technical Conference (ATC’08). USENIX Association, 129–142.Google ScholarGoogle Scholar
  25. Nosayba El-Sayed, Ioan A. Stefanovici, George Amvrosiadis, Andy A. Hwang, and Bianca Schroeder. 2012. Temperature management in data centers: Why some (might) like it hot. In Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems. 163–174.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Jon G. Elerath. 2000. AFR: Problems of definition, calculation and measurement in a commercial environment. In Proceedings of the Annual Reliability and Maintainability Symposium. 2000 Proceedings. International Symposium on Product Quality and Integrity (Cat. No. 00CH37055). IEEE, 71–76.Google ScholarGoogle ScholarCross RefCross Ref
  27. Jon G. Elerath. 2000. Specifying reliability in the disk drive industry: No more MTBF’s. In Proceedings of the Annual Reliability and Maintainability Symposium. 2000 Proceedings. International Symposium on Product Quality and Integrity (Cat. No. 00CH37055). IEEE, 194–199.Google ScholarGoogle ScholarCross RefCross Ref
  28. Atul Goel and Peter Corbett. 2012. RAID triple parity. ACM SIGOPS Operating Systems Review 46, 3 (2012), 41–49.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. L. M. Grupp, A. M. Caulfield, J. Coburn, S. Swanson, E. Yaakobi, P. H. Siegel, and J. K. Wolf. 2009. Characterizing Flash memory: Anomalies, observations, and applications. In Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). 24–33.Google ScholarGoogle Scholar
  30. Laura M. Grupp, John D. Davis, and Steven Swanson. 2012. The bleak future of NAND flash memory. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12), Vol. 7. USENIX Association, 10–2.Google ScholarGoogle Scholar
  31. Haryadi S. Gunawi, Riza O. Suminto, Russell Sears, Casey Golliher, Swaminathan Sundararaman, Xing Lin, Tim Emami, Weiguang Sheng, Nematollah Bidokhti, Caitie McCaffrey, Deepthi Srinivasan, Biswaranjan Panda, Andrew Baptist, Gary Grider, Parks M. Fields, Kevin Harms, Robert B. Ross, Andree Jacobson, Robert Ricci, Kirk Webb, Peter Alvaro, Birali H. Runesha, Mingzhe Hao, and Huaicheng Li. 2018. Fail-slow at scale: Evidence of hardware performance faults in large production systems. ACM Transactions on Storage (TOS) 14, 3 (2018), 1–26.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Mingzhe Hao, Gokul Soundararajan, Deepak Kenchammana-Hosekote, Andrew A. Chien, and Haryadi S. Gunawi. 2016. The tail at store: A revelation from millions of hours of disk and SSD deployments. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). USENIX Association, 263–276.Google ScholarGoogle Scholar
  33. Dave Hitz, James Lau, and Michael A Malcolm. 1994. File system design for an NFS file server appliance. In USENIX Winter, Vol. 94.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Xiao-Yu Hu, Evangelos Eleftheriou, Robert Haas, Ilias Iliadis, and Roman Pletka. 2009. Write amplification analysis in flash-based solid state drives. In Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference. 1–9.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. S. Hur, J. Lee, M. Park, J. Choi, K. Park, K. Kim, and K. Kim. 2004. Effective program inhibition beyond 90nm NAND flash memories. Proc. NVSM (2004), 44–45.Google ScholarGoogle Scholar
  36. Seok Jin Joo, Hea Jong Yang, Keum Hwan Noh, Hee Gee Lee, Won Sik Woo, Joo Yeop Lee, Min Kyu Lee, Won Yol Choi, Kyoung Pil Hwang, Hyoung Seok Kim, Sa Yong Sim, Sook Kyoung Kim, Hee Hyoun Chang, and Gi Hyoun Bae. 2006. Abnormal disturbance mechanism of sub-100 nm NAND flash memory. Japanese Journal of Applied Physics 45, 8R (2006), 6210.Google ScholarGoogle ScholarCross RefCross Ref
  37. Myoungsoo Jung and Mahmut Kandemir. 2013. Revisiting widely held SSD expectations and rethinking system-level implications. In Proceedings of the 2013 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’13). 203–216.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Myoungsoo Jung, Ramya Prabhakar, and Mahmut Taylan Kandemir. 2012. Taking garbage collection overheads off the critical path in SSDs. In Proceedings of the ACM/IFIP/USENIX International Conference on Distributed Systems Platforms and Open Distributed Processing. Springer, 164–186.Google ScholarGoogle ScholarCross RefCross Ref
  39. Jeong-Uk Kang, Jeeseok Hyun, Hyunjoo Maeng, and Sangyeun Cho. 2014. The multi-streamed solid-state drive. In Proceedings of the 6th USENIX Conference on Hot Topics in Storage and File Systems (HotStorage’14). USENIX Association.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Ram Kesavan, Matthew Curtis-Maury, and Mrinal K. Bhattacharjee. 2018. Efficient search for free blocks in the WAFL file system. In Proceedings of the 47th International Conference on Parallel Processing. 1–10.Google ScholarGoogle Scholar
  41. Ram Kesavan, Jason Hennessey, Richard Jernigan, Peter Macko, Keith A. Smith, Daniel Tennant, and V. R. Bharadwaj. 2019. FlexGroup volumes: A distributed WAFL file system. In Proceedings of the USENIX Annual Technical Conference (ATC’19). USENIX Association, 135–148.Google ScholarGoogle Scholar
  42. Ram Kesavan, Harendra Kumar, and Sushrut Bhowmik. 2018. WAFL iron: Repairing live enterprise file systems. In Proceedings of the 16th USENIX Conference on File and Storage Technologies (FAST’18). USENIX Association, 33–48.Google ScholarGoogle Scholar
  43. Bryan S. Kim, Jongmoo Choi, and Sang Lyul Min. 2019. Design tradeoffs for SSD reliability. In Proceedings of the 17th USENIX Conference on File and Storage Technologies (FAST’19). USENIX Association, Boston, MA, 281–294.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Harendra Kumar, Yuvraj Patel, Ram Kesavan, and Sumith Makam. 2017. High performance metadata integrity protection in the WAFL copy-on-write file system. In Proceedings of the 15th USENIX Conference on File and Storage Technologies (FAST’17). USENIX Association, 197–212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Jae-Duk Lee, Chi-Kyung Lee, Myung-Won Lee, Han-Soo Kim, Kyu-Charn Park, and Won-Seong Lee. 2006. A new programming disturbance phenomenon in NAND flash memory by source/drain hot-electrons generated by GIDL current. In Proceedings of the 21st IEEE Non-Volatile Semiconductor Memory Workshop, 2006 (IEEE NVSMW 2006). IEEE, 31–33.Google ScholarGoogle Scholar
  46. Jun Li, Bowen Huang, Zhibing Sha, Zhigang Cai, Jianwei Liao, Balazs Gerofi, and Yutaka Ishikawa. 2020. Mitigating negative impacts of read disturb in SSDs. ACM Trans. Des. Autom. Electron. Syst. 26, 1, Article 3 (Sept. 2020), 24 pages.Google ScholarGoogle Scholar
  47. Ao Ma, Fred Douglis, Guanlin Lu, Darren Sawyer, Surendar Chandra, and Windsor Hsu. 2015. RAIDShield: Characterizing, monitoring, and proactively protecting against disk failures. In Proceedings of the 13th USENIX Conference on File and Storage Technologies (FAST’15). USENIX Association, Santa Clara, CA, 241–256.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Farzaneh Mahdisoltani, Ioan Stefanovici, and Bianca Schroeder. 2017. Proactive error prediction to improve storage system reliability. In USENIX Annual Technical Conference (ATC’17). USENIX Association, 391–402.Google ScholarGoogle Scholar
  49. Ioannis Manousakis, Sriram Sankar, Gregg McKnight, Thu D. Nguyen, and Ricardo Bianchini. 2016. Environmental conditions and disk reliability in free-cooled datacenters. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). 53–65.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Aleksander Maricq, Dmitry Duplyakin, Ivo Jimenez, Carlos Maltzahn, Ryan Stutsman, and Robert Ricci. 2018. Taming performance variability. In Proceedings of the 13th Symposium on Operating Systems Design and Implementation (OSDI’18). 409–425.Google ScholarGoogle Scholar
  51. Justin Meza, Qiang Wu, Sanjev Kumar, and Onur Mutlu. 2015. A large-scale study of flash memory failures in the field. In Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’15). 177–190.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Neal Mielke, Hanmant P. Belgal, Albert Fazio, Qingru Meng, and Nick Righos. 2006. Recovery effects in the distributed cycling of Flash memories. In Proceedings of the 44th Annual International Reliability Physics Symposium. IEEE, 29–35.Google ScholarGoogle ScholarCross RefCross Ref
  53. Neal Mielke, Todd Marquart, Ning Wu, Jeff Kessenich, Hanmant Belgal, Eric Schares, Falgun Trivedi, Evan Goodness, and Leland R. Nevill. 2008. Bit error rate in NAND flash memories. In Proceedings of the 46th Annual International Reliability Physics Symposium. IEEE, 9–19.Google ScholarGoogle Scholar
  54. Neal R. Mielke, Robert E. Frickey, Ivan Kalastirsky, Minyan Quan, Dmitry Ustinov, and Venkatesh J. Vasudevan. 2017. Reliability of solid-state drives based on NAND flash memory. Proc. IEEE 105, 9 (2017), 1725–1750.Google ScholarGoogle ScholarCross RefCross Ref
  55. Iyswarya Narayanan, Di Wang, Myeongjae Jeon, Bikash Sharma, Laura Caulfield, Anand Sivasubramaniam, Ben Cutler, Jie Liu, Badriddine Khessib, and Kushagra Vaid. 2016. SSD failures in datacenters: What? When? And Why? In Proceedings of the 9th ACM International on Systems and Storage Conference (SYSTOR’16). Article 7, 11 pages.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. NetApp Inc. Data ONTAP 9 [n.d.]. NetApp Inc. Data ONTAP 9. https://docs.netapp.com/ontap-9/index.jsp. [Online; accessed 23-Oct-2020].Google ScholarGoogle Scholar
  57. Biswaranjan Panda, Deepthi Srinivasan, Huan Ke, Karan Gupta, Vinayak Khot, and Haryadi S Gunawi. 2019. IASO: A fail-slow detection and mitigation framework for distributed storage services. In Proceedings of the USENIX Annual Technical Conference (ATC’19). USENIX Association, 47–62.Google ScholarGoogle Scholar
  58. Stan Park and Kai Shen. 2012. FIOS: A fair, efficient flash I/O scheduler. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12), Vol. 12. USENIX Association, 13–13.Google ScholarGoogle Scholar
  59. David A. Patterson, Garth Gibson, and Randy H. Katz. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data (SIGMOD’88). Association for Computing Machinery (ACM), 109–116.Google ScholarGoogle Scholar
  60. Eduardo Pinheiro, Wolf-Dietrich Weber, and Luiz André Barroso. 2007. Failure trends in a large disk drive population. In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07), Vol. 7. USENIX Association, 17–23.Google ScholarGoogle Scholar
  61. Abhishek Rajimwale, Vijayan Prabhakaran, and John D. Davis. 2009. Block management in solid-state devices. In Proceedings for the USENIX Annual Technical Conference (ATC’09). USENIX Association.Google ScholarGoogle Scholar
  62. Bianca Schroeder, Sotirios Damouras, and Phillipa Gill. 2010. Understanding latent sector errors and how to protect against them. ACM Transactions on Storage (TOS) 6, 3, Article 9 (Sept. 2010), 23 pages.Google ScholarGoogle Scholar
  63. Bianca Schroeder and Garth A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07), Vol. 7. USENIX Association, San Jose, CA, 1–16.Google ScholarGoogle Scholar
  64. Bianca Schroeder, Raghav Lagisetty, and Arif Merchant. 2016. Flash reliability in production: The expected and the unexpected. In Proceedings of the 14th USENIX Conference on File and Storage Technologies (FAST’16). USENIX Association, Santa Clara, CA, 67–80.Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Thomas Schwarz, Mary Baker, Steven Bassi, Bruce Baumgart, Wayne Flagg, Catherine van Ingen, Kobus Joste, Mark Manasse, and Mehul Shah. 2006. Disk failure investigations at the internet archive. In Work-in-Progress Session, NASA/IEEE Conference on Mass Storage Systems and Technologies (MSST).Google ScholarGoogle Scholar
  66. Sandeep Shah and Jon G. Elerath. 2004. Disk drive vintage and its effect on reliability. In Proceedings of the Annual Symposium Reliability and Maintainability (2004-RAMS). IEEE, 163–167.Google ScholarGoogle Scholar
  67. Sandeep Shah and Jon G. Elerath. 2005. Reliability analysis of disk drive failure mechanisms. In Proceedings of the Annual Reliability and Maintainability Symposium, 2005. IEEE, 226–231.Google ScholarGoogle Scholar
  68. Dimitris Skourtis, Dimitris Achlioptas, Noah Watkins, Carlos Maltzahn, and Scott Brandt. 2014. Flash on rails: Consistent flash performance through redundancy. In Proceedings of the USENIX Annual Technical Conference (ATC’14). USENIX Association, 463–474.Google ScholarGoogle Scholar
  69. Zoned Storage. [n.d.]. NVMe Zoned Namespaces. https://zonedstorage.io/introduction/zns/. [Online; accessed 23-Oct-2020].Google ScholarGoogle Scholar
  70. Kang-Deog Suh, Byung-Hoon Suh, Young-Ho Lim, Jin-Ki Kim, Young-Joon Choi, Yong-Nam Koh, Sung-Soo Lee, Suk-Chon Kwon, Byung-Soon Choi, Jin-Sun Yum, Jung-Hyuk Choi, Jang-Rae Kim, and Hyung-Kyu Lim. 1995. A 3.3 V 32 Mb NAND flash memory with incremental step pulse programming scheme. IEEE Journal of Solid-State Circuits 30, 11 (1995), 1149–1156.Google ScholarGoogle ScholarCross RefCross Ref
  71. Hung-Wei Tseng, Laura Grupp, and Steven Swanson. 2011. Understanding the impact of power loss on Flash memory. In Proceedings of the 48th Design Automation Conference (DAC’11). San Diego, CA, 35–40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. C. van Ingen and J. Gray. 2005. Empirical Measurements of Disk Failure Rates and Error Rates. Technical Report. MSR-TR-2005.Google ScholarGoogle Scholar
  73. Wikipedia. [n.d.]. S.M.A.R.T. https://en.wikipedia.org/wiki/S.M.A.R.T. [Online; accessed 23-Oct-2020].Google ScholarGoogle Scholar
  74. Erci Xu, Mai Zheng, Feng Qin, Yikang Xu, and Jiesheng Wu. 2019. Lessons and actions: What we learned from 10K SSD-related storage system failures. In Proceedings of the USENIX Annual Technical Conference (ATC’19). USENIX Association, Renton, WA, 961–976.Google ScholarGoogle Scholar
  75. Jimmy Yang and Feng-Bin Sun. 1999. A comprehensive review of hard-disk drive reliability. In Porceedings of the 1999 Annual Reliability and Maintainability. Symposium. (Cat. No. 99CH36283). IEEE, 403–409.Google ScholarGoogle ScholarCross RefCross Ref
  76. Mai Zheng, Joseph Tucek, Feng Qin, and Mark Lillibridge. 2013. Understanding the robustness of SSDs under power fault. In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST’13). USENIX Association, San Jose, CA, 271–284.Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Mai Zheng, Joseph Tucek, Feng Qin, Mark Lillibridge, Bill W. Zhao, and Elizabeth S. Yang. 2016. Reliability analysis of SSDs under power fault. ACM Transactions on Storage (TOS) 34, 4, Article 10 (Nov. 2016), 28 pages.Google ScholarGoogle Scholar

Index Terms

  1. Reliability of SSDs in Enterprise Storage Systems: A Large-Scale Field Study

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM Transactions on Storage
            ACM Transactions on Storage  Volume 17, Issue 1
            Special Section on Usenix Fast 2020
            February 2021
            165 pages
            ISSN:1553-3077
            EISSN:1553-3093
            DOI:10.1145/3446939
            • Editor:
            • Sam H. Noh
            Issue’s Table of Contents

            Copyright © 2021 ACM

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 13 January 2021
            • Accepted: 1 September 2020
            • Received: 1 June 2020
            Published in tos Volume 17, Issue 1

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format
          About Cookies On This Site

          We use cookies to ensure that we give you the best experience on our website.

          Learn more

          Got it!