Abstract
Benchmarking is critical when evaluating performance, but is especially difficult for file and storage systems. Complex interactions between I/O devices, caches, kernel daemons, and other OS components result in behavior that is rather difficult to analyze. Moreover, systems have different features and optimizations, so no single benchmark is always suitable. The large variety of workloads that these systems experience in the real world also adds to this difficulty.
In this article we survey 415 file system and storage benchmarks from 106 recent papers. We found that most popular benchmarks are flawed and many research papers do not provide a clear indication of true performance. We provide guidelines that we hope will improve future performance evaluations. To show how some widely used benchmarks can conceal or overemphasize overheads, we conducted a set of experiments. As a specific example, slowing down read operations on ext2 by a factor of 32 resulted in only a 2–5% wall-clock slowdown in a popular compile benchmark. Finally, we discuss future work to improve file system and storage benchmarking.
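A rough way to see why the compile benchmark hides the read slowdown is a back-of-the-envelope calculation. The sketch below is illustrative only and is not taken from the article's experiments: it assumes a simple additive model in which reads account for a fixed fraction of baseline wall-clock time, do not overlap with other work, and are the only thing that changes. The function names are hypothetical. Under that assumption, a 32x read slowdown that costs only 2% to 5% of elapsed time implies that reads made up roughly 0.06% to 0.16% of the baseline run.

```python
# Illustrative additive model (an assumption, not the article's methodology):
#   slowed_time = (1 - f) + factor * f = 1 + (factor - 1) * f
# where f is the fraction of baseline wall-clock time spent in reads.

def wall_clock_slowdown(read_fraction, factor=32):
    """Relative wall-clock slowdown when reads become `factor` times slower."""
    return read_fraction * (factor - 1)

def implied_read_fraction(slowdown, factor=32):
    """Invert the model: read fraction that would produce a given slowdown."""
    return slowdown / (factor - 1)

if __name__ == "__main__":
    # The 2% and 5% endpoints come from the abstract; the rest is the model.
    for s in (0.02, 0.05):
        f = implied_read_fraction(s)
        print(f"{s:.0%} slowdown -> reads are about {f:.3%} of baseline time")
```

If the model is even approximately right, the benchmark spends almost all of its time outside file system reads, which is consistent with the abstract's point that such a workload can conceal large overheads.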
- *Abd-El-Malek, M., II, W. V. C., Cranor, C., Ganger, G., Hendricks, J., Klosterman, A. J., Mesnier, M., Prasad, M., Salmon, B., Sambasivan, R. R., Sinnamohideen, S., Strunk, J. D., Thereska, E., Wachs, M., and Wylie, J. J. 2005.Ursa Minor: Versatile cluster-based storage. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 59--72. Google Scholar
Digital Library
- *Adya, A., Bolosky, W. J., Castro, M., Cermak, G., Chaiken, R., Douceur, J. R., Howell, J., Lorch, J. R., Theimer, M., and Wattenhofer, R. P. 2002. FARSITE: Federated, available, and reliable storage for an incompletely trusted environment. In Proceedings of the 5th Symposium on Operating System Design and Implementation, Boston, MA, 1--14. Google Scholar
Digital Library
- Agrawal, N., Bolosky, W. J., Douceur, J. R., and Lorch, J. R. 2007. A five-year study of file-system metadata. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 31--45. Google Scholar
Digital Library
- *Aguilera, M. K., Ji, M., Lillibridge, M., MacCormick, J., Oertli, E., Andersen, D., Burrows, M., Mann, T., and Thekkath, C. A. 2003. Block-Level security for network-attached disks. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 159--174. Google Scholar
Digital Library
- Akkerman, W. 2002. Strace software home page. www.liacs.nl/~wichert/strace/.Google Scholar
- *Anderson, D. C., Chase, J. S., and Vahdat, A. M. 2000. Interposed request routing for scalable network storage. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 259--272. Google Scholar
Digital Library
- *Anderson, E., Hobbs, M., Keeton, K., Spence, S., Uysal, M., and Veitch, A. 2002. Hippodrome: Running circles around storage administration. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 175--188. Google Scholar
Digital Library
- *Anderson, E., Kallahalla, M., Uysal, M., and Swaminathan, R. 2004. Buttress: A toolkit for flexible and high fidelity I/O benchmarking. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 45--58. Google Scholar
Digital Library
- Andrerson, D. 2002. Fstress: A flexible network file service benchmark. Tech. Rep. TR-2001-2002, Duke University. May.Google Scholar
- *Aranya, A., Wright, C. P., and Zadok, E. 2004. Tracefs: A file system to trace them all. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 129--143. Google Scholar
Digital Library
- *Arpaci-Dusseau, A. C., Arpaci-Dusseau, R. H., Burnett, N. C., Denehy, T. E., Engle, T. J., Gunawi, H. S., Nugent, J. A., and Popovici, F. I. 2003. Transforming policies into mechanisms with Infokernel. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (ACM SIGOPS), Bolton Landing, NY, 90--105. Google Scholar
Digital Library
- Blaze, M. 1992. NFS tracing by passive network monitoring. In Proceedings of the USENIX Winter Conference, San Francisco, CA.Google Scholar
- Bray, T. 1996. Bonnie home page. www.textuality.com/bonnie.Google Scholar
- Bryant, R., Forester, R., and Hawkes, J. 2002. Filesystem performance and scalability in Linux 2.4.17. In Proceedings of the Annual USENIX Technical Conference, FREENIX Track, Monterey, CA, 259--274. Google Scholar
Digital Library
- Bryant, R., Raddatz, D., and Sunshine, R. 2001. PenguinoMeter: A new file-I/O benchmark for Linux. In Proceedings of the 5th Annual Linux Showcase and Conference, Oakland, CA, 5--10. Google Scholar
Digital Library
- Chen, P. M. and Patterson, D. A. 1993. A new approach to I/O performance evaluation—Self-Scaling I/O benchmarks, predicted I/O performance. In Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (ACM SIGOPS), Seattle, WA, 1--12. Google Scholar
Digital Library
- *Cipar, J., Corner, M. D., and Berger, E. D. 2007. TFS: A transparent file system for contributory storage. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 215--229. Google Scholar
Digital Library
- Coker, R. 2001. Bonnie++ home page. www.coker.com.au/bonnie++.Google Scholar
- *Corbett, P., English, B., Goel, A., Grcanac, T., Kleiman, S., Leong, J., and Sankar, S. 2004. Row-Diagonal parity for double disk failure correction. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 1--14. Google Scholar
Digital Library
- *Dabek, F., Kaashoek, M. F., Karger, D., and Morris, R. 2001. Wide-Area cooperative storage with CFS. In Proceedings of the 18th ACM Symposium on Operating Systems Principles, Banff, Canada. Google Scholar
Digital Library
- *DeBergalis, M., Corbett, P., Kleiman, S., Lent, A., Noveck, D., Talpey, T., and Wittle, M. 2003. The direct access file system. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 175--188. Google Scholar
Digital Library
- *Denehy, T. E., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2002. Bridging the information gap in storage protocol stacks. In Proceedings of the Annual USENIX Technical Conference, Monterey, CA, 177--190. Google Scholar
Digital Library
- *Denehy, T. E., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2005. Journal-guided resynchronization for software RAID. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 87--100. Google Scholar
Digital Library
- *Dimitrijevic, Z., Rangaswami, R., and Chang, E. 2003. Design and implementation of semi-preemptible IO. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 145--158. Google Scholar
Digital Library
- *Eisler, M., Corbett, P., Kazar, M., Nydick, D. S., and Wagner, J. C. 2007. Data ONTAP GX: A scalable storage cluster. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 139--152. Google Scholar
Digital Library
- Ellard, D., Ledlie, J., Malkani, P., and Seltzer, M. 2003. Passive NFS tracing of email and research workloads. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA. Google Scholar
Digital Library
- Ellard, D. and Seltzer, M. 2003a. New NFS tracing tools and techniques for system analysis. In Proceedings of the Annual USENIX Conference on Large Installation Systems Administration, San Diego, CA. Google Scholar
Digital Library
- Ellard, D. and Seltzer, M. 2003b. NFS tricks and benchmarking traps. In Proceedings of the Annual USENIX Technical Conference, FREENIX Track, San Antonio, TX, 101--114. Google Scholar
Digital Library
- *Flinn, J., Sinnamohideen, S., Tolia, N., and Satyanaryanan, M. 2003. Data staging on untrusted surrogates. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 15--28. Google Scholar
Digital Library
- *Fraser, K. and Chang, F. 2003. Operating system I/O speculation: How two invocations are faster than one. In Proceedings of the Annual USENIX Technical Conference. San Antonio, TX, 325--338.Google Scholar
- *Fu, K., Kaashoek, M. F., and Mazières, D. 2000. Fast and secure distributed read-only file system. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 181--196. Google Scholar
Digital Library
- *Ghemawat, S., Gobioff, H., and Leung, S. T. 2003. The Google file system. In Proceedings of the 19th ACM Symposium on Operating Systems Principles. (ACM SIGOPS), Bolton Landing, NY, 29--43. Google Scholar
Digital Library
- *Gniady, C., Butt, A. R., and Hu, Y. C. 2004. Program-Counter-Based pattern classification in buffer caching. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation (ACM SIGOPS), San Francisco, CA, 395--408. Google Scholar
Digital Library
- *Gopal, B. and Manber, U. 1999. Integrating content-based access mechanisms with hierarchical file systems. In Proceedings of the 3rd Symposium on Operating Systems Design and Implementation. (ACM SIGOPS), New Orleans, LA, 265--278. Google Scholar
Digital Library
- *Grönvall, B., Westerlund, A., and Pink, S. 1999. The design of a multicast-based distributed file system. In Proceedings of the 3rd Symposium on Operating Systems Design and Implementation (ACM SIGOPS), New Orleans, LA, 251--264. Google Scholar
Digital Library
- *Gulati, A., Naik, M., and Tewari, R. 2007. Nache: Design and implementation of a caching proxy for nfsv4. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 199--214. Google Scholar
Digital Library
- *Howard, J. H., Kazar, M. L., Menees, S. G., Nichols, D. A., Satyanarayanan, M., Sidebotham, R. N., and West, M. J. 1988. Scale and performance in a distributed file system. ACM Trans. Comput. Syst. 6, 1 (Feb.), 51--81. Google Scholar
Digital Library
- *Huang, H., Hung, W., and Shin, K. 2005. FS2: Dynamic data replication in free disk space for improving disk performance and energy consumption. In Proceedings of the 20th ACM Symposium on Operating Systems Principles, Brighton, UK, 263--276. Google Scholar
Digital Library
- Huang, L. and Chiueh, T. 2001. Charm: An I/O-driven execution strategy for high-performance transaction processing. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 275--288. Google Scholar
Digital Library
- *Joglekar, A., Kounavis, M. E., and Berry, F. L. 2005. A scalable and high performance software iSCSI implementation. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 267--280. Google Scholar
Digital Library
- Joukov, N., Traeger, A., Iyer, R., Wright, C. P., and Zadok, E. 2006. Operating system profiling via latency analysis. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (ACM SIGOPS), Seattle, WA, 89--102. Google Scholar
Digital Library
- *Joukov, N., Wong, T., and Zadok, E. 2005. Accurate and efficient replaying of file system traces. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 337--350. Google Scholar
Digital Library
- *Kallahalla, M., Riedel, E., Swaminathan, R., Wang, Q., and Fu, K. 2003. Plutus: Scalable secure file sharing on untrusted storage. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 29--42. Google Scholar
Digital Library
- Kaminsky, M., Savvides, G., Mazieres, D., and Kaashoek, M. F. 2003. Decentralized user authentication in a global file system. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (ACM SIGOPS), Bolton Landing, NY. Google Scholar
Digital Library
- Katcher, J. 1997. PostMark: A new filesystem benchmark. Tech. Rep. TR3022, Network Appliance. www.netapp.com/tech_library/3022.html.Google Scholar
- *Kim, J. M., Choi, J., Kim, J., Noh, S. H., Min, S. L., Cho, Y., and Kim, C. S. 2000. A low-overhead, high-performance unified buffer management scheme that exploits sequential and looping references. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 119--134. Google Scholar
Digital Library
- *Kim, M., Cox, L., and Noble, B. 2002. Safety, visibility, and performance in a wide-area file system. In Proceedings of the 1st USENIX Conference on File and Storage Technologies. Monterey, CA. Google Scholar
Digital Library
- *Kroeger, T. M. and Long, D. D. E. 2001. Design and implementation of a predictive file prefetching algorithm. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 105--118. Google Scholar
Digital Library
- LASS. 2006. UMass trace repository. http://traces.cs.umass.edu.Google Scholar
- *Lee, Y., Leung, K., and Satyanarayanan, M. 1999. Operation-Based update propagation in a mobile file system. In Proceedings of the Annual USENIX Technical Conference, Monterey, CA, 43--56. Google Scholar
Digital Library
- *Li, J., Krohn, M., Mazières, D., and Shasha, D. 2004. Secure untrusted data repository (SUNDR). In Proceedings of the 6th Symposium on Operating Systems Design and Implementation, San Francisco, CA, 121--136. Google Scholar
Digital Library
- *Lu, C., Alvarez, G. A., and Wilkes, J. 2002. Aqueduct: Online data migration with performance guarantees. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA. Google Scholar
Digital Library
- *Lumb, C. R., Merchant, A., and Alvarez, G. A. 2003. Façade: Virtual storage devices with performance guarantees. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 131--144. Google Scholar
Digital Library
- *Lumb, C. R., Schindler, J., and Ganger, G. R. 2002. Freeblock scheduling outside of disk firmware. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 275--288. Google Scholar
Digital Library
- *MacCormick, J., Murphy, N., Najork, M., Thekkath, C., and Zhou, L. 2004. Boxwood: Abstractions as the foundation for storage infrastructure. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation, San Francisco, CA, 105--120. Google Scholar
Digital Library
- *Magoutis, K., Addetia, S., Fedorova, A., and Seltzer, M. I. 2003. Making the most out of direct-access network attached storage. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 189--202. Google Scholar
Digital Library
- *Magoutis, K., Addetia, S., Fedorova, A., Seltzer, M. I., Chase, J. S., Gallatin, A. J., Kisley, R., Wickremesinghe, R. G., and Gabber, E. 2002. Structure and performance of the direct access file system. In Proceedings of the Annual USENIX Technical Conference, Monterey, CA. Google Scholar
Digital Library
- *Maziéres, D. 2001. A toolkit for user-level file systems. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 261--274. Google Scholar
Digital Library
- *Mazières, D., Kaminsky, M., Kaashoek, M. F., and Witchel, E. 1999. Separating key management from file system security. In Proceedings of the 17th ACM Symposium on Operating Systems Principles, Charleston, SC, 124--139. Google Scholar
Digital Library
- McDougall, R. and Mauro, J. 2005. FileBench. www.solarisinternals.com/si/tools/filebench/. Google Scholar
Digital Library
- *Memik, G., Kandemir, M., and Choudhary, A. 2002. Exploiting inter-file access patterns using multi-collective I/O. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA. Google Scholar
Digital Library
- Mesnier, M. P., Wachs, M., Sambasivan, R. R., Lopez, J., Hendricks, J., Ganger, G. R., and O'Hallaron, D. 2007. //TRACE: Parallel trace replay with approximate causal events. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 153--167. Google Scholar
Digital Library
- *Miller, E., Freeman, W., Long, D., and Reed, B. 2002. Strong security for network-attached storage. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 1--13. Google Scholar
Digital Library
- Mogul, J. 1999. Brittle metrics in operating systems research. In Proceedings of the IEEE Workshop on Hot Topics in Operating Systems (HOTOS), Rio Rica, AZ, 90--95. Google Scholar
Digital Library
- Mummert, L. and Satyanarayanan, M. 1994. Long term distributed file reference tracing: Implementation and experience. Tech. Rep. CMU-CS-94-213, Carnegie Mellon University, Pittsburgh, Pennsylvania. Google Scholar
Digital Library
- *Muniswamy-Reddy, K., Holland, D. A., Braun, U., and Seltzer, M. 2006. Provenance-Aware storage systems. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 43--56. Google Scholar
Digital Library
- *Muniswamy-Reddy, K., Wright, C. P., Himmer, A., and Zadok, E. 2004. A versatile and user-oriented versioning file system. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 115--128. Google Scholar
Digital Library
- *Muthitacharoen, A., Chen, B., and Mazieres, D. 2001. A low-bandwidth network file system. In Proceedings of the 18th ACM Symposium on Operating Systems Principles, Banff, Canada. Google Scholar
Digital Library
- *Muthitacharoen, A., Morris, R., Gil, T. M., and Che, B. 2002. Ivy: A read/write peer-to-peer file system. In Proceedings of the 5th Symposium on Operating System Design and Implementation, Boston, MA, 31--44. Google Scholar
Digital Library
- *Ng, W. T., Sun, H., Hillyer, B., Shriver, E., Gabber, E., and Ozden, B. 2002. Obtaining high performance for storage outsourcing. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 145--158. Google Scholar
Digital Library
- *Nightingale, E. B., Chen, P., and Flinn, J. 2005. Speculative execution in a distributed file system. In Proceedings of the 20th ACM Symposium on Operating Systems Principles, Brighton, UK, 191--205. Google Scholar
Digital Library
- *Nightingale, E. B. and Flinn, J. 2004. Energy-Efficiency and storage flexibility in the Blue file system. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation, San Francisco, CA, 363--378. Google Scholar
Digital Library
- *Nightingale, E. B., Veeraraghavan, K., Chen, P. M., and Flinn, J. 2006. Rethink the sync. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation, Seattle, WA, 1--14. Google Scholar
Digital Library
- *Nugent, J., Arpaci-Dusseau, A., and Arpaci-Dusseau, R. 2003. Controlling your PLACE in the file system with gray-box techniques. In Proceedings of the Annual USENIX Technical Conference, San Antonio, TX, 311--323.Google Scholar
- OSDL. 2004. Iometer project. www.iometer.org/.Google Scholar
- OSDL. 2007. Database test suite. www.osdl.org/lab_activities/kernel_testing/osdl_database_test_suite/.Google Scholar
- Ousterhout, J. 1990. Why aren't operating systems getting faster as fast as hardware? In Proceedings of the Summer USENIX Technical Conference, Anaheim, CA, 247--256.Google Scholar
- Ousterhout, J., Costa, H., Harrison, D., Kunze, J., Kupfer, M., and Thompson, J. 1985. A trace-driven analysis of the UNIX 4.2 BSD file system. In Proceedings of the 10th ACM Symposium on Operating System Principles, Orcas Island, WA, 15--24. Google Scholar
Digital Library
- *Padioleau, Y. and Ridoux, O. 2003. A logic file system. In Proceedings of the Annual USENIX Technical Conference, San Antonio, TX, 99--112.Google Scholar
- *Papathanasiou, A. E. and Scott, M. L. 2004. Energy efficient prefetching and caching. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 255--268. Google Scholar
Digital Library
- *Patterson, H., Manley, S., Federwisch, M., Hitz, D., Kleinman, S., and Owara, S. 2002. SnapMirror: File system based asynchronous mirroring for disaster recovery. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 117--129. Google Scholar
Digital Library
- *Peek, D. and Flinn, J. 2006. EnsemBlue: Integrating distributed storage and consumer electronics. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation, Seattle, WA, 219--232. Google Scholar
Digital Library
- PEL. 2001. BYU trace distribution center. http://tds.cs.byu.edu/tds.Google Scholar
- *Peterson, Z. N. J., Burns, R., Ateniese, G., and Bono, S. 2007. Design and implementation of verifiable audit trails for a versioning file system. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 93--106. Google Scholar
Digital Library
- *Peterson, Z. N. J., Burns, R., J. Herring, A. S., and Rubin, A. D. 2005. Secure deletion for a versioning file system. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 143--154. Google Scholar
Digital Library
- *Prabhakaran, V., Agrawal, N., Bairavasundaram, L. N., Gunawi, H. S., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2005a. IRON file systems. In Proceedings of the 20th ACM Symposium on Operating Systems Principles, Brighton, UK, 206--220. Google Scholar
Digital Library
- *Prabhakaran, V., Arpaci-Dusseau, A. C., and Arpaci-Dussea, R. H. 2005b. Analysis and evolution of journaling file systems. In Proceedings of the Annual USENIX Technical Conference, Anaheim, CA, 105--120. Google Scholar
Digital Library
- *Quinlan, S. and Dorward, S. 2002. Venti: A new approach to archival storage. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 89--101. Google Scholar
Digital Library
- *Radkov, P., Yin, L., Goyal, P., Sarkar, P., and Shenoy, P. 2004. A performance comparison of NFS and iSCSI for IP-networked storage. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 101--114. Google Scholar
Digital Library
- *Rhea, S., Eaton, P., Geels, D., Weatherspoon, H., Zhao, B., and Kubiatowicz, J. 2003. Pond: The OceanStore prototype. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 1--14. Google Scholar
Digital Library
- Robinson, D. 1999. The advancement of NFS benchmarking: SFS 2.0. In Proceedings of the 13th USENIX Systems Administration Conference, Seattle, WA, 175--185. Google Scholar
Digital Library
- Roselli, D., Lorch, J. R., and Anderson, T. E. 2000. A comparison of file system workloads. In Proceedings of the Annual USENIX Technical Conference, San Diego, CA, 41--54. Google Scholar
Digital Library
- Rosenblum, M. 1992. The design and implementation of a log-structured file system. Ph.D. thesis, Electrical Engineering and Computer Sciences, Computer Science Division, University of California. Google Scholar
Digital Library
- *Rowstron, A. and Druschel, P. 2001. Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility. In Proceedings of the 18th ACM Symposium on Operating Systems Principles, Banff, Canada. Google Scholar
Digital Library
- Ruemmler, C. and Wilkes, J. 1993. UNIX disk access patterns. In Proceedings of the Winter USENIX Technical Conference, San Diego, CA, 405--420.Google Scholar
- Ruwart, T. M. 2001. File system performance benchmarks, then, now, and tomorrow. In Proceedings of the 14th IEEE Symposium on Mass Storage Systems, San Diego, CA. Google Scholar
Digital Library
- *Saito, Y., Karamanolis, C., Karlsson, M., and Mahalingam, M. 2002. Taming aggressive replication in the Pangaea wide-area file system. In Proceedings of the 5th Symposium on Operating System Design and Implementation, Boston, MA, 15--30. Google Scholar
Digital Library
- Sandberg, R., Goldberg, D., Kleiman, S., Walsh, D., and Lyon, B. 1985. Design and implementation of the Sun network filesystem. In Proceedings of the Summer USENIX Technical Conference, Portland, Oregon, 119--130.Google Scholar
- *Santry, D. S., Feeley, M. J., Hutchinson, N. C., Veitch, A. C., Carton, R. W., and Ofir, J. 1999. Deciding when to forget in the Elephant file system. In Proceedings of the 17th ACM Symposium on Operating Systems Principles, Charleston, SC, 110--123. Google Scholar
Digital Library
- *Sarkar, P., Uttamchandani, S., and Voruganti, K. 2003. Storage over IP: When does hardware support help? In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 231--244. Google Scholar
Digital Library
- *Schindler, J., Griffin, J. L., Lumb, C. R., and Ganger, G. R. 2002. Track-Aligned extents: Matching access patterns to disk drive characteristics. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 259--274. Google Scholar
Digital Library
- *Schindler, J., Schlosser, S. W., Shao, M., and Ailamaki, A. 2004. Atropos: A disk array volume manager for orchestrated use of disks. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 159--172. Google Scholar
Digital Library
- *Schlosser, S. W., Schindler, J., Papadomanolakis, S., Shao, M., Ailamaki, A., Faloutsos, C., and Ganger, G. R. 2005. On multidimensional data and modern disks. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 225--238. Google Scholar
Digital Library
- Schmidt, A., Waas, F., Kersten, M., Florescu, D., Carey, M. J., Manolescu, I., and Busse, R. 2001. Why and how to benchmark XML databases. ACM SIGMOD Rec. 30, 3 (Sept.), 27--32. Google Scholar
Digital Library
- *Schmuck, F. and Haskin, R. 2002. GPFS: A shared-disk file system for large computing clusters. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 231--244. Google Scholar
Digital Library
- *Seltzer, M. I., Ganger, G. R., McKusick, M. K., Smith, K. A., Soules, C. A. N., and Stein, C. A. 2000. Journaling versus soft updates: Asynchronous meta-data protection in file systems. In Proceedings of the Annual USENIX Technical Conference, San Diego, CA, 71--84. Google Scholar
Digital Library
- Seltzer, M. I., Krinsky, D., Smith, K. A., and Zhang, X. 1999. The case for application-specific benchmarking. In Proceedings of the IEEE Workshop on Hot Topics in Operating Systems (HOTOS), Rio Rica, AZ, 102--107. Google Scholar
Digital Library
- Shein, B., Callahan, M., and Woodbury, P. 1989. NFSSTONE: A network file server performance benchmark. In Proceedings of the Summer USENIX Technical Conference, Baltimore, MD, 269--275.Google Scholar
- Shepler, S. 2005. NFS version 4. In Proceedings of the Annual USENIX Technical Conference, Anaheim, CA. http://mediacast.sun.com/share/shepler/20050414_usenix_ext.pdf.Google Scholar
- *Shrira, L. and Xu, H. 2006. Thresher: An efficient storage manager for copy-on-write snapshots. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 57--70. Google Scholar
Digital Library
- *Sivathanu, G., Sundararaman, S., and Zadok, E. 2006. Type-Safe disks. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation, Seattle, WA, 15--28. Google Scholar
Digital Library
- *Sivathanu, M., Bairavasundaram, L. N., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2004a. Life or death at block-level. In Proceedings of the 6th Symposium on Operating Systems Design and Implementation, San Francisco, CA, 379--394. Google Scholar
Digital Library
- *Sivathanu, M., Bairavasundaram, L. N., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2005. Database-Aware semantically-smart storage. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 239--252. Google Scholar
Digital Library
- *Sivathanu, M., Prabhakaran, V., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2004b. Improving storage system availability with D-GRAID. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 15--30. Google Scholar
Digital Library
- *Sivathanu, M., Prabhakaran, V., Popovici, F. I., Denehy, T. E., Arpaci-Dusseau, A. C., and Arpaci-Dusseau, R. H. 2003. Semantically-Smart disk systems. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 73--88. Google Scholar
Digital Library
- Small, C., Ghosh, N., Saleeb, H., Seltzer, M., and Smith, K. 1997. Does systems research measure up? Tech. Rep. TR-16-97, Harvard University. November.Google Scholar
- Smith, K. A. and Seltzer, M. I. 1997. File system aging—Increasing the relevance of file system benchmarks. In Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Seattle, WA, 203--213. Google Scholar
Digital Library
- SNIA. 2007. SNIA—Storage network industry association: IOTTA repository. http://iotta.snia.org.Google Scholar
- *Sobti, S., Garg, N., Zhang, C., Yu, X., Krishnamurthy, A., and Wang, R. 2002. PersonalRAID: Mobile storage for distributed and disconnected computers. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 159--174. Google Scholar
Digital Library
- *Soules, C. A. N., Goodson, G. R., Strunk, J. D., and Ganger, G. R. 2003. Metadata efficiency in versioning file systems. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 43--58. Google Scholar
Digital Library
- Spadavecchia, J. and Zadok, E. 2002. Enhancing NFS cross-administrative domain access. In Proceedings of the Annual USENIX Technical Conference, FREENIX Track, Monterey, CA, 181--194. Google Scholar
Digital Library
- SPC. 2007. Storage performance council. www.storageperformance.org.Google Scholar
- SPEC. 2001. SPEC SFS97_R1 V3.0. www.spec.org/sfs97r1.Google Scholar
- SPEC. 2003. SPEC SMT97. www.spec.org/osg/smt97/.Google Scholar
- SPEC. 2004. SPEC SDM Suite. www.spec.org/osg/sdm91/.Google Scholar
- SPEC. 2005a. The SPEC organization. www.spec.org/.Google Scholar
- SPEC. 2005b. SPECweb99. www.spec.org/web99.Google Scholar
- SPEC. 2007. SPECviewperf 9. www.spec.org/gpc/opc.static/vp9info.html.Google Scholar
- *Stein, C. A., Howard, J. H., and Seltzer, M. I. 2001. Unifying file system protection. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 79--90. Google Scholar
Digital Library
- *Strunk, J. D., Goodson, G. R., Scheinholtz, M. L., Soules, C. A. N., and Ganger, G. R. 2000. Self-Securing storage: Protecting data in compromised systems. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 165--180. Google Scholar
Digital Library
- Swartz, K. L. 1996. The brave little toaster meets Usenet. In Proceedings of the 10th USENIX System Administration Conference (LISA), Chicago, IL, 161--170. Google Scholar
Digital Library
- *Tan, Y., Wong, T., Strunk, J. D., and Ganger, G. R. 2005. Comparison-Based file server verification. In Proceedings of the Annual USENIX Technical Conference, Anaheim, CA, 121--133. Google Scholar
Digital Library
- Tang, D. 1995. Benchmarking filesystems. Tech. Rep. TR-19-95, Harvard University.
- Tang, D. and Seltzer, M. 1994. Lies, damned lies, and file system benchmarks. Tech. Rep. TR-34-94, Harvard University. December. In VINO: The 1994 Fall Harvest.
- *Thereska, E., Schindler, J., Bucy, J., Salmon, B., Lumb, C. R., and Ganger, G. R. 2004. A framework for building unobtrusive disk maintenance applications. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 213--226.
- *Tian, L., Feng, D., Jiang, H., Zhou, K., Zeng, L., Chen, J., Wang, Z., and Song, Z. 2007. PRO: A popularity-based multi-threaded reconstruction optimization for RAID-structured storage systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 277--290.
- *Tolia, N., Harkes, J., Kozuch, M., and Satyanarayanan, M. 2004. Integrating portable and distributed storage. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies, San Francisco, CA, 227--238.
- *Tolia, N., Kozuch, M., Satyanarayanan, M., Karp, B., Bressoud, T., and Perrig, A. 2003. Opportunistic use of content addressable storage for distributed file systems. In Proceedings of the Annual USENIX Technical Conference, San Antonio, TX, 127--140.
- TPC. 2005. Transaction processing performance council. www.tpc.org.
- Tridgell, A. 1999. Dbench-3.03 README. http://samba.org/ftp/tridge/dbench/README.
- Van Meter, R. 1997. Observing the effects of multi-zone disks. In Proceedings of the Annual USENIX Technical Conference, Anaheim, CA, 19--30.
- *Van Meter, R. and Gao, M. 2000. Latency management in storage systems. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 103--118.
- *Veeraraghavan, K., Myrick, A., and Flinn, J. 2007. Cobalt: Separating content distribution from authorization in distributed file systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 231--244.
- VERITAS Software. 1999. VERITAS file server edition performance brief: A PostMark 1.11 benchmark comparison. Tech. Rep., Veritas Software Corporation. June. http://eval.veritas.com/webfiles/docs/fsedition-postmark.pdf.
- VeriTest. 2002. NetBench. www.veritest.com/benchmarks/netbench/.
- *Vilayannur, M., Nath, P., and Sivasubramaniam, A. 2005. Providing tunable consistency for a parallel file store. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 17--30.
- Vogels, W. 1999. File system usage in Windows NT 4.0. In Proceedings of the 17th ACM Symposium on Operating Systems Principles, Charleston, SC, 93--109.
- *Wachs, M., Abd-El-Malek, M., Thereska, E., and Ganger, G. R. 2007. Argon: Performance insulation for shared storage servers. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 61--76.
- *Wang, A. A., Reiher, P., Popek, G. J., and Kuenning, G. H. 2002. Conquest: Better performance through a disk/persistent-RAM hybrid file system. In Proceedings of the Annual USENIX Technical Conference, Monterey, CA, 15--28.
- *Wang, R. Y., Anderson, T. E., and Patterson, D. A. 1999. Virtual log based file systems for a programmable disk. In Proceedings of the 3rd Symposium on Operating Systems Design and Implementation, New Orleans, LA, 29--44.
- *Wang, Y. and Merchant, A. 2007. Proportional-Share scheduling for distributed storage systems. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 47--60.
- Watson, A. and Nelson, B. 1992. LADDIS: A multi-vendor and vendor-neutral SPEC NFS benchmark. In Proceedings of the 6th USENIX Systems Administration Conference (LISA VI), Long Beach, CA, 17--32.
- *Weddle, C., Oldham, M., Qian, J., Wang, A. A., Reiher, P., and Kuenning, G. 2007. PARAID: A gear-shifting power-aware RAID. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 245--260.
- *Weil, S., Brandt, S., Miller, E., Long, D., and Maltzahn, C. 2006. Ceph: A scalable, high-performance distributed file system. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation, Seattle, WA, 307--320.
- Wittle, M. and Keith, B. E. 1993. LADDIS: The next generation in NFS file server benchmarking. In Proceedings of the Summer USENIX Technical Conference, Cincinnati, OH, 111--128.
- Wright, C. P., Dave, J., and Zadok, E. 2003a. Cryptographic file systems performance: What you don't know can hurt you. In Proceedings of the 2nd IEEE International Security In Storage Workshop. IEEE Computer Society, Washington, DC, 47--61.
- Wright, C. P., Joukov, N., Kulkarni, D., Miretskiy, Y., and Zadok, E. 2005. Auto-Pilot: A platform for system software benchmarking. In Proceedings of the Annual USENIX Technical Conference, FREENIX Track, Anaheim, CA, 175--187.
- *Wright, C. P., Martino, M., and Zadok, E. 2003b. NCryptfs: A secure and convenient cryptographic file system. In Proceedings of the Annual USENIX Technical Conference, San Antonio, TX, 197--210.
- *Yu, X., Gum, B., Chen, Y., Wang, R. Y., Li, K., Krishnamurthy, A., and Anderson, T. E. 2000. Trading capacity for performance. In Proceedings of the 4th USENIX Symposium on Operating System Design and Implementation, San Diego, CA, 243--258.
- *Yumerefendi, A. R. and Chase, J. S. 2007. Strong accountability for network storage. In Proceedings of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, 77--92.
- *Zadok, E. 2002. Overhauling Amd for the '00s: A case study of GNU autotools. In Proceedings of the Annual USENIX Technical Conference, FREENIX Track, Monterey, CA, 287--297.
- *Zadok, E., Anderson, J. M., Bădulescu, I., and Nieh, J. 2001. Fast indexing: Support for size-changing algorithms in stackable file systems. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 289--304.
- *Zadok, E., Bădulescu, I., and Shender, A. 1999. Extending file systems using stackable templates. In Proceedings of the Annual USENIX Technical Conference, Monterey, CA, 57--70.
- *Zadok, E. and Nieh, J. 2000. FiST: A language for stackable file systems. In Proceedings of the Annual USENIX Technical Conference, San Diego, CA, 55--70.
- *Zhang, C., Yu, X., Krishnamurthy, A., and Wang, R. Y. 2002. Configuring and scheduling an eager-writing disk array for a transaction processing workload. In Proceedings of the 1st USENIX Conference on File and Storage Technologies, Monterey, CA, 289--304.
- *Zhang, Z. and Ghose, K. 2003. yFS: A journaling file system design for handling large data sets with reduced seeking. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies, San Francisco, CA, 59--72.
- *Zhou, Y., Philbin, J., and Li, K. 2001. The multi-queue replacement algorithm for second level buffer caches. In Proceedings of the Annual USENIX Technical Conference, Boston, MA, 91--104.
- *Zhu, N., Chen, J., and Chiueh, T. 2005a. TBBT: Scalable and accurate trace replay for file server evaluation. In Proceedings of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, 323--336.
- *Zhu, Q., Chen, Z., Tan, L., Zhou, Y., Keeton, K., and Wilkes, J. 2005b. Hibernator: Helping disk arrays sleep through the winter. In Proceedings of the 20th ACM Symposium on Operating Systems Principles, Brighton, UK, 177--190.