Abstract
A long body of research work has led to the conjecture that highly efficient IO processing at user-level would necessarily violate protection. In this paper, we debunk this myth by introducing DLibOS a new paradigm that consists of distributing a library OS on specialized cores to achieve performance and protection at the user-level. Its main novelty consists of leveraging network-on-chip to allow hardware message passing, rather than context switches, for communication between different address spaces. To demonstrate the feasibility of our approach, we implement a driver and a network stack at user-level on a Tilera many-core machine. We define a novel asynchronous socket interface and partition the memory such that the reception, the transmission and the application modify isolated regions. Our high performance results of 4.2 and 3.1 million requests per second obtained on a webserver and the Memcached applications, respectively, confirms the relevance of our design decisions. Finally, we compare DLibOS against a non-protected user-level network stack and show that protection comes at a negligible cost.
- T. E. Anderson. 1992. The case for application-specific operating systems {1992} Proceedings Third Workshop on Workstation Operating Systems. 92--94.Google Scholar
- Berk Atikoglu, Yuehai Xu, Eitan Frachtenberg, Song Jiang, and Mike Paleczny. 2012. Workload Analysis of a Large-scale Key-value Store SIGMETRICS '12. ACM, 53--64. Google Scholar
Digital Library
- Luiz Barroso, Mike Marty, David Patterson, and Parthasarathy Ranganathan. 2017. Attack of the Killer Microseconds. Commun. ACM Vol. 60, 4 (March. 2017), 48--54. http://www.usenix.org/conference/atc16/technical-sessions/presentation/yasukataGoogle Scholar
Digital Library
- Christopher Zimmer and Frank Mueller. 2015. NoCMsg: A Scalable Message-Passing Abstraction for Network-on-Chips. ACM Trans. Archit. Code Optim. Vol. 12, 1 (March. 2015), 1:1--1:24. 1544--3566 Google Scholar
Digital Library
Index Terms
DLibOS: Performance and Protection with a Network-on-Chip
Recommendations
DLibOS: Performance and Protection with a Network-on-Chip
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsA long body of research work has led to the conjecture that highly efficient IO processing at user-level would necessarily violate protection. In this paper, we debunk this myth by introducing DLibOS a new paradigm that consists of distributing a ...
The IX Operating System: Combining Low Latency, High Throughput, and Efficiency in a Protected Dataplane
The conventional wisdom is that aggressive networking requirements, such as high packet rates for small messages and μs-scale tail latency, are best addressed outside the kernel, in a user-level networking stack. We present ix, a dataplane operating ...
Energy proportionality and workload consolidation for latency-critical applications
SoCC '15: Proceedings of the Sixth ACM Symposium on Cloud ComputingEnergy proportionality and workload consolidation are important objectives towards increasing efficiency in large-scale datacenters. Our work focuses on achieving these goals in the presence of applications with μs-scale tail latency requirements. Such ...







Comments