Abstract
We consider the problem of recovering from the failures of distributable threads (“threads”) in distributed real-time systems that operate under runtime uncertainties, including uncertainties in thread execution times, thread arrivals, and node failure occurrences. When a thread experiences a node failure, the result is a broken thread with an orphan. Under a termination model, the orphans must be detected and aborted, and exceptions must be delivered to the farthest contiguous surviving thread segment so that thread execution can resume. Our application/scheduling model includes the proposed distributable thread programming model for the emerging Distributed Real-Time Specification for Java (DRTSJ), together with an exception-handler model. Threads are subject to time/utility function (TUF) time constraints and a utility accrual (UA) optimality criterion. A key underpinning of the TUF/UA scheduling paradigm is the notion of “best-effort,” where higher-importance threads are always favored over lower-importance ones, irrespective of thread urgency as specified by their time constraints. We present a thread scheduling algorithm called HUA and a thread integrity protocol called TPR. We show that HUA and TPR bound the orphan cleanup and recovery time with bounded loss of the best-effort property. Our implementation experience with HUA/TPR in the Reference Implementation of the proposed programming model for the DRTSJ demonstrates the effectiveness of the algorithm and protocol.
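The termination model described above can be illustrated with a minimal sketch. It is not HUA or TPR, and it does not use any DRTSJ API; the class name `OrphanSketch`, its methods, and the node names are hypothetical. It assumes a thread is represented as a list of the nodes hosting its segments, ordered from the root segment to the head, and that the set of failed nodes is already known.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only: not the paper's HUA/TPR and not a DRTSJ API.
public class OrphanSketch {

    // Index of the farthest segment still contiguously connected to the root,
    // or -1 if the node hosting the root segment has itself failed.
    // Under a termination model, this segment would receive the exception.
    public static int farthestSurvivingSegment(List<String> segmentNodes,
                                               Set<String> failedNodes) {
        int last = -1;
        for (int i = 0; i < segmentNodes.size(); i++) {
            if (failedNodes.contains(segmentNodes.get(i))) {
                break; // the chain is broken at the first failed node
            }
            last = i;
        }
        return last;
    }

    // Indices of orphans: segments on live nodes that lie downstream of the
    // first failed node and are therefore disconnected from the root.
    // Under a termination model, these would be detected and aborted.
    public static List<Integer> orphanSegments(List<String> segmentNodes,
                                               Set<String> failedNodes) {
        List<Integer> orphans = new ArrayList<>();
        int start = farthestSurvivingSegment(segmentNodes, failedNodes) + 1;
        for (int i = start; i < segmentNodes.size(); i++) {
            if (!failedNodes.contains(segmentNodes.get(i))) {
                orphans.add(i);
            }
        }
        return orphans;
    }

    public static void main(String[] args) {
        // A thread whose segments run on n1 -> n2 -> n3 -> n4; node n3 fails.
        List<String> chain = List.of("n1", "n2", "n3", "n4");
        Set<String> failed = Set.of("n3");
        System.out.println("deliver exception to segment index "
                + farthestSurvivingSegment(chain, failed));
        System.out.println("abort orphan segment indices "
                + orphanSegments(chain, failed));
    }
}
```

In this example the segment on `n2` is the farthest contiguous survivor and the segment on `n4` is the orphan. The sketch ignores timing entirely; bounding how long detection, abortion, and exception delivery take is the contribution the abstract attributes to HUA and TPR.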
Index Terms
Recovering from distributable thread failures in distributed real-time Java