Vector Extensions in COTS Processors to Increase Guaranteed Performance in Real-Time Systems

Published: 24 January 2023

Abstract

The need for increased application performance in high-integrity systems, such as those in avionics, is on the rise as software continues to implement more complex functionalities. The prevalent computing solution for future high-integrity embedded products is the multi-processor system-on-chip (MPSoC). MPSoCs include central processing unit (CPU) multicores that improve performance via thread-level parallelism. MPSoCs also include generic accelerators (graphics processing units [GPUs]) and application-specific accelerators. However, the data processing approach (DPA) required to exploit each of these underlying parallel hardware blocks carries several open challenges that hinder safe deployment in high-integrity domains. The main challenges include the qualification of the DPA's associated runtime system and the difficulty of analyzing programs that use the DPA with out-of-the-box timing analysis and code coverage tools. In this work, we perform a thorough analysis of vector extensions (VExts) in current commercial off-the-shelf (COTS) processors for high-integrity systems. We show that VExts avoid many of the challenges arising with parallel programming models and GPUs. Unlike other DPAs, VExts require no runtime support, prevent by design the race conditions that might arise with parallel programming models, and have minimal impact on the software ecosystem, enabling the use of existing code coverage and timing analysis tools. We develop vectorized versions of neural network kernels and show that the NVIDIA Xavier VExts provide a reasonable increase in guaranteed application performance of up to 2.7x. Our analysis contends that VExts are the DPA with arguably the fastest path for adoption in high-integrity systems.

Published in

ACM Transactions on Embedded Computing Systems, Volume 22, Issue 2
March 2023
560 pages
ISSN: 1539-9087
EISSN: 1558-3465
DOI: 10.1145/3572826
Editor: Tulika Mitra

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery
New York, NY, United States

Publication History

• Published: 24 January 2023
• Online AM: 31 August 2022
• Accepted: 16 August 2022
• Revised: 6 July 2022
• Received: 1 March 2022

Qualifiers

• research-article
• Refereed
