skip to main content
10.1145/3491418.3535138acmconferencesArticle/Chapter ViewAbstractPublication PagespearcConference Proceedingsconference-collections
extended-abstract

HyperShell v2: Distributed Task Execution for HPC

Published: 08 July 2022 Publication History

Abstract

HyperShell is an elegant, cross-platform, high-performance computing utility for processing shell commands over a distributed, asynchronous queue. It is a highly scalable workflow automation tool for many-task scenarios. There are several existing tools that serve a similar purpose, but lack some aspect that HyperShell provides (e.g., distributed, detailed logging, automated retries, super scale). Novel aspects of HyperShell include but are not limited to (1) cross-platform, (2) client-server design, (3) staggered launch for large scales, (4) persistent hosting of the server, and optionally (5) a database in-the-loop for restarts and persisting task metadata. HyperShell was originally created to support researchers at Purdue University, out of a specific unmet need. It has been in use for several years now. With this next release, we’ve completely re-implemented HyperShell as both an application and a library to provide new features, scalability, flexibility, robustness, and wider support. (https://github.com/glentner/hyper-shell)

References

[1]
GNU Software Foundation. 2022. GNU Make. https://www.gnu.org/software/make/
[2]
NERSC. 2022. TaskFarmer. https://docs.nersc.gov/jobs/workflow/taskfarmer/ TaskFarmer is a workflow manager developed in-house at NERSC to coordinate single or multicore tasks.
[3]
SchedMD. 2022. Slurm Workload Manager. https://slurm.schedmd.com
[4]
Carol Song, Preston Smith, Xiao Zhu, and Rajesh Kalyanam. 2020. NSF Award 2005632 - Category I: Anvil - A National Composable Advanced Computational Resource for the Future of Science and Engineering. (2020). https://www.nsf.gov/awardsearch/showAward?AWD_ID=2005632
[5]
Ole Tange. 2021. GNU Parallel 20220322. https://doi.org/10.5281/zenodo.6377950 GNU Parallel is a general parallelizer to run multiple serial command line programs in parallel without changing them.
[6]
Lucas Wilson, John Fonner, Oscar Esteban, Jason Allison, Marshall Lerner, and Harry Kenya. 2017. Launcher: A simple tool for executing high throughput computing workloads. Journal of Open Source Software (Aug. 2017). https://doi.org/

Cited By

View all
  • (2022)Novel Proposals for FAIR, Automated, Recommendable, and Robust Workflows2022 IEEE/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS)10.1109/WORKS56498.2022.00016(84-92)Online publication date: Nov-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PEARC '22: Practice and Experience in Advanced Research Computing 2022: Revolutionary: Computing, Connections, You
July 2022
455 pages
ISBN:9781450391610
DOI:10.1145/3491418
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2022

Check for updates

Badges

  • Best Poster

Author Tags

  1. distributed computing
  2. high-throughput computing
  3. many-task computing

Qualifiers

  • Extended-abstract
  • Research
  • Refereed limited

Conference

PEARC '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 133 of 202 submissions, 66%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)3
Reflects downloads up to 23 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Novel Proposals for FAIR, Automated, Recommendable, and Robust Workflows2022 IEEE/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS)10.1109/WORKS56498.2022.00016(84-92)Online publication date: Nov-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media