Dorian Arnold
YOU?
Author Swipe
View article: Report of the 2025 Workshop on Next-Generation Ecosystems for Scientific Computing: Harnessing Community, Software, and AI for Cross-Disciplinary Team Science
Report of the 2025 Workshop on Next-Generation Ecosystems for Scientific Computing: Harnessing Community, Software, and AI for Cross-Disciplinary Team Science Open
This report summarizes insights from the 2025 Workshop on Next-Generation Ecosystems for Scientific Computing: Harnessing Community, Software, and AI for Cross-Disciplinary Team Science, which convened more than 40 experts from national la…
View article: Message from the 2021 Program Chair
Message from the 2021 Program Chair Open
Pandemic notwithstanding, we are delighted to present a strong technical program.We received a total of 558 preliminary abstracts and 462 complete technical paper submissions this year.All submissions went through a thorough review process…
View article: INCA
INCA Open
Current proposals for in-network data processing operate on data as it streams through a network switch or endpoint. Since compute resources must be available when data arrives, these approaches provide deadline-based models of execution. …
View article: On the memory attribution problem: A solution and case study using MPI
On the memory attribution problem: A solution and case study using MPI Open
Summary As parallel applications running on large‐scale computing systems become increasingly memory constrained, the ability to attribute memory usage to the various components of the application is becoming increasingly important. We pre…
View article: Improving MPI Multi-threaded RMA Communication Performance
Improving MPI Multi-threaded RMA Communication Performance Open
One-sided communication is crucial to enabling communication concurrency. As core counts have increased, particularly with many-core architectures, one-sided (RMA) communication has been proposed to address the ever increasing contention a…
View article: Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms
Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms Open
In high-performance computing environments, input/output (I/O) from various sources often contend for scarce available bandwidth. Adding to the I/O operations inherent to the failure-free execution of an application, I/O from checkpoint/re…
View article: Unraveling Network-Induced Memory Contention: Deeper Insights with Machine Learning
Unraveling Network-Induced Memory Contention: Deeper Insights with Machine Learning Open
Remote Direct Memory Access (RDMA) is expected to be an integral communication mechanism for future exascale systems enabling asynchronous data transfers, so that applications may fully utilize CPU resources while simultaneously sharing da…
View article: Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms
Optimal Cooperative Checkpointing for Shared High-Performance Computing Platforms Open
In high-performance computing environments, input/output (I/O) from varioussources often contend for scare available bandwidth. Adding to the I/O operations inherent tothe failure-free execution of an application, I/O from checkpoint/resta…
View article: Accommodating Thread-Level Heterogeneity in Coupled Parallel Applications
Accommodating Thread-Level Heterogeneity in Coupled Parallel Applications Open
Hybrid parallel program models that combine message passing and multithreading (MP+MT) are becoming more popular, extending the basic message passing (MP) model that uses single-threaded processes for both inter- and intra-node parallelism…
View article: (SAI) Stalled, Active and Idle: Characterizing Power and Performance of Large-Scale Dragonfly Networks
(SAI) Stalled, Active and Idle: Characterizing Power and Performance of Large-Scale Dragonfly Networks Open
Exascale networks are expected to comprise a significant part of the total monetary cost and 10-20% of the power budget allocated to exascale systems. Yet, our understanding of current and emerging workloads on these networks is limited. L…
View article: VM-based Slack Emulation of Large-scale Systems.
VM-based Slack Emulation of Large-scale Systems. Open
This paper describes the design of a system to enable largescale testing of new software stacks and prospective high-end computing architectures. The proposed architecture combines system virtualization, time dilation, architectural simula…