Hal Finkel
GRChombo: An adaptable numerical relativity code for fundamental physics
GRChombo is an open-source code for performing Numerical Relativity time evolutions, built on top of the publicly available Chombo software for the solution of PDEs. Whilst GRChombo uses standard techniques in NR, it focusses on applicatio…
Responses to the Request for Information on Stewardship of Software for Scientific and High-Performance Computing
The Office of Science (SC) in the Department of Energy (DOE) invites interested parties to provide input relevant to the stewardship of the software ecosystem for scientific and high-performance computing.
Autotuning PolyBench benchmarks with LLVM Clang/Polly loop optimization pragmas using Bayesian optimization
We develop a ytopt autotuning framework that leverages Bayesian optimization to explore the parameter space search and compare four different supervised learning methods within Bayesian optimization and evaluate their effectiveness. We sel…
Extending C++ for Heterogeneous Quantum-Classical Computing
We present qcor—a language extension to C++ and compiler implementation that enables heterogeneous quantum-classical programming, compilation, and execution in a single-source context. Our work provides a first-of-its-kind C++ compiler ena…
Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version)
In this paper, we develop a ytopt autotuning framework that leverages Bayesian optimization to explore the parameter space search and compare four different supervised learning methods within Bayesian optimization and evaluate their effect…
Non-Proprietary Companion to the Q1 CY2021 PathForward Final Assessment (WBS 2.4.1, Milestone PM-HI-1040)
The PathForward element of the Exascale Computing Project (ECP) Hardware and Integration (HI) focus area has met its goals of preparing the US industry for exascale system procurements and generally improving US competitiveness in the worl…
Report of the Workshop on Program Synthesis for Scientific Computing
Program synthesis is an active research field in academia, national labs, and industry. Yet, work directly applicable to scientific computing, while having some impressive successes, has been limited. This report reviews the relevant areas…
The Last Journey. I. An Extreme-scale Simulation on the Mira Supercomputer
The Last Journey is a large-volume, gravity-only, cosmological N-body simulation evolving more than 1.24 trillion particles in a periodic box with a side length of 5.025 Gpc. It was implemented using the HACC simulation and analysis frame…
Really Embedding Domain-Specific Languages into C++
Domain-specific languages (DSLs) are both pervasive and powerful, but remain difficult to integrate into large projects. As a result, while DSLs can bring distinct advantages in performance, reliability, and maintainability, their use ofte…
Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization
Autotuning is an approach that explores a search space of possible implementations/configurations of a kernel or an application by selecting and evaluating a subset of implementations/configurations on a target platform and/or use model…
Autotuning Search Space for Loop Transformations
One of the challenges for optimizing compilers is to predict whether applying an optimization will improve its execution speed. Programmers may override the compiler's profitability heuristic using optimization directives such as pragmas i…
Extending C++ for Heterogeneous Quantum-Classical Computing
We present qcor - a language extension to C++ and compiler implementation that enables heterogeneous quantum-classical programming, compilation, and execution in a single-source context. Our work provides a first-of-its-kind C++ compiler e…
The Mira-Titan Universe. III. Emulation of the Halo Mass Function
We construct an emulator for the halo mass function over group and cluster mass scales for a range of cosmologies, including the effects of dynamical dark energy and massive neutrinos. The emulator is based on the recently completed Mira-T…
Proxy App Suite Release (FY2020)
Version 4.0 of the ECP Proxy App Suite is practically unchanged from the previous release. The current set of proxies has proven useful for many aspects of benchmarking and co-design and we see little reason to alter the suite. Although th…
FY20 Proxy App Suite Release: Report for ECP Proxy App Project Milestone ADCD-504-10
Version 4.0 of the ECP Proxy App Suite is practically unchanged from the previous release. The current set of proxies has proven useful for many aspects of benchmarking and co-design and we see little reason to alter the suite. Although th…
Full-state quantum circuit simulation by using data compression
Quantum circuit simulations are critical for evaluating quantum algorithms and machines. However, the number of state amplitudes required for full simulation increases exponentially with the number of qubits. In this study, we leverage …
ClangJIT: Enhancing C++ with Just-in-Time Compilation
The C++ programming language is not only a keystone of the high-performance-computing ecosystem but has proven to be a successful base for portable parallel-programming frameworks. As is well known, C++ programmers use templates to special…
Quantitative Performance Assessment of Proxy Apps and Parents (Report for ECP Proxy App Project Milestone AD-CD-PA-504-5)
Proxy applications are small, simplified codes that allow application developers to share important features of large applications without forcing collaborators to assimilate large and complex code bases. Proxies can also be thought of as m…
Hybrid Quantum-Classical Computing Architectures
We describe how classical supercomputing can aid unreliable quantum processors of intermediate size to solve large problem instances reliably. We advocate using a hybrid quantum-classical architecture where larger quantum circuits are brok…
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
In order to evaluate, validate, and refine the design of new quantum algorithms or quantum computers, researchers and developers need methods to assess their correctness and fidelity. This requires the capabilities of quantum circuit simul…
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
Classical simulation of quantum circuits is crucial for evaluating and validating the design of new quantum algorithms. However, the number of quantum state amplitudes increases exponentially with the number of qubits, leading to the expon…
Loop Optimization Framework
The LLVM compiler framework supports a selection of loop transformations such as vectorization, distribution and unrolling. Each transformation is carried out by specialized passes that have been developed independently. In this paper we p…
User-Directed Loop-Transformations in Clang
Directives for the compiler such as pragmas can help programmers to separate an algorithm's semantics from its optimization. This keeps the code understandable and easier to optimize for different platforms. Simple transformations such as …
FY18 Proxy App Suite Release. Milestone Report for the ECP Proxy App Project
The ECP Proxy App Team released version 2.0 of the ECP Proxy App suite on September 28, 2018. The new version includes 5 new proxies and adds or improves coverage in several areas, most notably graph analytics and communication patterns. P…
A Case Study of Integer Sum Reduction using Atomics
This paper presents the implementations of integer sum reduction using atomic functions on FPGA, CPU, and GPU platforms. We explain the implementations and optimizations of the kernel using an OpenCL-based high-level synthesis flow for an F…
Secrets of the Dark Universe: Simulating the Sky on the Blue Gene/Q, The Outer Rim Simulation
An astonishing 99.6% of our Universe is dark. Observations indicate that the Universe consists of 70% of a mysterious dark energy and 25% of a yet-unidentified dark matter component, and only 0.4% of the remaining ordinary matter is visibl…
Quantitative Performance Assessment of Proxy Apps and Parents
This report completes the AD-CD-PA-1040 Milestone: We will develop a quantitative methodology to compare the fidelity of the ECP proxy applications with respect to the parent ECP application they represent. Fidelity includes comparison of …
Quantum Sensing for High Energy Physics
Report of the first workshop to identify approaches and techniques in the domain of quantum sensing that can be utilized by future High Energy Physics applications to further the scientific goals of High Energy Physics.