Explanipedia

5G Energy FRAME Report on 5G for Grid Use Case (Year 3 Final Report) Open

Xiaoyuan Fan, Yousu Chen, Dexin Wang, Chuan Qin, Yuan Liu , et al. · 2024

This report provides an extensive overview of the interrelationships among energy, communication, and computing—especially in the context of decarbonization goals, challenges, and opportunities. Technical examples enabled by 5G technologie…

Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs Open

Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin Barker , et al. · 2024

Computer science Engineering

The relentless advancement of artificial intelligence (AI) and machine learning (ML) applications necessitates the development of specialized hardware accelerators capable of handling the increasing complexity and computational demands. Tr…

Beyond the Bridge: Contention-Based Covert and Side Channel Attacks on Multi-GPU Interconnect Open

Yicheng Zhang, Ravan Nazaraliyev, Sankha Baran Dutta, Nael Abu‐Ghazaleh, Andrés Márquez , et al. · 2024

Computer science Medicine Philosophy

High-speed interconnects, such as NVLink, are integral to modern multi-GPU systems, acting as a vital link between CPUs and GPUs. This study highlights the vulnerability of multi-GPU systems to covert and side channel attacks due to conges…

Experiences from the Roadrunner petascale hybrid systems Open

Darren J. Kerbyson, Scott Pakin, Michael Lang, Jose Carlos Sancho Pitarch, Kei Davis , et al. · 2024

Computer science Materials science

The combination of flexible microprocessors (AMD Opterons) with high-performing accelerators (IBM PowerXCell 8i) resulted in the extremely powerful Roadrunner system. Many challenges in both hardware and software were overcome to achieve i…

Comparing current cluster, massively parallel, and accelerated systems Open

Kevin Barker, Kei Davis, Adolfy Hoisie, Darren J. Kerbyson, Scott Pakin , et al. · 2024

Computer science Engineering Materials science

Currently there is large architectural diversity in high perfonnance computing systems. They include 'commodity' cluster systems that optimize per-node performance for small jobs, massively parallel processors (MPPs) that optimize aggregat…

The Landscape of Modern Machine Learning: A Review of Machine, Distributed and Federated Learning Open

Omer Subasi, Oceane Bel, Joseph Manzano, Kevin Barker · 2023

Computer science Mathematics Psychology

With the advance of the powerful heterogeneous, parallel and distributed computing systems and ever increasing immense amount of data, machine learning has become an indispensable part of cutting-edge technology, scientific research and co…

MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications Open

Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari , et al. · 2023

Computer science Physics

Emerging deep learning workloads urgently need fast general matrix multiplication (GEMM). To meet such demand, one of the critical features of machine-learning-specific accelerators such as NVIDIA Tensor Cores, AMD Matrix Cores, and Google…

Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs Open

Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin Barker , et al. · 2023

Computer science Engineering Geography

The relentless advancement of artificial intelligence (AI) and machine learning (ML) applications necessitates the development of specialized hardware accelerators capable of handling the increasing complexity and computational demands. Tr…

5G Energy FRAME: The Design and Implementation of Data, Model, and Use Case (Year 2 Report) Open

Xiaoyuan Fan, Dexin Wang, Chuan Qin, Kishan Prudhvi Guddanti, Venkataramani Kumar , et al. · 2023

Computer science Engineering Mathematics

This report summarizes the Year 2 work of Pacific Northwest National Laboratory’s (PNNL’s) 5G Fabricated Resource and Asset Management Encompassment for energy infrastructure (Energy FRAME) project funded by the Department of Energy Office…

Denial of Service Attack Detection via Differential Analysis of Generalized Entropy Progressions Open

Omer Subasi, Joseph Manzano, Kevin Barker · 2023

Computer science Physics

Denial-of-Service (DoS) attacks are one of the most common and consequential\ncyber attacks in computer networks. While existing research offers a plethora\nof detection methods, the issue of achieving both scalability and high\ndetection …

Codesign for Extreme Heterogeneity: Integrating Custom Hardware With Commodity Computing Technology to Support Next-Generation HPC Converged Workloads Open

James Ang, Kevin Barker, Draguna Vrabie, Gökçen Kestor · 2022

Computer science Economics

The future of high-performance technical computing will be driven by the convergence of physical simulation, Artificial Intelligence (AI), Machine Learning (ML), and data science computing capabilities. While computational performance gain…

Direction-optimizing Label Propagation Framework for Structure Detection in Graphs: Design, Implementation, and Experimental Analysis Open

Tony Liu, Andrew Lumsdaine, Mahantesh Halappanavar, Kevin Barker, Assefaw H. Gebremedhin · 2022

Computer science Biology Geography

Label Propagation is not only a well-known machine learning algorithm for classification but also an effective method for discovering communities and connected components in networks. We propose a new Direction-optimizing Label Propagation…

MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems Open

Jieyang Chen, Chenhao Xie, Jesun Firoz, Jiajia Li, Shuaiwen Leon Song , et al. · 2022

Computer science Mathematics Materials science

Sparse linear algebra kernels play a critical role in numerous applications, covering from exascale scientific simulation to large-scale data analytics. Offloading linear algebra kernels on one GPU will no longer be viable in these applica…

MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Multi-GPU Platforms Open

Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin Barker , et al. · 2022

Computer science Mathematics Economics

The increasing size of input graphs for graph neural networks (GNNs) highlights the demand for using multi-GPU platforms. However, existing multi-GPU GNN systems optimize the computation and communication individually based on the conventi…

Technical Characterization and Benefit Evaluation of 5G-Enabled Grid Data Transport and Applications Open

Xiaoyuan Fan, James Ogle, Melanie Cree‐Green, Dexin Wang, Yousu Chen , et al. · 2022

Computer science Engineering Mathematics

This report summarizes the Year 1 work of Pacific Northwest National Laboratory’s (PNNL’s) 5G Fabricated Resource and Asset Management Encompassment for energy infrastructure (Energy FRAME) project funded by the Department of Energy Office…

Spy in the GPU-box: Covert and Side Channel Attacks on Multi-GPU Systems Open

Sankha Baran Dutta, Hoda Naghibijouybari, Arjun K. Gupta, Nael Abu‐Ghazaleh, Andrés Márquez , et al. · 2022

Computer science

The deep learning revolution has been enabled in large part by GPUs, and more recently accelerators, which make it possible to carry out computationally demanding training and inference in acceptable times. As the size of machine learning …

Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU Open

Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin Barker , et al. · 2022

Computer science Physics

In a general graph data structure like an adjacency matrix, when edges are homogeneous, the connectivity of two nodes can be sufficiently represented using a single bit. This insight has, however, not yet been adequately exploited by the e…

Hardware Evaluation Analytical Modeling and Node Simulation: Benefits of Tighter GPU Integration Open

Brian Austin, Raymond A. Bair, Kevin Barker, Anthony M. Cabrera, Andrew A. Chien , et al. · 2021

Computer science Engineering

In this report, we examine several emerging technologies of interest to the Department of Energy and its computational centers. These include: 1) quantifying the benefit of tighter CPU-GPU integration, 2) quantifying the appropriate CPU co…

Denial-of-Service Attack Detection via Differential Analysis of Generalized Entropy Progressions Open

Omer Subasi, Joseph Manzano, Kevin Barker · 2021

Computer science Physics

Denial-of-Service (DoS) attacks are one of the most common and consequential cyber attacks in computer networks. While existing research offers a plethora of detection methods, the issue of achieving both scalability and high detection acc…

Leaky Buddies: Cross-Component Covert Channels on Integrated CPU-GPU Systems Open

Sankha Baran Dutta, Hoda Naghibijouybari, Nael Abu‐Ghazaleh, Andrés Márquez, Kevin Barker · 2021

Computer science Philosophy Physics

Graphics Processing Units (GPUs) are a ubiquitous component across the range of today's computing platforms, from phones and tablets, through personal computers, to high-end server class platforms. With the increasing importance of graphic…

ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing Open

Cheng Tan, Chenhao Xie, Tong Geng, Andrés Márquez, Antonino Tumeo , et al. · 2021

Computer science Economics Engineering

The next generation HPC and data centers are likely to be reconfigurable and data-centric due to the trend of hardware specialization and the emergence of data-driven applications. In this work, we propose ARENA – an asynchronous reconfigu…

pnnl/arena Open

Cheng Tan, PNNL Developer Central, Chenhao Xie, Tony Geng, Antonino Tumeo , et al. · 2021

Geography

CFA ARENA is a novel programming model with the support of a runtime targeting asynchronous data-centric execution paradigm in a distributed system. All the machine nodes in ARENA are connected by a ring network to bring the specialized co…

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures Open

Chenhao Xie, Jieyang Chen, Jesun Firoz, Jiajia Li, Shuaiwen Leon Song , et al. · 2020

Computer science

Designing efficient and scalable sparse linear algebra kernels on modern multi-GPU based HPC systems is a daunting task due to significant irregular memory references and workload imbalance across the GPUs. This is particularly the case fo…

Leaky Buddies: Cross-Component Covert Channels on Integrated CPU-GPU\n Systems Open

Sankha Baran Dutta, Hoda Naghibijouybari, Nael Abu‐Ghazaleh, Andrés Márquez, Kevin Barker · 2020

Computer science Philosophy Physics

Graphics Processing Units (GPUs) are a ubiquitous component across the range\nof today's computing platforms, from phones and tablets, through personal\ncomputers, to high-end server class platforms. With the increasing importance\nof grap…

ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing Open

Cheng Tan, Chenhao Xie, Tong Geng, Andrés Márquez, Antonino Tumeo , et al. · 2020

Computer science Engineering Economics

The next generation HPC and data centers are likely to be reconfigurable and data-centric due to the trend of hardware specialization and the emergence of data-driven applications. In this paper, we propose ARENA -- an asynchronous reconfi…

Kevin Barker YOU? Author Swipe