Explanipedia

AI: It's All About Inference Now Open

Michael Gschwind · 2025

As the scaling of pretraining is reaching a plateau of diminishing returns, model inference is quickly becoming an important driver for model performance. Today, test-time compute scaling offers a new, exciting avenue to increase model per…

Multi-petascale highly efficient parallel supercomputer Open

Sameh Asaad, Ralph E. Bellofatto, Michael A. Blocksome, Matthias A. Blumrich, Peter Boyle , et al. · 2023

A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaOPS-scale computing, at decreased cost, power and footprint, and that allows for a maximum packaging density of processing nodes from an interconnect point of view. The S…

Matrix multiplication operations using pair-wise load and splat operations Open

Alexandre E. Eichenberger, Michael Gschwind, John A. Gunnels, Valentina Salapura · 2023

Mechanisms for performing a matrix multiplication operation are provided. A vector load operation is performed to load a first vector operand of the matrix multiplication operation to a first target vector register. A pair-wise load and sp…

Sustainable AI: Environmental Implications, Challenges and Opportunities Open

Carole-Jean Wu, Ramya Raghavendra, Udit Gupta, Bilge Acun, Newsha Ardalani , et al. · 2021

This paper explores the environmental impact of the super-linear growth trends for AI from a holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the carbon footprint of AI computing by examining the model …

First-Generation Inference Accelerator Deployment at Facebook Open

Michael J. Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix , et al. · 2021

In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and n…

Steering Committee Open

Antonio González, Michael Gschwind, Hillery C. Hunter, Wen‐mei Hwu, Natalie Enright Jerger , et al. · 2020

Michael Gschwind YOU? Author Swipe