Jeff Pool
Accelerating Sparse Deep Neural Networks
As neural network model sizes have dramatically increased, so has the interest in various techniques to reduce their parameter counts and accelerate their execution. An active area of research in this field is sparsity - encouraging zero v…
Self-Supervised GAN Compression
Deep learning's success has led to larger and larger models to handle more and more complex tasks; trained models can contain millions of parameters. These large models are compute- and memory-intensive, which makes it a challenge to deplo…
Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs
GPUs offer orders-of-magnitude higher memory bandwidth than traditional CPU-only systems. However, GPU device memory tends to be relatively small, and its capacity cannot be increased by the user. This paper describes Buddy Compress…
Structurally Sparsified Backward Propagation for Faster Long Short-Term Memory Training
Exploiting sparsity enables hardware systems to run neural networks faster and more energy-efficiently. However, most prior sparsity-centric optimization techniques only accelerate the forward pass of neural networks and usually require an…
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Recurrent Neural Networks (RNNs) are powerful tools for solving sequence-based problems, but their efficacy and execution time are dependent on the size of the network. Following recent work in simplifying these networks with model pruning…
Efficient Sparse-Winograd Convolutional Neural Networks
Convolutional Neural Networks (CNNs) are computationally intensive, which limits their application on mobile devices. Their energy is dominated by the number of multiplies needed to perform the convolutions. Winograd's minimal filtering al…
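The abstract cuts off at Winograd's minimal filtering algorithm. As a rough, generic illustration of that algorithm (not the paper's 2-D, sparsity-aware version), the 1-D F(2,3) sketch below computes two outputs of a 3-tap filter with four multiplies instead of six; the example values are illustrative.

```python
# Minimal 1-D sketch of Winograd's F(2,3) filtering: two outputs of a
# 3-tap correlation using 4 multiplies instead of 6.
def winograd_f23(d, g):
    d0, d1, d2, d3 = d          # four input samples
    g0, g1, g2 = g              # three filter taps
    m1 = (d0 - d2) * g0
    m2 = (d1 + d2) * (g0 + g1 + g2) / 2
    m3 = (d2 - d1) * (g0 - g1 + g2) / 2
    m4 = (d1 - d3) * g2
    return [m1 + m2 + m3, m2 - m3 - m4]

def direct(d, g):
    # Straightforward sliding-window correlation (6 multiplies).
    return [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]

d, g = [1.0, 2.0, -1.0, 3.0], [0.5, -1.0, 2.0]
assert all(abs(a - b) < 1e-9 for a, b in zip(winograd_f23(d, g), direct(d, g)))
```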
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Popular deep learning frameworks require users to fine-tune their memory usage so that the training data of a deep neural network (DNN) fits within the GPU physical memory. Prior work tries to address this restriction by virtualizing the m…
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Sparsity helps reduce the computational complexity of deep neural networks by skipping zeros. Taking advantage of sparsity is listed as a high priority in next-generation DNN accelerators such as the TPU. The structure of sparsity, i.e., the g…
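To make the "skipping zeros" point concrete, the snippet below evaluates a dot product only over the stored nonzero weights. This is a generic sketch, not the paper's accelerator design or its granularity study.

```python
# Store only the nonzero weights and do work proportional to the nonzeros
# rather than the full weight count. Sparsity level is illustrative.
import numpy as np

rng = np.random.default_rng(1)
w = rng.normal(size=1024)
w[rng.random(1024) < 0.9] = 0.0          # prune ~90% of the weights to zero
x = rng.normal(size=1024)

dense = float(w @ x)                      # touches every weight

idx = np.flatnonzero(w)                   # indices of the survivors
sparse = float(w[idx] @ x[idx])           # touches only the ~10% nonzeros

print(len(idx), "multiplies instead of", len(w))
assert np.isclose(dense, sparse)
```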
DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow
Modern deep neural networks have a large number of parameters, making them very powerful machine learning systems. A critical issue for training such large networks on large-scale data-sets is to prevent overfitting while at the same time …
DSD: Dense-Sparse-Dense Training for Deep Neural Networks
Modern deep neural networks have a large number of parameters, making them very hard to train. We propose DSD, a dense-sparse-dense training flow, for regularizing deep neural networks and achieving better optimization performance. In the …
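As a hedged sketch of the dense-sparse-dense schedule named in the abstract, the toy NumPy example below trains a dense model, prunes and retrains under a fixed mask, then restores the pruned connections and retrains densely. The 50% sparsity, model size, and step counts are illustrative, not the paper's settings.

```python
# Dense -> Sparse -> Dense schedule on a toy linear regression.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 32))
w_true = rng.normal(size=32) * (rng.random(32) < 0.5)   # toy target weights
y = X @ w_true

def train(w, mask, steps=300, lr=0.1):
    """Gradient descent on squared error; `mask` keeps pruned weights at zero."""
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / len(X)
        w = (w - lr * grad) * mask
    return w

dense_mask = np.ones(32)

# D: train the dense model.
w = train(np.zeros(32), dense_mask)

# S: prune the smallest-magnitude half of the weights and retrain the
# surviving connections with the mask held fixed.
threshold = np.quantile(np.abs(w), 0.5)
sparse_mask = (np.abs(w) >= threshold).astype(float)
w = train(w * sparse_mask, sparse_mask)

# D: restore the pruned connections (re-initialized at zero) and retrain
# the full dense model once more.
w = train(w, dense_mask)
print("final error:", float(np.mean((X @ w - y) ** 2)))
```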
Learning both Weights and Connections for Efficient Neural Networks
Neural networks are both computationally intensive and memory intensive, making them difficult to deploy on embedded systems. Also, conventional networks fix the architecture before training starts; as a result, training cannot improve the…
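A hedged sketch of the prune-and-retrain loop this line of work is built on (a toy NumPy model with illustrative settings, not the paper's procedure): train densely, keep only the largest-magnitude connections at each stage, and retrain the survivors so the fit recovers.

```python
# Iterative magnitude pruning with retraining on a toy linear model.
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(512, 64))
w_true = np.zeros(64)
w_true[:6] = rng.normal(size=6)          # a genuinely sparse target
y = X @ w_true

def train(w, mask, steps=300, lr=0.1):
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / len(X)
        w = (w - lr * grad) * mask       # pruned connections stay at zero
    return w

w = train(np.zeros(64), np.ones(64))     # initial dense training

# Prune in stages, retraining after each stage, until ~90% of the
# connections are removed.
for keep in (0.5, 0.25, 0.1):
    k = int(keep * w.size)
    mask = np.zeros_like(w)
    mask[np.argsort(np.abs(w))[-k:]] = 1.0   # keep the k largest weights
    w = train(w * mask, mask)
    err = float(np.mean((X @ w - y) ** 2))
    print(f"kept {k}/{w.size} weights, error {err:.6f}")
```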