Quantization (signal processing)
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has flourished, extending its use across a multitude of hardware platforms and application scenarios. In this…
A Survey on Learning to Hash
Nearest neighbor search is the problem of finding the data points in a database whose distances to the query point are smallest. Learning to hash is one of the major solutions to this problem and has been widely stu…
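The entry above concerns hash-based nearest neighbor search. As a minimal illustration (not the survey's own method), the sketch below performs brute-force Hamming-distance search over binary codes with NumPy; the array shapes and helper name are assumptions for the example.

```python
import numpy as np

def hamming_knn(db_codes, query_code, k=5):
    """Return indices of the k database codes closest to the query in Hamming distance.

    db_codes: (N, B) array of 0/1 bits, query_code: (B,) array of 0/1 bits.
    """
    dists = np.count_nonzero(db_codes != query_code, axis=1)  # per-row Hamming distance
    return np.argsort(dists)[:k]

# Example: 1000 random 64-bit codes, find the 5 nearest to a random query.
codes = np.random.randint(0, 2, size=(1000, 64))
query = np.random.randint(0, 2, size=64)
print(hamming_knn(codes, query))
```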
A Survey of Quantization Methods for Efficient Neural Network Inference
As soon as abstract mathematical computations were adapted to computation on digital computers, the problem of efficient representation, manipulation, and communication of the numerical values in those computations arose. Strongly related t…
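As a concrete illustration of the basic technique such surveys cover, the sketch below implements plain uniform affine quantization and dequantization of a float array to 8-bit integers. It is a minimal example, not any specific method from the survey.

```python
import numpy as np

def quantize_uniform(x, num_bits=8):
    """Uniform affine quantization: map floats in [x.min(), x.max()] to integers in [0, 2^b - 1]."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = max((x.max() - x.min()) / (qmax - qmin), 1e-12)   # guard against constant input
    zero_point = int(round(qmin - x.min() / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize_uniform(q, scale, zero_point):
    """Approximate reconstruction of the original floats."""
    return scale * (q.astype(np.float32) - zero_point)

x = np.random.randn(4, 4).astype(np.float32)
q, s, z = quantize_uniform(x)
print(np.max(np.abs(x - dequantize_uniform(q, s, z))))   # error is on the order of one quantization step
```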
QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding
Parallel implementations of stochastic gradient descent (SGD) have received significant research attention, thanks to excellent scalability properties of this algorithm, and to its efficiency in the context of training deep neural networks…
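As a hedged sketch of the kind of stochastic gradient quantization QSGD proposes (the level count s is an assumption and the paper's lossless encoding stage is omitted), the code below quantizes a gradient vector to s levels per coordinate with unbiased stochastic rounding.

```python
import numpy as np

def qsgd_quantize(g, s=4):
    """Quantize gradient g to levels {0, 1/s, ..., 1} of its L2 norm, with stochastic rounding.

    Returns (norm, signs, integer levels); the unbiased reconstruction is norm * signs * levels / s.
    """
    norm = np.linalg.norm(g)
    if norm == 0.0:
        return norm, np.zeros_like(g), np.zeros(g.shape, dtype=np.int64)
    scaled = np.abs(g) / norm * s             # real value in [0, s]
    lower = np.floor(scaled)
    prob_up = scaled - lower                  # rounding up with this probability keeps the estimator unbiased
    levels = (lower + (np.random.rand(*g.shape) < prob_up)).astype(np.int64)
    return norm, np.sign(g), levels

def qsgd_dequantize(norm, signs, levels, s=4):
    return norm * signs * levels / s

g = np.random.randn(10)
print(qsgd_dequantize(*qsgd_quantize(g)))
```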
Large Intelligent Surface-Assisted Wireless Communication Exploiting Statistical CSI
Large intelligent surface (LIS)-assisted wireless communications have drawn attention worldwide. With the use of low-cost LIS on building walls, signals can be reflected by the LIS and sent out along desired directions by controlling its p…
FastText.zip: Compressing text classification models
We consider the problem of producing compact architectures for text classification, such that the full model fits in a limited amount of memory. After considering different solutions inspired by the hashing literature, we propose a method …
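The abstract above refers to hashing- and quantization-inspired compression of text-classification models. The sketch below shows plain product quantization of an embedding matrix (subvector-wise k-means with 8-bit codes) as one such technique, assuming scikit-learn is available; it is an illustrative simplification, not the paper's exact pipeline.

```python
import numpy as np
from sklearn.cluster import KMeans

def pq_compress(emb, n_subvectors=4, n_centroids=256):
    """Product quantization: split each row into subvectors, run k-means per subspace,
    and store one 8-bit code per subvector (n_centroids <= 256)."""
    n, d = emb.shape
    sub_d = d // n_subvectors
    codebooks, codes = [], []
    for j in range(n_subvectors):
        sub = emb[:, j * sub_d:(j + 1) * sub_d]
        km = KMeans(n_clusters=n_centroids, n_init=4, random_state=0).fit(sub)
        codebooks.append(km.cluster_centers_)
        codes.append(km.labels_.astype(np.uint8))
    return codebooks, np.stack(codes, axis=1)          # codes: (n, n_subvectors) uint8

def pq_reconstruct(codebooks, codes):
    """Approximate the original matrix from codebooks and codes."""
    return np.concatenate([codebooks[j][codes[:, j]] for j in range(len(codebooks))], axis=1)

emb = np.random.randn(2000, 64).astype(np.float32)
books, codes = pq_compress(emb)
print(np.mean((emb - pq_reconstruct(books, codes)) ** 2))   # reconstruction error after compression
```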
DeepTrust^RT: Confidential Deep Neural Inference Meets Real-Time!
Deep Neural Networks (DNNs) are becoming common in "learning-enabled" time-critical applications such as autonomous driving and robotics. One approach to protect DNN inference from adversarial actions and preserve model privacy/confidentia…
Quantizing deep convolutional networks for efficient inference: A whitepaper
We present an overview of techniques for quantizing convolutional neural networks for inference with integer weights and activations. Per-channel quantization of weights and per-layer quantization of activations to 8-bits of precision post…
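As a small illustration of the per-channel weight quantization the whitepaper describes (symmetric 8-bit, one scale per output channel; the axis convention here is an assumption), a NumPy sketch:

```python
import numpy as np

def quantize_per_channel(w):
    """Symmetric per-channel int8 quantization of a weight tensor.

    Assumes axis 0 is the output-channel axis; each channel gets its own scale.
    """
    reduce_axes = tuple(range(1, w.ndim))
    max_abs = np.max(np.abs(w), axis=reduce_axes, keepdims=True)
    scale = np.maximum(max_abs / 127.0, 1e-12)
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_per_channel(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(8, 3, 3, 3).astype(np.float32)   # e.g. a conv kernel (out, in, kh, kw)
q, s = quantize_per_channel(w)
print(np.max(np.abs(w - dequantize_per_channel(q, s))))
```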
Trained Ternary Quantization
Deep neural networks are widely used in machine learning applications. However, large neural network models can be difficult to deploy on mobile devices with limited power budgets. To solve this problem, we propose Train…
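A minimal sketch of ternary weight quantization in the spirit of the method named above: weights are mapped to {wn, 0, wp}. In Trained Ternary Quantization the two scales are learned during training; here they are fixed arguments, and the magnitude threshold is a simplifying assumption.

```python
import numpy as np

def ternarize(w, threshold_ratio=0.05, wp=1.0, wn=-1.0):
    """Map weights to three values: wp above +t, wn below -t, and 0 in between."""
    t = threshold_ratio * np.max(np.abs(w))   # simple magnitude threshold (assumption)
    q = np.zeros_like(w)
    q[w > t] = wp
    q[w < -t] = wn
    return q

w = np.random.randn(5, 5)
print(ternarize(w, wp=0.7, wn=-0.6))
```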
Deep Hashing Network for Efficient Similarity Retrieval
Due to the storage and retrieval efficiency, hashing has been widely deployed to approximate nearest neighbor search for large-scale multimedia retrieval. Supervised hashing, which improves the quality of hash coding by exploiting the sema…
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Product Quantization (PQ) has long been a mainstream for generating an exponentially large codebook at very low memory/time cost. Despite its success, PQ is still tricky for the decomposition of high-dimensional vector space, and the re…
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
This paper presents incremental network quantization (INQ), a novel method that efficiently converts any pre-trained full-precision convolutional neural network (CNN) model into a low-precision version whose weights are constrained…
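The entry describes constraining CNN weights to low-precision values. The sketch below rounds each weight to the nearest value in {0, ±2^e} for a small set of exponents, which is the weight format INQ targets; the incremental partition-and-retrain procedure that defines the method is omitted, and the exponent range is an assumption.

```python
import numpy as np

def round_to_powers_of_two(w, n_exponents=8):
    """Replace each weight by the nearest value among {0} U {±2^e} for n_exponents exponents."""
    max_exp = np.floor(np.log2(np.max(np.abs(w)) + 1e-12))
    exps = max_exp - np.arange(n_exponents)                      # e.g. max_exp, max_exp - 1, ...
    levels = np.concatenate(([0.0], 2.0 ** exps, -(2.0 ** exps)))
    nearest = np.argmin(np.abs(w[..., None] - levels), axis=-1)  # index of the closest level
    return levels[nearest]

w = np.random.randn(4, 4)
print(round_to_powers_of_two(w))
```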
QLoRA: Efficient Finetuning of Quantized LLMs
We present QLoRA, an efficient finetuning approach that reduces memory usage enough to finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit finetuning task performance. QLoRA backpropagates gradients through a f…
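A hedged sketch of how QLoRA-style finetuning is commonly set up with the Hugging Face transformers, bitsandbytes, and peft libraries: the base model is loaded with 4-bit NF4 quantization and double quantization, and trainable LoRA adapters are attached on top. The model id and LoRA hyperparameters below are placeholders, not values from the paper.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 base weights with double quantization; compute runs in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "your-base-model",              # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)

# Low-rank adapters are the only trainable parameters; gradients flow through the frozen 4-bit base.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # placeholder module names
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```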
Throughput Analysis of Massive MIMO Uplink With Low-Resolution ADCs
We investigate the uplink throughput achievable by a multiple-user (MU) massive multiple-input multiple-output (MIMO) system, in which the base station is equipped with a large number of low-resolution analog-to-digital converters (ADCs). …
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Transformer-based architectures have become the de facto models for a range of Natural Language Processing tasks. In particular, BERT-based models achieved significant accuracy gains on GLUE tasks, CoNLL-03 and SQuAD. However, BERT ba…
Reduced Reference Perceptual Quality Model With Application to Rate Control for Video-Based Point Cloud Compression
In rate-distortion optimization, the encoder settings are determined by maximizing a reconstruction quality measure subject to a constraint on the bitrate. One of the main challenges of this approach is to define a quality measure that can…
One-Bit Over-the-Air Aggregation for Communication-Efficient Federated Edge Learning: Design and Convergence Analysis
Federated edge learning (FEEL) is a popular framework for model training at an edge server using data distributed at edge devices (e.g., smart-phones and sensors) without compromising their privacy. In the FEEL framework, edge devices peri…
Learned Step Size Quantization
Deep networks run with low precision operations at inference time offer power and space advantages over high precision alternatives, but need to overcome the challenge of maintaining high accuracy as precision decreases. Here, we present a…
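A minimal PyTorch sketch of quantization with a learnable step size and a straight-through estimator, which is the core idea of the entry above; the gradient-scale factor used in the paper and other details are omitted, so treat this as an assumption-laden illustration rather than the published method.

```python
import torch
import torch.nn as nn

class LearnedStepQuantizer(nn.Module):
    """Quantize to signed integers with a trainable step size; rounding uses a straight-through estimator."""

    def __init__(self, bits=4):
        super().__init__()
        self.qn = -(2 ** (bits - 1))
        self.qp = 2 ** (bits - 1) - 1
        self.step = nn.Parameter(torch.tensor(0.1))   # learnable quantization step

    def forward(self, x):
        scaled = torch.clamp(x / self.step, self.qn, self.qp)
        # Straight-through estimator: round in the forward pass, identity gradient in the backward pass.
        rounded = scaled + (torch.round(scaled) - scaled).detach()
        return rounded * self.step

q = LearnedStepQuantizer(bits=4)
x = torch.randn(8, requires_grad=True)
y = q(x).sum()
y.backward()                        # gradients reach both x and q.step
print(q.step.grad, x.grad)
```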
Introduction to quantum electromagnetic circuits
The article is a short opinionated review of the quantum treatment of electromagnetic circuits, with no pretension to exhaustiveness. This review, which is an updated and modernized version of a previous set of Les Houches School l…
Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization
Today's HPC applications are producing extremely large amounts of data, such that data storage and analysis are becoming more challenging for scientific research. In this work, we design a new error-controlled lossy compression algorithm f…
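A 1-D toy sketch of the prediction plus error-controlled quantization idea described above, assuming a previous-value predictor: each value is predicted from the decoder-visible reconstruction and the residual is quantized so the pointwise error stays within a user-set bound. The multidimensional predictors and entropy coding of the actual algorithm are not shown.

```python
import numpy as np

def compress_1d(data, error_bound=1e-3):
    """Encode data as integer quantization codes while keeping |data - reconstruction| <= error_bound."""
    codes = np.empty(len(data), dtype=np.int64)
    recon = np.empty(len(data), dtype=np.float64)
    prev = 0.0
    for i, x in enumerate(data):
        pred = prev                                        # previous-value predictor (decoder knows it too)
        codes[i] = int(round((x - pred) / (2.0 * error_bound)))
        prev = pred + codes[i] * 2.0 * error_bound         # reconstruction reused for the next prediction
        recon[i] = prev
    return codes, recon

data = np.cumsum(np.random.randn(1000)) * 0.01
codes, recon = compress_1d(data, error_bound=1e-3)
print(np.max(np.abs(data - recon)) <= 1e-3)                # True: pointwise error is bounded
```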
Redefining near-unity luminescence in quantum dots with photothermal threshold quantum yield
Superefficient light emission: A challenge to improving synthesis methods for superefficient light-emitting semiconductor nanoparticles is that current analytical methods cannot measure efficiencies above 99%. Hanifi et al. used phototherma…
Model compression via distillation and quantization
Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classification to translation or reinforcement learning. One aspect of the field receiving considerable attention is efficiently executing deep mod…
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy. Our method is based on a soft (continuous) relaxation of quantization and entropy, which we anneal to their discret…
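A minimal PyTorch sketch of the soft-to-hard relaxation described above: each value is replaced by a softmax-weighted mixture of codebook centers, and as the temperature is annealed toward zero the assignment hardens into nearest-center quantization. Scalar centers and the annealing schedule are assumptions for the example.

```python
import torch

def soft_quantize(x, centers, temperature=1.0):
    """Differentiable soft assignment of each value in x to a weighted mix of codebook centers."""
    dist = (x.unsqueeze(-1) - centers) ** 2                 # squared distance to every center
    weights = torch.softmax(-dist / temperature, dim=-1)    # soft (continuous) assignment
    return (weights * centers).sum(dim=-1)

centers = torch.tensor([-1.0, -0.25, 0.0, 0.25, 1.0])
x = torch.randn(6)
for t in (1.0, 0.1, 0.01):                                  # annealing the temperature hardens the assignment
    print(t, soft_quantize(x, centers, temperature=t))
```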
On the Spectral Efficiency of Massive MIMO Systems With Low-Resolution ADCs
The low-resolution analog-to-digital convertor (ADC) is a promising solution to significantly reduce the power consumption of radio frequency circuits in massive multiple-input multiple-output (MIMO) systems. In this letter, we investig…
FedPAQ: A Communication-Efficient Federated Learning Method with Periodic Averaging and Quantization
Federated learning is a distributed framework according to which a model is trained over a set of devices, while keeping data localized. This framework faces several systems-oriented challenges which include (i) communication bottleneck si…
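A hedged NumPy sketch of one communication round combining the two ingredients in the title: clients run several local steps before communicating (periodic averaging) and send quantized model updates, which the server averages. The `client_local_train` helper is hypothetical and stands in for the local optimizer, and the simple stochastic quantizer is an assumption, not the paper's exact operator.

```python
import numpy as np

def quantize_update(delta, s=4):
    """Unbiased low-bit stochastic quantization of a model update (simplified)."""
    scale = np.max(np.abs(delta)) + 1e-12
    levels = np.abs(delta) / scale * s
    lower = np.floor(levels)
    rounded = lower + (np.random.rand(*delta.shape) < (levels - lower))  # stochastic rounding
    return np.sign(delta) * rounded * scale / s

def fedpaq_round(global_w, clients, client_local_train, s=4):
    """One round: each client trains locally for several steps, quantizes its update, server averages."""
    updates = []
    for c in clients:
        local_w = client_local_train(global_w.copy(), c)   # hypothetical helper: runs local SGD steps
        updates.append(quantize_update(local_w - global_w, s=s))
    return global_w + np.mean(updates, axis=0)

# Toy usage: "training" just nudges the weights toward each client's target vector.
clients = [np.full(10, i, dtype=np.float64) for i in range(3)]
local_train = lambda w, target: w + 0.1 * (target - w)
w = np.zeros(10)
for _ in range(5):
    w = fedpaq_round(w, clients, local_train)
print(w)
```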
Deep Metric Learning to Rank
We propose a novel deep metric learning method by revisiting the learning to rank approach. Our method, named FastAP, optimizes the rank-based Average Precision measure, using an approximation derived from distance quantization. FastAP has…
Single Path One-Shot Neural Architecture Search with Uniform Sampling
We revisit the one-shot Neural Architecture Search (NAS) paradigm and analyze its advantages over existing NAS approaches. Existing one-shot methods, however, are hard to train and not yet effective on large-scale datasets like ImageNet. Thi…
UVeQFed: Universal Vector Quantization for Federated Learning
Traditional deep learning models are trained at a centralized server using labeled data samples collected from end devices or users. Such data samples often include private information, which the users may not be willing to share. Feder…
Extremely Low Bit Neural Network: Squeeze the Last Bit Out With ADMM
Although deep learning models are highly effective for various learning tasks, their high computational costs prohibit the deployment to scenarios where either memory or computational resources are limited. In this paper, we focus on compr…
Secure and Robust Fragile Watermarking Scheme for Medical Images
Over the past decade, advances in computer-based communication and health services have made image security an urgent requirement for both safety and non-safety medical applications. This paper proposes a new frag…