Towards End-to-End Image Compression and Analysis with Transformers

Exploring foci of: arXiv (Cornell University) Towards End-to-End Image Compression and Analysis with Transformers December 2021 • Yuanchao Bai, Yang Xu, Xianming Liu, Junjun Jiang, Yaowei Wang, Xiangyang Ji, Wen Gao We propose an end-to-end image compression and analysis model with Transformers, targeting to the cloud-based image classification application. Instead of placing an existing Transformer-based image classification model directly after an image codec, we aim to redesign the Vision Transformer (ViT) model to perform image classification from the compressed features and facilitate image compression with the long-term information from the Transformer. Specifically, we first replace the patchify stem (i.e., image split… Open Article Page

Computer Science Artificial Intelligence Image Compression Transformer Computer Vision Convolutional Neural Network Engineering Voltage Electrical Engineering Open Article