Exploring foci of:
arXiv (Cornell University)
Towards End-to-End Image Compression and Analysis with Transformers
December 2021 • Yuanchao Bai, Yang Xu, Xianming Liu, Junjun Jiang, Yaowei Wang, Xiangyang Ji, Wen Gao
We propose an end-to-end image compression and analysis model with Transformers, targeting to the cloud-based image classification application. Instead of placing an existing Transformer-based image classification model directly after an image codec, we aim to redesign the Vision Transformer (ViT) model to perform image classification from the compressed features and facilitate image compression with the long-term information from the Transformer. Specifically, we first replace the patchify stem (i.e., image split…
Computer Science
Artificial Intelligence
Image Compression
Transformer
Computer Vision
Convolutional Neural Network
Engineering
Voltage
Electrical Engineering