
A High-Level Feature Model to Predict the Encoding Energy of a Hardware Video Encoder

D Jayanarayana Reddy, Christian Herglotz, André Kaup · 2025

In today's society, live video streaming and user generated content streamed from battery powered devices are ubiquitous. Live streaming requires real-time video encoding, and hardware video encoders are well suited for such an encoding ta…

Optimized Learned Image Compression for Facial Expression Recognition

Xiumei Li, Marc Windsheimer, Lydia Helene Rupp, Bjoern M. Eskofier, André Kaup · 2025

Efficient data compression is crucial for the storage and transmission of visual data. However, in facial expression recognition (FER) tasks, lossy compression often leads to feature degradation and reduced accuracy. To address these chall…

Compact Latent Representation for Image Compression (CLRIC)

Ayman A. Ameen, Thomas Richter, André Kaup · 2025

Current image compression models often require separate models for each quality level, making them resource-intensive in terms of both training and storage. To address these limitations, we propose an innovative approach that utilizes late…

Overview of Variable Rate Coding in JPEG AI

Panqi Jia, Fabian Brand, Dequan Yu, Alexander Karabutov, Elena Alshina, et al. · 2025

Empirical evidence has demonstrated that learning-based image compression can outperform classical compression frameworks. This has led to the ongoing standardization of learning-based image codecs, namely Joint Photographic Experts Group (…

Improved Motion Plane Adaptive 360-Degree Video Compression Using Affine Motion Models

M. Healy Ritthaler, Andy Regensky, André Kaup · 2025

Efficient compression of 360-degree video content requires the application of advanced motion models for interframe prediction. The Motion Plane Adaptive (MPA) motion model projects the frames on multiple perspective planes in the 3D space…

Inter-Camera Color Correction for Multispectral Imaging with Camera Arrays Using a Consensus Image

Katja Kossira, Jürgen Seiler, André Kaup · 2024

This paper introduces a novel method for inter-camera color calibration for multispectral imaging with camera arrays using a consensus image. Capturing images using multispectral camera arrays has gained importance in medical, agricultural…

Variable Rate Learned Wavelet Video Coding using Temporal Layer Adaptivity

Anna Meyer, André Kaup · 2024

Learned wavelet video coders provide an explainable framework by performing discrete wavelet transforms in temporal, horizontal, and vertical dimensions. With a temporal transform based on motion-compensated temporal filtering (MCTF), spat…

Conditional Optimal Filter Selection for Multispectral Object Classification

Katja Kossira, David Schön, Jürgen Seiler, André Kaup · 2024

Capturing images using multispectral camera arrays has gained importance in medical, agricultural and environmental processes. However, using all available spectral bands is infeasible and produces much data, while only a fraction is neede…

Modeling the Energy Consumption of the HEVC Software Encoding Process using Processor Events

Geetha Ramasubbu, André Kaup, Christian Herglotz · 2024

Developing energy-efficient video encoding algorithms is highly important due to the high processing complexities and, consequently, the high energy demand of the encoding process. To accomplish this, the energy consumption of the video en…

Design Space Exploration at Frame-Level for Joint Decoding Energy and Quality Optimization in VVC

Teresa Stürzenhofäcker, Matthias Kränzler, Christian Herglotz, André Kaup · 2024

In the pursuit of a reduced energy demand of VVC decoders, it was found that the coding tool configuration has a substantial influence on the bit rate efficiency and the decoding energy demand. The Advanced Design Space Exploration algorit…

Fast Edge-Aware Occlusion Detection in the Context of Multispectral Camera Arrays

Frank Sippel, Jürgen Seiler, André Kaup · 2024

Multispectral imaging is very beneficial in diverse applications, like healthcare and agriculture, since it can capture absorption bands of molecules in different spectral areas. A promising approach for multispectral snapshot imaging are …

End-to-end learned Lossy Dynamic Point Cloud Attribute Compression

Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, André Kaup · 2024

Recent advancements in point cloud compression have primarily emphasized geometry compression while comparatively fewer efforts have been dedicated to attribute compression. This study introduces an end-to-end learned dynamic lossy attribu…

High-Resolution Hyperspectral Video Imaging Using A Hexagonal Camera Array

Frank Sippel, Jürgen Seiler, André Kaup · 2024

Retrieving the reflectance spectrum from objects is an essential task for many classification and detection problems, since many materials and processes have a unique spectral behaviour. In many cases, it is highly desirable to capture hyp…

SVT-AV1 Encoding Bitrate Estimation Using Motion Search Information

Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, et al. · 2024

Enabling high compression efficiency while keeping encoding energy consumption at a low level requires prioritization of which videos need more sophisticated encoding techniques. However, the effects vary highly based on the content, and …

A Study on the Effect of Color Spaces in Learned Image Compression

Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, et al. · 2024

In this work, we present a comparison between the color spaces YUV, LAB, and RGB and their effect on learned image compression. For this, we use the structure and color based learned image codec (SLIC) from our prior work, which consists of …

Towards Video Codec Performance Evaluation: A Rate-Energy-Distortion Perspective

Geetha Ramasubbu, André Kaup, Christian Herglotz · 2024

The Bjøntegaard Delta rate (BD-rate) objectively assesses the coding efficiency of video codecs using the rate-distortion (R-D) performance but overlooks encoding energy, which is crucial in practical applications, especially for those …

Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network

Frank Sippel, Jürgen Seiler, André Kaup · 2024

Multispectral imaging aims at recording images in different spectral bands. This is extremely beneficial in diverse discrimination applications, for example in agriculture, recycling or healthcare. One approach for snapshot multispectral i…

On Annotation-free Optimization of Video Coding for Machines

Marc Windsheimer, Fabian Brand, André Kaup · 2024

Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, research on video coding for…

Efficient Learned Wavelet Image and Video Coding

Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup · 2024

Learned wavelet image and video coding approaches provide an explainable framework with a latent space corresponding to a wavelet decomposition. The wavelet image coder iWave++ achieves state-of-the-art performance and has been employed fo…

Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model

Panqi Jia, A. Burakhan Koyuncu, Jue Mao, Ze Cui, Yi Ma, et al. · 2024

The research on neural network (NN) based image compression has shown superior performance compared to classical compression frameworks. Unlike the hand-engineered transforms in the classical frameworks, NN-based models learn the non-linea…

Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization

Panqi Jia, Jue Mao, Esin Koyuncu, A. Burakhan Koyuncu, Timofey Solovyev, et al. · 2024

Currently, there is a high demand for neural network-based image compression codecs. These codecs employ non-linear transforms to create compact bit representations and facilitate faster coding speeds on devices compared to the hand-crafte…

Forensic analysis of AI-compression traces in spatial and frequency domain

Sandra Bergmann, Denise Moussa, Fabian Brand, André Kaup, Christian Rieß · 2024

The classical JPEG compression is a rich source of cues for forensic image analysis. However, this compression standard will in the near future be complemented by a new, highly efficient learning-based compression standard called JPEG-AI. …

Energy Demand Prediction for Hardware Video Decoders Using Software Profiling

Matthias Kränzler, Christian Herglotz, André Kaup · 2024

Energy efficiency for video communications is essential for mobile devices with a limited battery capacity. Therefore, hardware decoder implementations are commonly used to significantly reduce the energetic load of video playback. The ene…

Analysis of Neural Video Compression Networks for 360-Degree Video Coding

Andy Regensky, Fabian Brand, André Kaup · 2024

With the increasing efforts of bringing high-quality virtual reality technologies into the market, efficient 360-degree video compression gains in importance. As such, the state-of-the-art H.266/VVC video coding standard integrates dedicat…

A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders

Matthias Kränzler, Christian Herglotz, André Kaup · 2024

Energy and compression efficiency are two essential parts of modern video decoder implementations that have to be considered. This work comprehensively studies the following six video coding formats regarding compression and decoding energ…

SLIC: A Learned Image Codec Using Structure and Color

Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, et al. · 2024

We propose the structure and color based learned image codec (SLIC) in which the task of compression is split into that of luminance and chrominance. The deep learning model is built with a novel multi-scale architecture for Y and UV chann…

Encoding Time and Energy Model for SVT-AV1 based on Video Complexity

Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, et al. · 2024

The share of online video traffic in global carbon dioxide emissions is growing steadily. To comply with the demand for video media, dedicated compression techniques are continuously optimized, but at the expense of increasingly higher com…

Temporal Context Network for 3D Human Pose Estimation with Graph Attention

Ming Zhao, Zhengdong Zeng, André Kaup · 2024

The Bjøntegaard Bible: Why Your Way of Comparing Video Codecs May Be Wrong

Christian Herglotz, Hannah Och, Anna Meyer, Geetha Ramasubbu, Lena Eichermüller, et al. · 2024

In this paper, we provide an in-depth assessment of the Bjøntegaard Delta. We construct a large data set of video compression performance comparisons using a diverse set of metrics including PSNR, VMAF, bitrate, and processing energies. Th…

Enhanced Color Palette Modeling for Lossless Screen Content Compression

Hannah Och, Shabhrish Reddy Uddehaly, Tilo Strutz, André Kaup · 2023

Soft context formation is a lossless image coding method for screen content. It encodes images pixel by pixel via arithmetic coding by collecting statistics for probability distribution estimation. Its main pipeline includes three stages, …
