Alan C. Bovik
YOU?
Author Swipe
View article: Non-Aligned Reference Image Quality Assessment for Novel View Synthesis
Non-Aligned Reference Image Quality Assessment for Novel View Synthesis Open
Evaluating the perceptual quality of Novel View Synthesis (NVS) images remains a key challenge, particularly in the absence of pixel-aligned ground truth references. Full-Reference Image Quality Assessment (FR-IQA) methods fail under misal…
View article: VIDMP3: Video Editing by Representing Motion with Pose and Position Priors
VIDMP3: Video Editing by Representing Motion with Pose and Position Priors Open
Motion-preserved video editing is crucial for creators, particularly in scenarios that demand flexibility in both the structure and semantics of swapped objects. Despite its potential, this area remains underexplored. Existing diffusion-ba…
View article: PIT-QMM: A Large Multimodal Model For No-Reference Point Cloud Quality Assessment
PIT-QMM: A Large Multimodal Model For No-Reference Point Cloud Quality Assessment Open
Large Multimodal Models (LMMs) have recently enabled considerable advances in the realm of image and video quality assessment, but this progress has yet to be fully explored in the domain of 3D assets. We are interested in using these mode…
View article: AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs Open
Vision Graph Neural Networks (ViGs) have demonstrated promising performance in image recognition tasks against Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). An essential part of the ViG framework is the node-neighbor…
View article: CHUG: Crowdsourced User-Generated HDR Video Quality Dataset
CHUG: Crowdsourced User-Generated HDR Video Quality Dataset Open
High Dynamic Range (HDR) videos enhance visual experiences with superior brightness, contrast, and color depth. The surge of User-Generated Content (UGC) on platforms like YouTube and TikTok introduces unique challenges for HDR video quali…
View article: Perceptual Classifiers: Detecting Generative Images using Perceptual Features
Perceptual Classifiers: Detecting Generative Images using Perceptual Features Open
Image Quality Assessment (IQA) models are employed in many practical image and video processing pipelines to reduce storage, minimize transmission costs, and improve the Quality of Experience (QoE) of millions of viewers. These models are …
View article: FaceExpressions-70k: A Dataset of Perceived Expression Differences
FaceExpressions-70k: A Dataset of Perceived Expression Differences Open
View article: TRIQA: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion Triplets
TRIQA: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion Triplets Open
Image Quality Assessment (IQA) models aim to predict perceptual image quality in alignment with human judgments. No-Reference (NR) IQA remains particularly challenging due to the absence of a reference image. While deep learning has signif…
View article: ICME 2025 Generalizable HDR and SDR Video Quality Measurement Grand Challenge
ICME 2025 Generalizable HDR and SDR Video Quality Measurement Grand Challenge Open
This paper reports IEEE International Conference on Multimedia \& Expo (ICME) 2025 Grand Challenge on Generalizable HDR and SDR Video Quality Measurement. With the rapid development of video technology, especially High Dynamic Range (HDR) …
View article: Understanding, detecting, and removing perceptual banding artifacts in compressed videos
Understanding, detecting, and removing perceptual banding artifacts in compressed videos Open
View article: Latent Guidance in Diffusion Models for Perceptual Evaluations
Latent Guidance in Diffusion Models for Perceptual Evaluations Open
Despite recent advancements in latent diffusion models that generate high-dimensional image data and perform various downstream tasks, there has been little exploration into perceptual consistency within these models on the task of No-Refe…
View article: HDRSDR-VQA: A Subjective Video Quality Dataset for HDR and SDR Comparative Evaluation
HDRSDR-VQA: A Subjective Video Quality Dataset for HDR and SDR Comparative Evaluation Open
We introduce HDRSDR-VQA, a large-scale video quality assessment dataset designed to facilitate comparative analysis between High Dynamic Range (HDR) and Standard Dynamic Range (SDR) content under realistic viewing conditions. The dataset c…
View article: Mesh Compression with Quantized Neural Displacement Fields
Mesh Compression with Quantized Neural Displacement Fields Open
Implicit neural representations (INRs) have been successfully used to compress a variety of 3D surface representations such as Signed Distance Functions (SDFs), voxel grids, and also other forms of structured data such as images, videos, a…
View article: Perceptual Visual Quality Assessment: Principles, Methods, and Future Directions
Perceptual Visual Quality Assessment: Principles, Methods, and Future Directions Open
As multimedia services such as video streaming, video conferencing, virtual reality (VR), and online gaming continue to expand, ensuring high perceptual visual quality becomes a priority to maintain user satisfaction and competitiveness. H…
View article: Quality Assessment of AI-Generated and AI-Enhanced Content: Challenges and Opportunities
Quality Assessment of AI-Generated and AI-Enhanced Content: Challenges and Opportunities Open
View article: Intraoperative Blood Pressure Misclassification Due to Inaccuracy of Noninvasive Oscillometric Cuff-Measured Blood Pressure
Intraoperative Blood Pressure Misclassification Due to Inaccuracy of Noninvasive Oscillometric Cuff-Measured Blood Pressure Open
View article: Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models
Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models Open
View article: Multiscale structural similarity for image quality assessment
Multiscale structural similarity for image quality assessment Open
The structural similarity image quality paradigm is based on the assumption that the human visual system is highly adapted for extracting structural information from the scene, and therefore a measure of structural similarity can provide a…
View article: Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos Open
Although there have been notable advancements in video compression technologies in recent years, banding artifacts remain a serious issue affecting the quality of compressed videos, particularly on smooth regions of high-definition videos.…
View article: Video Quality Assessment: A Comprehensive Survey
Video Quality Assessment: A Comprehensive Survey Open
Video quality assessment (VQA) is an important processing task, aiming at predicting the quality of videos in a manner highly consistent with human judgments of perceived quality. Traditional VQA models based on natural image and/or video …
View article: Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models
Satellite Streaming Video QoE Prediction: A Real-World Subjective Database and Network-Level Prediction Models Open
Demand for streaming services, including satellite, continues to exhibit unprecedented growth. Internet Service Providers find themselves at the crossroads of technological advancements and rising customer expectations. To stay relevant an…
View article: Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities
Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities Open
The advent of AI has influenced many aspects of human life, from self-driving cars and intelligent chatbots to text-based image and video generation models capable of creating realistic images and videos based on user prompts (text-to-imag…
View article: Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality
Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality Open
We study the visual quality judgments of human subjects on digital human avatars (sometimes referred to as "holograms" in the parlance of virtual reality [VR] and augmented reality [AR] systems) that have been subjected to distortions. We …
View article: Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity
Constructing Per-Shot Bitrate Ladders using Visual Information Fidelity Open
Video service providers need their delivery systems to be able to adapt to network conditions, user preferences, display settings, and other factors. HTTP Adaptive Streaming (HAS) offers dynamic switching between different video representa…
View article: YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals Open
3D generation guided by text-to-image diffusion models enables the creation of visually compelling assets. However previous methods explore generation based on image or text. The boundaries of creativity are limited by what can be expresse…
View article: C3DAG: Controlled 3D Animal Generation using 3D pose guidance
C3DAG: Controlled 3D Animal Generation using 3D pose guidance Open
Recent advancements in text-to-3D generation have demonstrated the ability to generate high quality 3D assets. However while generating animals these methods underperform, often portraying inaccurate anatomy and geometry. Towards ameliorat…
View article: Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content
Joint Quality Assessment and Example-Guided Image Processing by Disentangling Picture Appearance from Content Open
The deep learning revolution has strongly impacted low-level image processing tasks such as style/domain transfer, enhancement/restoration, and visual quality assessments. Despite often being treated separately, the aforementioned tasks sh…
View article: Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos
Cut-FUNQUE: An Objective Quality Model for Compressed Tone-Mapped High Dynamic Range Videos Open
High Dynamic Range (HDR) videos have enjoyed a surge in popularity in recent years due to their ability to represent a wider range of contrast and color than Standard Dynamic Range (SDR) videos. Although HDR video capture has seen increasi…
View article: Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos
Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos Open
High Dynamic Range (HDR) videos are able to represent wider ranges of contrasts and colors than Standard Dynamic Range (SDR) videos, giving more vivid experiences. Due to this, HDR videos are expected to grow into the dominant video modali…
View article: Subjective and Objective Analysis of Indian Social Media Video Quality
Subjective and Objective Analysis of Indian Social Media Video Quality Open
We conducted a large-scale subjective study of the perceptual quality of User-Generated Mobile Video Content on a set of mobile-originated videos obtained from the Indian social media platform ShareChat. The content viewed by volunteer hum…