Simone Bianco
Artificial Intelligence in Medicine and Healthcare: A Complexity-Based Framework for Model–Context–Relation Alignment Open
Artificial intelligence (AI) is profoundly transforming medicine and healthcare, evolving from analytical tools aimed at automating specific tasks to integrated components of complex socio-technical systems. This work presents a conceptual…
Improving image captioning descriptiveness by ranking and LLM-based fusion Open
State-of-the-art (SoTA) image captioning models are often trained on the MicroSoft Common Objects in Context (MS-COCO) dataset, which contains human-annotated captions with an average length of approximately ten tokens. Although effective …
Robust camera-independent color chart localization using YOLO Open
Accurate color information plays a critical role in numerous computer vision tasks, with the Macbeth ColorChecker being a widely used reference target due to its colorimetrically characterized color patches. However, automating the precise…
Scalable Residual Laplacian Network for HEVC-compressed Video Restoration Open
We present a novel Convolutional Neural Network that exploits the Laplacian decomposition technique, which is typically used in traditional image processing, to restore videos compressed with the High-Efficiency Video Coding (HEVC) algorit…
Acquisition and Modeling of Material Appearance Using a Portable, Low Cost, Device Open
Material appearance acquisition allows researchers to capture the optical properties of surfaces and use them in different tasks such as material analysis, digital twins reproduction, 3D configurators, augmented and virtual reality, etc. P…
Enhancing Direction-of-Arrival Estimation with Multi-Task Learning Open
There are numerous methods in the literature for Direction-of-Arrival (DOA) estimation, including both classical and machine learning-based approaches that jointly estimate the Number of Sources (NOS) and DOA. However, most of these method…
Cross-Camera Distracted Driver Classification through Feature Disentanglement and Contrastive Learning Open
The classification of distracted drivers is pivotal for ensuring safe driving. Previous studies demonstrated the effectiveness of neural networks in automatically predicting driver distraction, fatigue, and potential hazards. However, rece…
NTIRE 2024 Challenge on Night Photography Rendering Open
This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce a photo-quality outp…
Pathspace Kalman Filters with Dynamic Process Uncertainty for Analyzing Time-course Data Open
The Kalman Filter (KF) is an optimal linear state prediction algorithm, with applications in fields as diverse as engineering, economics, robotics, and space exploration. Here, we develop an extension of the KF, called a Pathspace Kalman Filte…
Reliability and Stability of Mean Opinion Score for Image Aesthetic Quality Assessment Obtained through Crowdsourcing Open
This .zip archive contains 10 scenes and 5 styles for each scene. There are also several folders with experimental data: 1-3 indistinguishable runs; run with reduced control; run with specified region; run on weekend; all runs combined in one…
A RNN for Temporal Consistency in Low-Light Videos Enhanced by Single-Frame Methods Open
Low-light video enhancement (LLVE) has received little attention compared to low-light image enhancement (LLIE) mainly due to the lack of paired low-/normal-light video datasets. Consequently, a common approach to LLVE is to enhance each v…
Reliability and Stability of Mean Opinion Score for Image Aesthetic Quality Assessment Obtained Through Crowdsourcing Open
Image quality assessment (IQA) is widely used to evaluate the results of image processing methods. While in recent years the development of objective IQA metrics has seen much progress, there are still many tasks where subjective IQA is si…
A General-purpose Pipeline for Realistic Synthetic Multispectral Image Dataset Generation Open
A pipeline for the generation of a synthetic dataset of spectral scenes, with corresponding sensor readings, is proposed here. The pipeline is composed of two main parts: Part 1: Image pixel reflectance assignment. Individual pixels from an …
Learning Color Constancy: 30 Years Later Open
The first paper investigating the use of machine learning to learn the relationship between an image of a scene and the color of the scene illuminant was published by Funt et al. in 1996. Specifically, they investigated if such a relations…
RGB Illuminant Compensation using Spectral Super-resolution and Weighted Spectral Color Correction Open
This paper presents a novel approach for spectral illuminant correction in smartphone imaging systems, aiming to improve color accuracy and enhance image quality. The methods introduced include Spectral Super Resolution and Weighted Spectr…
Semi-supervised cross-lingual speech emotion recognition Open
Performance in Speech Emotion Recognition (SER) on a single language has increased greatly in the last few years thanks to the use of deep learning techniques. However, cross-lingual SER remains a challenge in real-world applications due t…
Improving Image Captioning Descriptiveness by Ranking and LLM-based Fusion Open
State-of-The-Art (SoTA) image captioning models are often trained on the MicroSoft Common Objects in Context (MS-COCO) dataset, which contains human-annotated captions with an average length of approximately ten tokens. Although effective …
NTIRE 2023 Challenge on Night Photography Rendering Open
This paper presents a review of the NTIRE 2023 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce a photo-q…
ENRICH: Multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetry Open
The availability of high-resolution data and accurate ground truth is essential to evaluate and compare methods and algorithms properly. Moreover, it is often difficult to acquire real data for a given application domain that is sufficient…
Video restoration based on deep learning: a comprehensive survey Open
Video restoration concerns the recovery of a clean video sequence starting from its degraded version. Different video restoration tasks exist, including denoising, deblurring, super-resolution, and reduction of compression artifacts. In th…
Analysis of biases in automatic white balance datasets and methods Open
Annotated datasets for automatic white balance (AWB) are used for the evaluation and, when necessary, the training, of AWB methods. Relying on such datasets requires awareness of the potential bias in their content and characteristics: som…
AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results Open
This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track 2 targets the super-resolution of compr…
Fast-n-Squeeze: towards real-time spectral reconstruction from RGB images Open
We present an efficient method for the reconstruction of multispectral information from RGB images, as part of the NTIRE 2022 Spectral Reconstruction Challenge. Given an input image, our method determines a global RGB-to-spectral linear tr…
NTIRE 2022 Spectral Recovery Challenge and Data Set Open
This paper reviews the third biennial challenge on spectral reconstruction from RGB images, i.e., the recovery of whole-scene hyperspectral (HS) information from a 3-channel RGB image. This challenge presents the ARAD 1K data set: a new, l…