Explanipedia

FoleyGRAM: Video-to-Audio Generation with GRAM-Aligned Multimodal Encoders Open

Riccardo Fosco Gramaccioni, C. Marinoni, Eleonora Grassucci, Giordano Cicchetti, Aurelio Uncini , et al. · 2025

In this work, we present FoleyGRAM, a novel approach to video-to-audio generation that emphasizes semantic conditioning through the use of aligned multimodal encoders. Building on prior advancements in video-to-audio generation, FoleyGRAM …

Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information Open

C. Marinoni, Riccardo Fosco Gramaccioni, Eleonora Grassucci, Danilo Comminiello · 2025

The generation of sounding videos has seen significant advancements with the advent of diffusion models. However, existing methods often lack the fine-grained control needed to generate viewpoint-specific content from larger, immersive 360…

StereoSync: Spatially-Aware Stereo Audio Generation from Video Open

C. Marinoni, Riccardo Fosco Gramaccioni, Kazuki Shimada, Takashi Shibuya, Yuki Mitsufuji , et al. · 2025

Although audio generation has been widely studied over recent years, video-aligned audio generation still remains a relatively unexplored frontier. To address this gap, we introduce StereoSync, a novel and efficient model designed to gener…

FolAI: Synchronized Foley Sound Generation with Semantic and Temporal Alignment Open

Riccardo Fosco Gramaccioni, C. Marinoni, Emilian Postolache, Marco Comunità, Luca Cosmo , et al. · 2024

Traditional sound design workflows rely on manual alignment of audio events to visual cues, as in Foley sound design, where everyday actions like footsteps or object interactions are recreated to match the on-screen motion. This process is…

Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality Open

C. Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello · 2024

Computer science

The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D …

Cosmography of the Local Universe by Multipole Analysis of the Expansion Rate Fluctuation Field Open

Basheer Kalbouneh, C. Marinoni, Roy Maartens · 2024

Physics Mathematics

We establish a relationship between the multipoles of the expansion rate fluctuation field $η,$ which capture in an accurate way deviations from isotropy in the redshift-distance relation, and the multipoles of the covariant cosmographic p…

L3DAS23: Learning 3D Audio Sources for Audio-Visual Extended Reality Open

Riccardo Fosco Gramaccioni, C. Marinoni, Changan Chen, Aurelio Uncini, Danilo Comminiello · 2024

Computer science

The primary goal of the L3DAS (Learning 3D Audio Sources) project is to stimulate and support collaborative research studies concerning machine learning techniques applied to 3D audio signal processing. To this end, the L3DAS23 Challenge, …

Covariant cosmography: the observer-dependence of the Hubble parameter Open

Roy Maartens, Jéssica Santiago, Chris Clarkson, Basheer Kalbouneh, C. Marinoni · 2023

Physics Mathematics

The disagreement between low- and high-redshift measurements of the Hubble parameter is emerging as a serious challenge to the standard model of cosmology. We develop a covariant cosmographic analysis of the Hubble parameter in a general s…

The multipolar structure of the local expansion rate Open

C. Marinoni, Basheer Kalbouneh, J. Bel · 2023

Physics

We design a new observable, the $\eta$ expansion rate fluctuation, to characterize deviations from linearity in the redshift-distance relationship in the local Universe. We also show how to compress the resulting signal into spherical harm…

Diffusion models for audio semantic communication Open

Eleonora Grassucci, C. Marinoni, M. Andrea Rodríguez, Danilo Comminiello · 2023

Computer science

Directly sending audio signals from a transmitter to a receiver across a noisy channel may absorb consistent bandwidth and be prone to errors when trying to recover the transmitted bits. On the contrary, the recent semantic communication a…

The multipole expansion of the local expansion rate Open

Basheer Kalbouneh, C. Marinoni, J. Bel · 2022

Physics Mathematics

We design a new observable, the expansion rate fluctuation $η$, to characterize deviations from the linear relation between redshift and distance in the local universe. We also show how to compress the resulting signal into spherical harmo…

Constraining spatial curvature with large-scale structure Open

J. Bel, Julien Larena, Roy Maartens, C. Marinoni, Louis Pèrenon · 2022

Physics Mathematics

We analyse the clustering of matter on large scales in an extension of the concordance model that allows for spatial curvature. We develop a consistent approach to curvature and wide-angle effects on the galaxy 2-point correlation function…

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment Open

Eric Guizzo, C. Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng , et al. · 2022

Computer science Engineering Art

The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of th…

L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment Open

Eric Guizzo, C. Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng , et al. · 2022

Computer science Engineering Art

The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of th…

L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing Open

Eric Guizzo, Riccardo Fosco Gramaccioni, Saeid Jamili, C. Marinoni, Edoardo Massaro , et al. · 2021

Computer science Physics

The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alon…

Redshift drift in radially inhomogeneous Lemaître-Tolman-Bondi spacetimes Open

Romain Codur, C. Marinoni · 2021

Physics Economics

We provide a formula for estimating the redshift and its secular change (redshift drift) in Lemaître-Tolman-Bondi (LTB) spherically symmetric universes. We compute the scaling of the redshift drift for LTB models that predict Hubble diagra…

L3DAS21 challenge: machine learning for 3D audio signal processing Open

Eric Guizzo, Riccardo Fosco Gramaccioni, Saeid Jamili, C. Marinoni, Edoardo Massaro , et al. · 2021

Computer science Physics

The L3DAS21 Challenge11www.13das.com/mlsp2021 is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization an…

Proposal for a Real-Time Detection of our Acceleration through Space Open

J. Bel, C. Marinoni · 2018

Physics Art

Our proper acceleration with respect to the cosmic microwave background results in a real-time change of the angular position of distant extragalactic sources. The cosmological component of this aberration drift signal, the noninertial mot…

E pur si muove! Proposal for a real-time detection of our acceleration through space Open

J. Bel, C. Marinoni · 2018

Physics Art

Our proper acceleration with respect to the Cosmic Microwave Background results in a real-time change of the angular position of distant extragalactic sources. The cosmological component of this aberration drift signal, the non-inertial mo…

The VIMOS Public Extragalactic Redshift Survey (VIPERS) Open

M. Scodeggio, L. Guzzo, B. Garilli, B. R. Granett, M. Bolzonella , et al. · 2017

Physics

We present the full public data release (PDR-2) of the VIMOS Public Extragalactic Redshift Survey (VIPERS), performed at the ESO VLT. We release redshifts, spectra, CFHTLS magnitudes and ancillary information (as masks and weights) for a c…

Diagnostic of Horndeski theories Open

Louis Pèrenon, C. Marinoni, Federico Piazza · 2017

Physics

We study the effects of Horndeski models of dark energy on the observables of\nthe large-scale structure in the late time universe. A novel classification\ninto {\\it Late dark energy}, {\\it Early dark energy} and {\\it Early modified\ngr…

The VIMOS Public Extragalactic Redshift Survey (VIPERS). Full spectroscopic data and auxiliary information release (PDR-2) Open

M. Scodeggio, L. Guzzo, B. Garilli, B. R. Granett, M. Bolzonella , et al. · 2016

Physics

We present the full public data release (PDR-2) of the VIMOS Public Extragalactic Redshift Survey (VIPERS), performed at the ESO VLT. We release redshifts, spectra, CFHTLS magnitudes and ancillary information (as masks and weights) for a c…

C. Marinoni YOU? Author Swipe