Jaekwon Im
FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation
Versatile audio super-resolution (SR) is the challenging task of restoring high-frequency components from low-resolution audio with sampling rates between 4kHz and 32kHz in various domains such as music, speech, and sound effects. Previous…
Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound
Foley sound synthesis is crucial for multimedia production, enhancing user experience by synchronizing audio and video both temporally and semantically. Recent studies on automating this labor-intensive process through video-to-sound gener…
DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech
Properly setting up recording conditions, including microphone type and placement, room acoustics, and ambient noise, is essential to obtaining the desired acoustic characteristics of speech. In this paper, we propose Diff-R-EN-T, a Diffus…
Foley Sound Synthesis at the DCASE 2023 Challenge
The addition of Foley sound effects during post-production is a common technique used to enhance the perceived acoustic properties of multimedia content. Traditionally, Foley sound has been produced by human Foley artists, which involves m…
Neural Vocoder Feature Estimation for Dry Singing Voice Separation
Singing voice separation (SVS) is a task that separates singing voice audio from its mixture with instrumental audio. Previous SVS studies have mainly employed the spectrogram masking method which requires a large dimensionality in predict…