Jesse Emond YOU? Author Swipe

Last 10y

Open Invitation to Help Curate This Field & Enhance Impact .ORG

Language-Agnostic Multilingual Modeling Open

Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark · 2024

Computer science Philosophy

Multilingual Automated Speech Recognition (ASR) systems allow for the joint training of data-rich and data-scarce languages in a single model. This enables data and parameter sharing across languages, which is especially beneficial for the…

Modular Hybrid Autoregressive Transducer Open

Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang , et al. · 2022

Computer science Mathematics Engineering

Text-only adaptation of a transducer model remains challenging for end-to-end speech recognition since the transducer has no clearly separated acoustic model (AM), language model (LM) or blank model. In this work, we propose a modular hybr…

Non-Parallel Voice Conversion for ASR Augmentation Open

Gary Wang, Andrew E. Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang , et al. · 2022

Computer science Engineering Mathematics

Automatic speech recognition (ASR) needs to be robust to speaker differences. Voice Conversion (VC) modifies speaker characteristics of input speech. This is an attractive feature for ASR data augmentation. In this paper, we demonstrate th…

Creating related items for first view…