Exploring foci of:
arXiv (Cornell University)
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
October 2021 • Ting-Yao Hu, Mohammadreza Armandpour, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Swetha Koppula, Oncel Tuzel
With recent advances in speech synthesis, synthetic data is becoming a viable alternative to real data for training speech recognition models. However, machine learning with synthetic data is not trivial due to the gap between the synthetic and the real data distributions. Synthetic datasets may contain artifacts that do not exist in real data such as structured noise, content errors, or unrealistic speaking styles. Moreover, the synthesis process may introduce a bias due to uneven sampling of the data manifold. W…
Computer Science
Artificial Intelligence
Training, Validation, And Test Data Sets
Machine Learning
Computer Vision
Anthropology