Dan Lyth
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
Text-to-speech models trained on large-scale datasets have demonstrated impressive in-context learning capabilities and naturalness. However, control of speaker identity and style in these models typically requires conditioning on referenc…