Expressive Text-to-Speech Synthesis using Text Chat Dataset with Speaking Style Information

Transactions of the Japanese Society for Artificial Intelligence • Vol 38 • No 3 • April 2023
Yukinori Homma, Hiroki Kanagawa, Nozomi Kobayashi, Yusuke Ijima, Kuniko Saito

This paper aims to generate expressive speech for integration with robot and AI character dialogue systems. To generate expressive speech, some researchers have proposed using labels that express specific dialogue acts and emotions (i.e., speaking style information). Our approach is to use the speaking style information as an intermediate representation and to train two models independently: one that infers the speaking style information from the text, and a speech synthesis model. Using a model that infers speaking st…
