Exploring foci of:
INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION
Enhancing Resilience to Missing Data in Audio-Text Emotion Recognition with Multi-Scale Chunk Regularization
October 2023 • Wei-Cheng Lin, Lucas Goncalves, Carlos Busso
Most existing audio-text emotion recognition studies have focused on computational modeling, including strategies for fusing the modalities. Less attention has been paid to how proper temporal synchronization between the modalities affects model performance. This study presents a transformer-based model designed around a word-chunk concept, which offers an ideal framework for exploring different strategies to align text and speech. The approach creates chunks with alternativ…
Computer Science
Artificial Intelligence
Transformer
Machine Learning
Social Science
Biochemistry
Quantum Mechanics
Physics
Chemistry
Voltage