Enhancing Resilience to Missing Data in Audio-Text Emotion Recognition with Multi-Scale Chunk Regularization

Exploring foci of: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION Enhancing Resilience to Missing Data in Audio-Text Emotion Recognition with Multi-Scale Chunk Regularization October 2023 • Wei-Cheng Lin, Lucas Goncalves, Carlos Busso Most existing audio-text emotion recognition studies have focused on the computational modeling aspects, including strategies for fusing the modalities. An area that has received less attention is understanding the role of proper temporal synchronization between the modalities in the model performance. This study presents a transformer-based model designed with a word-chunk concept, which offers an ideal framework to explore different strategies to align text and speech. The approach creates chunks with alternativ… Open Article Page

Computer Science Artificial Intelligence Transformer Machine Learning Social Science Biochemistry Quantum Mechanics Physics Chemistry Open Article