MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Exploring foci of: arXiv (Cornell University) MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning July 2024 • T. Q. Nguyen, Bin Yi, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi M. Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering the downstream performance across unpopular subjects. To address these problems, we propose MAMA, a new approach to lear… Open Article Page

Computer Science Artificial Intelligence Feature Learning Machine Learning Law Politics Open Article