Exploring foci of:
arXiv (Cornell University)
MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning
July 2024 • T. Q. Nguyen, Bin Yi, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi M. Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan
Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering the downstream performance across unpopular subjects. To address these problems, we propose MAMA, a new approach to lear…
Computer Science
Artificial Intelligence
Feature Learning
Machine Learning
Law
Politics