CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs

Exploring foci of: arXiv (Cornell University) CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs July 2025 • Jiaming Zhang, Rui Hu, Wei Yang Bryan Lim Video Multimodal Large Language Models (V-MLLMs) have shown impressive capabilities in temporal reasoning and cross-modal understanding, yet their vulnerability to adversarial attacks remains underexplored due to unique challenges: complex cross-modal reasoning mechanisms, temporal dependencies, and computational constraints. We present CAVALRY-V (Cross-modal Language-Vision Adversarial Yielding for Videos), a novel framework that directly targets the critical interface between visual perception and language gener… Open Article Page

September 11 Attacks October 7 Attacks 2011 Norway Attacks D.C. Sniper Attacks Hijackers In The September 11 Attacks Casualties Of The September 11 Attacks 2008 Mumbai Attacks Health Effects Arising From The September 11 Attacks November 2015 Paris Attacks Open Article

Mars Attacks! 2015 Sousse Attacks 2001 Anthrax Attacks Jersey Shore Shark Attacks Of 1916 List Of Fatal Bear Attacks In North America Motives For The September 11 Attacks Generative Adversarial Network National Institutional Ranking Framework Radioisotope Thermoelectric Generator Open Article

2017 Barcelona Attacks The Myth Of The Framework Cynefin Framework Electric Generator 1991 Iraqi Missile Attacks Against Israel Generator Rex Open Article