Nils Hoehing YOU? Author Swipe

Last 10y

Open Invitation to Help Curate This Field & Enhance Impact .ORG

What's left can't be right -- The remaining positional incompetence of contrastive vision-language models Open

Nils Hoehing, Ellen Rushe, Anthony Ventresque · 2023

Contrastive vision-language models like CLIP have been found to lack spatial understanding capabilities. In this paper we discuss the possible causes of this phenomenon by analysing both datasets and embedding space. By focusing on simple …

Creating related items for first view…