Maxime Oquab
Cluster and Predict Latent Patches for Improved Masked Image Modeling
Masked Image Modeling (MIM) offers a promising approach to self-supervised representation learning; however, existing MIM models still lag behind the state of the art. In this paper, we systematically analyze target representations, loss fu…
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Self-supervised visual foundation models produce powerful embeddings that achieve remarkable performance on a wide range of downstream tasks. However, unlike vision-language models such as CLIP, self-supervised visual features are not read…
Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach
Self-supervised features are the cornerstone of modern machine learning systems. They are typically pre-trained on data collections whose construction and curation require extensive human effort. This manual process has some limi…
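As a rough illustration of the clustering-based curation idea in the title, a hedged sketch: cluster the embedding pool with k-means and sample uniformly across clusters, so frequent concepts do not crowd out rare ones. The function name and budget parameters are illustrative assumptions, not the paper's exact algorithm (which applies the idea hierarchically and at far larger scale):

```python
import numpy as np
from sklearn.cluster import KMeans

def balanced_subset(embeddings, k=100, per_cluster=10, seed=0):
    """Cluster the pool, then sample uniformly across clusters so that
    dominant concepts do not swamp rare ones. k and per_cluster are
    illustrative budgets, not the paper's settings."""
    labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(embeddings)
    rng = np.random.default_rng(seed)
    keep = []
    for c in range(k):
        members = np.flatnonzero(labels == c)
        keep.extend(rng.choice(members, min(per_cluster, len(members)), replace=False))
    return np.asarray(keep)  # indices of the curated subset
```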
Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning
AI Foundation models are gaining traction in various applications, including medical fields like radiology. However, medical foundation models are often tested on limited tasks, leaving their generalisability and biases unexplored. We pres…
You Don’t Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning
Self-supervised learning (SSL) with joint-embedding architectures (JEA) has led to outstanding performance. All instantiations of this paradigm were trained using strong and well-established hand-crafted data augmentations, leading to the…
Vision Transformers Need Registers
Transformers have recently emerged as a powerful tool for learning visual representations. In this paper, we identify and characterize artifacts in feature maps of both supervised and self-supervised ViT networks. The artifacts correspond …
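The fix the title refers to is to give the model dedicated slots for such global computation: a few extra learnable "register" tokens appended to the input sequence and discarded at the output. A minimal PyTorch sketch, where the backbone, sizes, and token handling are illustrative assumptions rather than the paper's architecture:

```python
import torch
import torch.nn as nn

class ViTWithRegisters(nn.Module):
    """Append learnable "register" tokens to the patch sequence so the
    model can store global computations outside the patch tokens, then
    discard the registers at the output."""
    def __init__(self, dim=384, n_patches=196, n_registers=4, depth=4):
        super().__init__()
        self.cls = nn.Parameter(torch.zeros(1, 1, dim))
        self.registers = nn.Parameter(torch.zeros(1, n_registers, dim))
        self.pos = nn.Parameter(torch.zeros(1, n_patches + 1, dim))
        block = nn.TransformerEncoderLayer(dim, nhead=6, batch_first=True)
        self.encoder = nn.TransformerEncoder(block, num_layers=depth)
        self.n_registers = n_registers

    def forward(self, patches):  # patches: (B, n_patches, dim)
        B = patches.shape[0]
        x = torch.cat([self.cls.expand(B, -1, -1), patches], dim=1) + self.pos
        # Registers get no positional embedding: they are not tied to a location.
        x = torch.cat([x, self.registers.expand(B, -1, -1)], dim=1)
        x = self.encoder(x)[:, : -self.n_registers]  # drop registers at output
        return x[:, 0], x[:, 1:]  # [CLS] embedding, cleaner patch tokens
```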
Dimensionality and Ramping: Signatures of Sentence Integration in the Dynamics of Brains and Deep Language Models
A sentence is more than the sum of its words: its meaning depends on how they combine with one another. The brain mechanisms underlying such semantic composition remain poorly understood. To shed light on the neural vector code underlying …
DINOv2: Learning Robust Visual Features without Supervision
The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any sy…
Co-training $2^L$ Submodels for Visual Recognition
We introduce submodel co-training, a regularization method related to co-training, self-distillation and stochastic depth. Given a neural network to be trained, for each sample we implicitly instantiate two altered networks, "submodels",…
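A hedged sketch of one training step in this spirit, assuming the model uses stochastic depth so that each forward pass in train mode samples a different submodel; the loss weighting and the symmetric KL term are illustrative assumptions:

```python
import torch.nn.functional as F

def cosub_loss(model, x, y, lam=0.5):
    """Two forward passes through a stochastic-depth network sample two
    different "submodels"; each is trained on the labels and on the
    other's (detached) predictions."""
    logits_a = model(x)  # first random submodel (train mode)
    logits_b = model(x)  # second random submodel, independent drop mask
    ce = F.cross_entropy(logits_a, y) + F.cross_entropy(logits_b, y)
    # Mutual distillation: each submodel matches the other's softmax.
    kl = (F.kl_div(F.log_softmax(logits_a, -1),
                   F.softmax(logits_b, -1).detach(), reduction="batchmean") +
          F.kl_div(F.log_softmax(logits_b, -1),
                   F.softmax(logits_a, -1).detach(), reduction="batchmean"))
    return (1 - lam) * ce + lam * kl
```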
Efficient conditioned face animation using frontally-viewed embedding
As the quality of few-shot facial animation from landmarks increases, new applications become possible, such as ultra-low-bandwidth video chat compression with a high degree of realism. However, there are some important challenges to tackl…
Self-appearance-aided Differential Evolution for Motion Transfer
Image animation transfers the motion of a driving video to a static object in a source image, while keeping the source identity unchanged. Great progress has been made in unsupervised motion transfer recently, where no labelled data or gro…
Low Bandwidth Video-Chat Compression using Deep Generative Models
To unlock video chat for hundreds of millions of people hindered by poor connectivity or unaffordable data costs, we propose to authentically reconstruct faces on the receiver's device using facial landmarks extracted at the sender's side …
Can RNNs learn Recursive Nested Subject-Verb Agreements?
One of the fundamental principles of contemporary linguistics states that language processing requires the ability to extract recursively nested tree structures. However, it remains unclear whether and how this code could be implemented in…
Discriminating the Influence of Correlated Factors from Multivariate Observations: the Back-to-Back Regression
Identifying causes solely from observations can be particularly challenging when i) potential factors are difficult to manipulate independently and ii) observations are multi-dimensional. To address this issue, we introduce “Back-to-Back” …
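In outline, back-to-back regression runs two regressions in sequence: first decode the putative factors from the observations on one half of the data, then regress those decoded estimates on the true factors on the other half; the diagonal of the second regression isolates each factor's own influence. A minimal scikit-learn sketch, in which the split, the regularization grid, and the toy demo are assumptions:

```python
import numpy as np
from sklearn.linear_model import RidgeCV

def b2b(X, Y, seed=0):
    """Back-to-back regression sketch. X: (n, f) candidate factors,
    Y: (n, c) multivariate observations. Returns one influence score
    per factor; a score near 0 suggests no unique effect on Y."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    a, b = np.array_split(idx, 2)
    alphas = np.logspace(-3, 3, 13)
    # Step 1 (decoding): predict each factor from all observations.
    G = RidgeCV(alphas=alphas).fit(Y[a], X[a])
    # Step 2 (back-regression): regress the decoded factors on the true
    # factors; off-diagonal terms absorb shared (correlated) variance,
    # so the diagonal isolates each factor's own contribution.
    H = RidgeCV(alphas=alphas).fit(X[b], G.predict(Y[b]))
    return np.diag(H.coef_)

# Toy demo: three correlated factors, the third has no effect on Y.
n, f, c = 2000, 3, 20
rng = np.random.default_rng(1)
X = rng.normal(size=(n, f)) @ np.array([[1, .8, .8], [0, .6, 0], [0, 0, .6]]).T
Y = X @ np.diag([1., 1., 0.]) @ rng.normal(size=(f, c)) + rng.normal(size=(n, c))
print(b2b(X, Y))  # expect the last score to be near 0
```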
Learning about an exponential amount of conditional distributions
We introduce the Neural Conditioner (NC), a self-supervised machine able to learn about all the conditional distributions of a random vector $X$. The NC is a function $NC(x \cdot a, a, r)$ that leverages adversarial training to match each …
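A hedged sketch of just the NC interface from this description: the network sees the observed coordinates $x \cdot a$ together with the masks $a$ and $r$ plus noise, and fills in the requested coordinates. The MLP architecture is an assumption, and the adversarial training loop that matches the generated conditionals to the data is omitted:

```python
import torch
import torch.nn as nn

class NeuralConditioner(nn.Module):
    """Given observed values x*a, availability mask a, and requested
    mask r, generate values for the requested coordinates from noise."""
    def __init__(self, d, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(4 * d, hidden), nn.ReLU(),  # input: [x*a, a, r, noise]
            nn.Linear(hidden, d))

    def forward(self, x, a, r):
        z = torch.randn_like(x)  # latent noise -> stochastic completions
        fill = self.net(torch.cat([x * a, a, r, z], dim=-1))
        return x * a + fill * r  # keep observed values, fill requested ones
```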
Convolutional neural networks: towards less supervision for visual recognition
Convolutional Neural Networks are flexible learning algorithms for computer vision that scale particularly well with the amount of data that is provided for training them. Although these methods had successful applications already in the ’…
Learning and transferring mid-level image representations using convolutional neural networks
Convolutional neural networks (CNN) have recently shown outstanding image classification performance in the large-scale visual recognition challenge (ILSVRC2012). The success of CNNs is attributed to their ability to learn rich mid-level …
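The transfer recipe this line of work describes, in modern PyTorch terms: reuse layers pre-trained on ImageNet as a mid-level representation and train only new adaptation layers on the target task. The torchvision backbone and the 20-class head (e.g. for PASCAL VOC) are stand-ins for the paper's original AlexNet-style setup:

```python
import torch.nn as nn
from torchvision import models

# Reuse ImageNet-pre-trained layers as a fixed mid-level representation
# and train only a new task-specific head on the target dataset.
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
for p in backbone.parameters():
    p.requires_grad = False  # freeze the transferred representation
backbone.fc = nn.Linear(backbone.fc.in_features, 20)  # e.g. 20 VOC classes
# Optimize only backbone.fc.parameters() on the target task as usual.
```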
Revisiting Classifier Two-Sample Tests
The goal of two-sample tests is to assess whether two samples, $S_P \sim P^n$ and $S_Q \sim Q^m$, are drawn from the same distribution. Perhaps intriguingly, one relatively unexplored method to build two-sample tests is the use of binary c…
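The test itself fits in a few lines: train a binary classifier to distinguish the two samples and use its held-out accuracy as the test statistic; accuracy near chance supports $P = Q$. A minimal scikit-learn sketch, where the classifier choice and split sizes are illustrative rather than the paper's exact protocol:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

def c2st(S_P, S_Q, seed=0):
    """Classifier two-sample test: label P-samples 0 and Q-samples 1,
    train a classifier, and return its held-out accuracy. Accuracy
    near 0.5 suggests the samples come from the same distribution."""
    X = np.vstack([S_P, S_Q])
    y = np.r_[np.zeros(len(S_P)), np.ones(len(S_Q))]
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.5, stratify=y, random_state=seed)
    clf = MLPClassifier(hidden_layer_sizes=(20,), max_iter=2000,
                        random_state=seed).fit(X_tr, y_tr)
    return clf.score(X_te, y_te)  # test statistic: held-out accuracy

rng = np.random.default_rng(0)
print(c2st(rng.normal(0, 1, (500, 2)), rng.normal(0, 1, (500, 2))))  # ~0.5
print(c2st(rng.normal(0, 1, (500, 2)), rng.normal(1, 1, (500, 2))))  # >0.5
```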
Revisiting Classifier Two-Sample Tests for GAN Evaluation and Causal Discovery
The goal of two-sample tests is to assess whether two samples, $S_P \sim P^n$ and $S_Q \sim Q^m$, are drawn from the same distribution. Perhaps intriguingly, one relatively unexplored method to build two-sample tests is the use of binary c…
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
We aim to localize objects in images using image-level supervision only. Previous approaches to this problem mainly focus on discriminative object regions and often fail to locate precise object boundaries. We address this problem by intro…
Is object localization for free? – Weakly-supervised learning with convolutional neural networks