David Picard
YOU?
Author Swipe
View article: 3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review
3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review Open
In this paper, we present a comprehensive review of 3D human pose estimation and human mesh recovery from in-the-wild LiDAR point clouds. We compare existing approaches across several key dimensions, and propose a structured taxonomy to cl…
View article: Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling
Closing the Performance Gap in Generative Recommenders with Collaborative Tokenization and Efficient Modeling Open
Recent work has explored generative recommender systems as an alternative to traditional ID-based models, reframing item recommendation as a sequence generation task over discrete item tokens. While promising, such methods often underperfo…
View article: Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Open
Global visual geolocation predicts where an image was captured on Earth. Since images vary in how precisely they can be localized, this task inherently involves a significant degree of ambiguity. However, existing approaches are determinis…
View article: PoM: Efficient Image and Video Generation with the Polynomial Mixer
PoM: Efficient Image and Video Generation with the Polynomial Mixer Open
Diffusion models based on Multi-Head Attention (MHA) have become ubiquitous to generate high quality images and videos. However, encoding an image or a video as a sequence of patches results in costly attention patterns, as the requirement…
View article: PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation
PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation Open
We introduce a novel approach for 3D whole-body pose estimation, addressing the challenge of scale -- and deformability -- variance across body parts brought by the challenge of extending the 17 major joints on the human body to fine-grain…
View article: Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Don't drop your samples! Coherence-aware training benefits Conditional diffusion Open
Conditional diffusion models are powerful generative models that can leverage various types of conditional information, such as class labels, segmentation masks, or text captions. However, in many real-world scenarios, conditional informat…
View article: Analysis of Classifier-Free Guidance Weight Schedulers
Analysis of Classifier-Free Guidance Weight Schedulers Open
Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weig…
View article: Multiple Locally Linear Kernel Machines
Multiple Locally Linear Kernel Machines Open
In this paper we propose a new non-linear classifier based on a combination of locally linear classifiers. A well known optimization formulation is given as we cast the problem in a $\ell_1$ Multiple Kernel Learning (MKL) problem using man…
View article: An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning
An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning Open
Class-Incremental Learning (CIL) aims to build classification models from data streams. At each step of the CIL process, new classes must be integrated into the model. Due to catastrophic forgetting, CIL is particularly challenging when ex…
View article: Toward Unsupervised Visual Reasoning: Do Off-the-Shelf Features Know How to Reason?
Toward Unsupervised Visual Reasoning: Do Off-the-Shelf Features Know How to Reason? Open
Recent advances in visual representation learning allowed for the construction of a plethora of powerful features that are ready to use for numerous downstream tasks. Contrary to existing representation evaluations typically based on image…
View article: Image Compression using only Attention based Neural Networks
Image Compression using only Attention based Neural Networks Open
In recent research, Learned Image Compression has gained prominence for its capacity to outperform traditional handcrafted pipelines, especially at low bit-rates. While existing methods incorporate convolutional priors with occasional atte…
View article: Contraction d'une image en une petite séquence de tokens avec des modules d'attention croisée
Contraction d'une image en une petite séquence de tokens avec des modules d'attention croisée Open
International audience
View article: An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning
An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning Open
Class-Incremental Learning (CIL) aims to build classification models from data streams. At each step of the CIL process, new classes must be integrated into the model. Due to catastrophic forgetting, CIL is particularly challenging when ex…
View article: LRVS-Fashion: Extending Visual Search with Referring Instructions
LRVS-Fashion: Extending Visual Search with Referring Instructions Open
This paper introduces a new challenge for image similarity search in the context of fashion, addressing the inherent ambiguity in this domain stemming from complex images. We present Referred Visual Search (RVS), a task allowing users to d…
View article: SSP-Net: Scalable sequential pyramid networks for real-Time 3D human pose regression
SSP-Net: Scalable sequential pyramid networks for real-Time 3D human pose regression Open
View article: Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search
Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search Open
Solving jigsaw puzzles requires to grasp the visual features of a sequence of patches and to explore efficiently a solution space that grows exponentially with the sequence length. Therefore, visual deep reinforcement learning (DRL) should…
View article: FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning
FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning Open
Exemplar-free class-incremental learning is very challenging due to the negative effect of catastrophic forgetting. A balance between stability and plasticity of the incremental process is needed in order to obtain good accuracy for past a…
View article: FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning
FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning Open
International audience
View article: Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason?
Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason? Open
Recent advances in visual representation learning allowed to build an abundance of powerful off-the-shelf features that are ready-to-use for numerous downstream tasks. This work aims to assess how well these features preserve information a…
View article: H3WB: Human3.6M 3D WholeBody Dataset and Benchmark
H3WB: Human3.6M 3D WholeBody Dataset and Benchmark Open
We present a benchmark for 3D human whole-body pose estimation, which involves identifying accurate 3D keypoints on the entire human body, including face, hands, body, and feet. Currently, the lack of a fully annotated and accurate 3D whol…
View article: FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning
FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning Open
Exemplar-free class-incremental learning is very challenging due to the negative effect of catastrophic forgetting. A balance between stability and plasticity of the incremental process is needed in order to obtain good accuracy for past a…
View article: Improving Deep Metric Learning with Virtual Classes and Examples Mining
Improving Deep Metric Learning with Virtual Classes and Examples Mining Open
International audience
View article: SCAM! Transferring humans between images with Semantic Cross Attention Modulation
SCAM! Transferring humans between images with Semantic Cross Attention Modulation Open
A large body of recent work targets semantically conditioned image generation. Most such methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer that consists in not only transferring th…
View article: Decanus to Legatus: Synthetic training for 2D-3D human pose lifting
Decanus to Legatus: Synthetic training for 2D-3D human pose lifting Open
3D human pose estimation is a challenging task because of the difficulty to acquire ground-truth data outside of controlled environments. A number of further issues have been hindering progress in building a universal and robust model for …
View article: PlaStIL: Plastic and Stable Memory-Free Class-Incremental Learning
PlaStIL: Plastic and Stable Memory-Free Class-Incremental Learning Open
Plasticity and stability are needed in class-incremental learning in order to learn from new data while preserving past knowledge. Due to catastrophic forgetting, finding a compromise between these two properties is particularly challengin…
View article: Achievable Rates and Machine Learning Detection of Faster than Nyquist Spectrally Efficient FDM
Achievable Rates and Machine Learning Detection of Faster than Nyquist Spectrally Efficient FDM Open
International audience
View article: Unveiling the Latent Space Geometry of Push-Forward Generative Models
Unveiling the Latent Space Geometry of Push-Forward Generative Models Open
Many deep generative models are defined as a push-forward of a Gaussian measure by a continuous generator, such as Generative Adversarial Networks (GANs) or Variational Auto-Encoders (VAEs). This work explores the latent space of such deep…
View article: Instance-Aware Observer Network for Out-of-Distribution Object Segmentation
Instance-Aware Observer Network for Out-of-Distribution Object Segmentation Open
Recent works on predictive uncertainty estimation have shown promising results on Out-Of-Distribution (OOD) detection for semantic segmentation. However, these methods struggle to precisely locate the point of interest in the image, i.e, t…
View article: Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates
Consensus-Based Optimization for 3D Human Pose Estimation in Camera Coordinates Open
View article: SCAM! Transferring Humans Between Images with Semantic Cross Attention Modulation
SCAM! Transferring Humans Between Images with Semantic Cross Attention Modulation Open