4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos Article Swipe

PDF

Related Concepts

No concepts available.

Shanshan Zhong , Jiawei Peng , Zehan Zheng , Zhongzhan Huang , Wufei Ma , Guofeng Zhang , Qihao Liu , Alan Yuille , Jieneng Chen ·

YOU? · · 2025 · Open Access · · DOI: https://doi.org/10.48550/arxiv.2507.10437 · OA: W4414742253

Existing methods for reconstructing animatable 3D animals from videos typically rely on sparse semantic keypoints to fit parametric models. However, obtaining such keypoints is labor-intensive, and keypoint detectors trained on limited animal data are often unreliable. To address this, we propose 4D-Animal, a novel framework that reconstructs animatable 3D animals from videos without requiring sparse keypoint annotations. Our approach introduces a dense feature network that maps 2D representations to SMAL parameters, enhancing both the efficiency and stability of the fitting process. Furthermore, we develop a hierarchical alignment strategy that integrates silhouette, part-level, pixel-level, and temporal cues from pre-trained 2D visual models to produce accurate and temporally coherent reconstructions across frames. Extensive experiments demonstrate that 4D-Animal outperforms both model-based and model-free baselines. Moreover, the high-quality 3D assets generated by our method can benefit other 3D tasks, underscoring its potential for large-scale applications. The code is released at https://github.com/zhongshsh/4D-Animal.