Explanipedia

A geometric shape regularity effect in the human brain Open

Mathias Sablé-Meyer, Lucas, Benjamin, Cassandra Potier Watkins, Chenxi He, Maxence Pajot, et al. · 2025

The perception and production of regular geometric shapes, a characteristic trait of human cultures since prehistory, has unknown neural mechanisms. Behavioral studies suggest that humans are attuned to discrete regularities such as symmet…

UAVLight: A Benchmark for Illumination-Robust 3D Reconstruction in Unmanned Aerial Vehicle (UAV) Scenes Open

Du Kang, Liao Xue, Xia Jun-peng, Guo, Chaozheng, Gu Yi, et al. · 2025

Illumination inconsistency is a fundamental challenge in multi-view 3D reconstruction. Variations in sunlight direction, cloud cover, and shadows break the constant-lighting assumption underlying both classical multi-view stereo (MVS) and …

RM3DMOT: Roadside Monocular 3D Multi-Object Tracking with Motion-Appearance Optimization Open

Yuyang Yao, Ping Wang, Zuxing Li, Chao Wang, Can Tian, et al. · 2025

Safety Helmet-Based Scale Recovery for Low-Cost Monocular 3D Reconstruction on Construction Sites Open

Jianyu Ren, Lingling Wang, Xuanxuan Liu, Linghong Zeng, Jianyu Ren, et al. · 2025

Three-dimensional (3D) reconstruction is increasingly being adopted in construction site management. While most existing studies rely on auxiliary equipment such as LiDAR and depth cameras, monocular depth estimation offers broader applica…

Endo-G$^{2}$T: Geometry-Guided & Temporally Aware Time-Embedded 4DGS For Endoscopic Scenes Open

Liu Yang-le, Li, Fengze, Liu Kan, Ma Jieming · 2025

Endoscopic (endo) video exhibits strong view-dependent effects such as specularities, wet reflections, and occlusions. Pure photometric supervision misaligns with geometry and triggers early geometric drift, where erroneous shapes are rein…

Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance Open

Wang, Haoxuan, Tao, Jiachen, WU Junyi, Liu, Gaowen, Kompella, Ramana Rao, et al. · 2025

We present Motion Marionette, a zero-shot framework for rigid motion transfer from monocular source videos to single-view target images. Previous works typically employ geometric, generative, or simulation priors to guide the transfer proc…

STAvatar: Soft Binding and Temporal Density Control for Monocular 3D Head Avatars Reconstruction Open

Zhao, Jiankuo, Zhu, Xiangyu, Wang, Zidu, Lei, Zhen · 2025

Reconstructing high-fidelity and animatable 3D head avatars from monocular videos remains a challenging yet essential task. Existing methods based on 3D Gaussian Splatting typically bind Gaussians to mesh triangles and model deformations s…

Anisometropia in bilateral hyperopic refractive amblyopia requires eye patching Open

Yukari Hasegawa, Satoshi Ueki · 2025

Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation Open

Kienzle, Daniel, Ludwig, Katja, Lorenz, Julian, Satoh, Shin'ichi, Lienhart Rainer · 2025

Obtaining the precise 3D motion of a table tennis ball from standard monocular videos is a challenging problem, as existing methods trained on synthetic data struggle to generalize to the noisy, imperfect ball and table detections of the r…

Metric, inertially aligned monocular state estimation via kinetodynamic priors Open

Liu Jiaxin, Li Min, Xu Wanting, Liang Li, Yang Jiaqi, et al. · 2025

Accurate state estimation for flexible robotic systems poses significant challenges, particularly for platforms with dynamically deforming structures that invalidate rigid-body assumptions. This paper tackles this problem and allows to exten…

MODEST: Multi-Optics Depth-of-Field Stereo Dataset Open

Trivedi, Nisarg K., Belludi, Vinayak A., Wang, Li-Yun, Taghavi, Pardis, Lok, Dante · 2025

Reliable depth estimation under real optical conditions remains a core challenge for camera vision in systems such as autonomous robotics and augmented reality. Despite recent progress in depth estimation and depth-of-field rendering, rese…

DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven Illumination Open

Ou Mingyang, Li, Haojin, Zhang Yi-feng, Niu Ke, Qiu, Zhongxi, et al. · 2025

Self-supervised monocular depth estimation serves as a key task in the development of endoscopic navigation systems. However, performance degradation persists due to uneven illumination inherent in endoscopic images, particularly in low-in…