Exploring foci of:
doi.org
Fine-Tuning Image-Conditional Diffusion Models is Easier than you Think
February 2025 • Glenn Garcia, Karim Abou Zeid, Christian Schmidt, Daan de Geus, Alexander Hermans, Bastian Leibe
Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results, high computational demands due to multi-step inference limited its use in many scenarios. In this paper, we show that the perceived inefficiency was caused by a flaw in the inference pipeline that has so far gone unnoticed. The fixed model performs comparably to the best previo…
Computer Science
Artificial Intelligence
Computer Vision
Computer Graphics
Physics