Daniel Gläsner
LatentCRF: Continuous CRF for Efficient Latent Diffusion
Latent Diffusion Models (LDMs) produce high-quality, photo-realistic images; however, the latency incurred by multiple costly inference iterations can restrict their applicability. We introduce LatentCRF, a continuous Conditional Random Field…
Rethinking FID: Towards a Better Evaluation Metric for Image Generation
As with many machine learning problems, the progress of image generation methods hinges on good evaluation metrics. One of the most popular is the Fréchet Inception Distance (FID). FID estimates the distance between a distribution of Inception…
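For reference, FID fits a Gaussian to the Inception feature embeddings of the real and of the generated images and reports the Fréchet (Wasserstein-2) distance between the two Gaussians. The sketch below illustrates that standard computation, assuming the Inception features have already been extracted; the function and variable names are illustrative and not taken from the paper.

```python
import numpy as np
from scipy.linalg import sqrtm


def frechet_inception_distance(real_feats: np.ndarray, fake_feats: np.ndarray) -> float:
    """FID between two feature sets, each of shape (num_images, feat_dim)."""
    # Fit a Gaussian (mean and covariance) to each set of Inception features.
    mu_r, mu_f = real_feats.mean(axis=0), fake_feats.mean(axis=0)
    sigma_r = np.cov(real_feats, rowvar=False)
    sigma_f = np.cov(fake_feats, rowvar=False)

    # Matrix square root of the covariance product; drop tiny imaginary parts
    # that arise from numerical error.
    covmean = sqrtm(sigma_r @ sigma_f)
    if np.iscomplexobj(covmean):
        covmean = covmean.real

    diff = mu_r - mu_f
    return float(diff @ diff + np.trace(sigma_r + sigma_f - 2.0 * covmean))
```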
MarkovGen: Structured Prediction for Efficient Text-to-Image Generation
Modern text-to-image generation models produce high-quality images that are both photorealistic and faithful to the text prompts. However, this quality comes at significant computational cost: nearly all of these models are iterative and r…
On the Effectiveness of Impedance-Based Fingerprint Presentation Attack Detection
Within the last few decades, the need for subject authentication has grown steadily, and biometric recognition technology has been established as a reliable alternative to passwords and tokens, offering automatic decisions. However, as uns…
Balancing Robustness and Sensitivity using Feature Contrastive Learning
It is generally believed that robust training of extremely large networks is critical to their success in real-world applications. However, when taken to the extreme, methods that promote robustness can hurt the model's sensitivity to rare…
Balancing Constraints and Submodularity in Data Subset Selection
Deep learning has yielded extraordinary results in vision and natural language processing, but this achievement comes at a cost. Most deep learning models require enormous resources during training, both in terms of computation and in human…
Less is more: Selecting informative and diverse subsets with balancing constraints
Deep learning has yielded extraordinary results in vision and natural language processing, but this achievement comes at a cost. Most models require enormous resources during training, both in terms of computation and in human labeling effort…
Understanding Robustness of Transformers for Image Classification
Deep Convolutional Neural Networks (CNNs) have long been the architecture of choice for computer vision tasks. Recently, Transformer-based architectures like Vision Transformer (ViT) have matched or even surpassed ResNets for image classification…
Half-occlusion boundary detectors in computational stereo vision
There are two sources of depth information in a stereo pair. One is the correlation signal from smooth surface regions that are visible to both eyes, which provides depth information via triangulation. The other is the decorrelation signal…
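For context, the triangulation behind that correlation signal is the textbook rectified-stereo relation: depth Z = f·B / d, where f is the focal length, B the camera baseline, and d the disparity between the two views. The minimal sketch below illustrates this relation only; the function name and units are assumptions for illustration, not code from the paper.

```python
def depth_from_disparity(disparity_px: float, focal_length_px: float, baseline_m: float) -> float:
    """Depth (meters) of a point in a rectified stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        # Zero or negative disparity has no valid triangulated depth
        # (e.g., points visible to only one camera, i.e., half-occlusions).
        raise ValueError("Disparity must be positive for a binocularly visible point.")
    return focal_length_px * baseline_m / disparity_px
```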
A dynamic programming algorithm for perceptually consistent stereo
This document provides details of the dynamic programming algorithm discussed in "Towards perceptually consistent stereo: a scanline study." For the motivation of the algorithm, see that paper.