Explanipedia

Disentangled 3D Scene Generation with Layout Learning Open

Dave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski · 2024

Computer science

We introduce a method to generate 3D scenes that are disentangled into their component objects. This disentanglement is unsupervised, relying only on the knowledge of a large pretrained text-to-image model. Our key insight is that objects …

Diffusion Self-Guidance for Controllable Image Generation Open

Dave Epstein, Allan Jabri, Ben Poole, Alexei A. Efros, Aleksander Holynski · 2023

Computer science

Large-scale generative models are capable of producing high-quality images from detailed text descriptions. However, many aspects of an image are difficult or impossible to convey through text. We introduce self-guidance, a method that pro…

BlobGAN: Spatially Disentangled Scene Representations Open

Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros · 2022

Computer science Mathematics Political science

We propose an unsupervised, mid-level representation for a generative model of scenes. The representation is mid-level in that it is neither per-pixel nor per-image; rather, scenes are modeled as a collection of spatial, depth-ordered "blo…

Learning Temporal Dynamics from Cycles in Narrated Video Open

Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun · 2021

Computer science Mathematics Sociology

Learning to model how the world changes as time elapses has proven a challenging problem for the computer vision community. We propose a self-supervised solution to this problem using temporal cycle consistency jointly in vision and langua…

Globetrotter: Unsupervised Multilingual Translation from Visual Alignment Open

Dídac Surís Coll-Vinent, Dave Epstein, Carl Vondrick · 2021

Computer science Philosophy Chemistry

Machine translation in a multi-language scenario requires large-scale parallel corpora for every language pair. Unsupervised translation is challenging because there is no explicit connection between languages, and the existing methods hav…

Learning Temporal Dynamics from Cycles in Narrated Video Open

Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun · 2021

Computer science Mathematics Sociology

Learning to model how the world changes as time elapses has proven a challenging problem for the computer vision community. We propose a self-supervised solution to this problem using temporal cycle consistency jointly in vision and langua…

Globetrotter: Connecting Languages by Connecting Images Open

Dídac Surís, Dave Epstein, Carl Vondrick · 2020

Computer science Medicine Philosophy

Machine translation between many languages at once is highly challenging, since training with ground truth requires supervision between all language pairs, which is difficult to obtain. Our key insight is that, while languages may vary dra…

Learning Goals from Failure Open

Dave Epstein, Carl Vondrick · 2020

Computer science Physics

We introduce a framework that predicts the goals behind observable human action in video. Motivated by evidence in developmental psychology, we leverage video of unintentional action to learn video representations of goals without direct s…

Video Representations of Goals Emerge from Watching Failure. Open

Dave Epstein, Carl Vondrick · 2020

Computer science Physics Political science

We introduce a video representation learning framework that models the latent goals behind observable human action. Motivated by how children learn to reason about goals and intentions by experiencing failure, we leverage unconstrained vid…

Oops! Predicting Unintentional Action in Video Open

Dave Epstein, Boyuan Chen, Carl Vondrick · 2020

Computer science History Physics

From just a short glance at a video, we can often tell whether a person's action is intentional or not. Can we train a model to recognize this? We introduce a dataset of in-the-wild videos of unintentional action, as well as a suite of tas…

NEUZZ: Efficient Fuzzing with Neural Program Smoothing Open

Dongdong She, Kexin Pei, Dave Epstein, Junfeng Yang, Baishakhi Ray , et al. · 2019

Computer science

Fuzzing has become the de facto standard technique for finding software vulnerabilities. However, even state-of-the-art fuzzers are not very efficient at finding hard-to-trigger software bugs. Most popular fuzzers use evolutionary guidance…

NEUZZ: Efficient Fuzzing with Neural Program Learning Open

Dongdong She, Kexin Pei, Dave Epstein, Junfeng Yang, Baishakhi Ray , et al. · 2018

Computer science

Fuzzing has become the de facto standard technique for finding software vulnerabilities. However, even state-of-the-art fuzzers are not very efficient at finding hard-to-trigger software bugs. Most popular fuzzers use evolutionary guidance…

NEUZZ: Efficient Fuzzing with Neural Program Smoothing Open

Dongdong She, Kexin Pei, Dave Epstein, Junfeng Yang, Baishakhi Ray , et al. · 2018

Computer science

Fuzzing has become the de facto standard technique for finding software vulnerabilities. However, even state-of-the-art fuzzers are not very efficient at finding hard-to-trigger software bugs. Most popular fuzzers use evolutionary guidance…

Dave Epstein YOU? Author Swipe