Explanipedia

Don't look at the camera: Achieving perceived eye contact in remote video communication Open

Alice L. L. Gao, Samyukta Jayakumar, Marcello Maniglia, Brian Curless, Ira Kemelmacher-Shlizerman, et al. · 2025

Eye contact is a crucial aspect of social interaction, conveying social cues based on the direction of one's gaze. Perceiving eye contact affects behavior and social processing. The widespread use of remote video conferencing technologies …

Fast and Scalable Mixed Precision Euclidean Distance Calculations Using GPU Tensor Cores Open

Brian Curless, Michael Gowanlock · 2025

Modern GPUs are equipped with tensor cores (TCs) that are commonly used for matrix multiplication in artificial intelligence workloads. However, because they have high computational throughput, they can lead to significant performance gain…

GenEscape: Hierarchical Multi-Agent Generation of Escape Room Puzzles Open

Brian Curless, Steve Seitz · 2025

We challenge text-to-image models with generating escape room puzzle images that are visually appealing, logically solid, and intellectually stimulating. While base image models struggle with spatial relationships and affordance reasoning,…

How Animals Dance (When You're Not Looking) Open

Aleksander Holynski, Brian Curless, Ira Kemelmacher, Steven M. Seitz · 2025

We present a framework for generating music-synchronized, choreography-aware animal dance videos. Our framework introduces choreography patterns -- structured sequences of motion beats that define the long-range structure of a dance -- as …

Generating Fit Check Videos with a Handheld Camera Open

Bo‐Wei Chen, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz · 2025

Self-captured full-body videos are popular, but most deployments require mounted cameras, carefully-framed shots, and repeated practice. We propose a more convenient solution that enables full-body video capture using handheld mobile devic…

View2CAD: Reconstructing View-Centric CAD Models from Single RGB-D Scans Open

Brian Curless · 2025

Parametric CAD models, represented as Boundary Representations (B-reps), are foundational to modern design and manufacturing workflows, offering the precision and topological breakdown required for downstream tasks such as analysis, editin…

VidPanos: Generative Panoramic Videos from Casual Panning Videos Open

Jingwei Ma, Erika Lu, Roni Paiss, Shiran Zada, Aleksander Holynski, et al. · 2024

Panoramic image stitching provides a unified, wide-angle view of a scene that extends beyond the camera's field of view. Stitching frames of a panning video into a panoramic photograph is a well-understood problem for stationary scenes, bu…

Inverse Painting: Reconstructing The Painting Process Open

Bo‐Wei Chen, Yifan Wang, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz · 2024

Art Computer science Mathematics

Given an input painting, we reconstruct a time-lapse video of how it may have been painted. We formulate this as an autoregressive image generation problem, in which an initially blank "canvas" is iteratively updated. The model learns from…

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation Open

Xiaojuan Wang, Boyang Zhou, Brian Curless, Ira Kemelmacher-Shlizerman, Aleksander Holynski, et al. · 2024

Computer science

We present a method for generating video sequences with coherent motion between a pair of input key frames. We adapt a pretrained large-scale image-to-video diffusion model (originally trained to generate videos moving forward in time from…

ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Open

Meng-Li Shih, Wei-Chiu Ma, Lorenzo Boyice, Aleksander Holynski, Forrester Cole, et al. · 2024

Computer science Geography Physics

We propose ExtraNeRF, a novel method for extrapolating the range of views handled by a Neural Radiance Field (NeRF). Our main idea is to leverage NeRFs to model scene-specific, fine-grained details, while capitalizing on diffusion models t…

Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis Open

Yifan Wang, Aleksander Holynski, Brian Curless, Steven M. Seitz · 2024

Computer science Physics Geology

We present Infinite Texture, a method for generating arbitrarily large texture images from a text prompt. Our approach fine-tunes a diffusion model on a single texture, and learns to embed that statistical distribution in the output domain…

Don't Look at the Camera: Achieving Perceived Eye Contact Open

Alice L. L. Gao, Samyukta Jayakumar, Marcello Maniglia, Brian Curless, Ira Kemelmacher-Shlizerman, et al. · 2024

Computer science Psychology Medicine

We consider the question of how to best achieve the perception of eye contact when a person is captured by camera and then rendered on a 2D display. For single subjects photographed by a camera, conventional wisdom tells us that looking di…

Animating Street View Open

Mengyi Shan, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz · 2023

Computer science Geography

We present a system that automatically brings street view imagery to life by populating it with naturally behaving, animated pedestrians and vehicles. Our approach is to remove existing people and vehicles from the input image, insert m…

Generative Powers of Ten Open

Xiaojuan Wang, Janne Kontkanen, Brian Curless, Steve Seitz, Ira Kemelmacher, et al. · 2023

Computer science Mathematics Geography

We present a method that uses a text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene, e.g., ranging from a wide-angle landscape view of a forest to a macro shot of an…

Total Selfie: Generating Full-Body Selfies Open

Bo‐Wei Chen, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz · 2023

Computer science Mathematics Sociology

We present a method to generate full-body selfies from photographs originally taken at arm's length. Because self-captured photos are typically taken close up, they have limited field of view and exaggerated perspective that distorts facial…

Controllable Light Diffusion for Portraits Open

David Futschik, Kelvin Ritland, James Vecore, Sean Fanello, Sergio Orts‐Escolano, et al. · 2023

Computer science Art Physics

We introduce light diffusion, a novel method to improve lighting in portraits, softening harsh shadows and specular highlights while preserving overall scene illumination. Inspired by professional photographers' diffusers and scrims, our m…

PersonNeRF: Personalized Reconstruction from Photo Collections Open

Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman · 2023

Computer science

We present PersonNeRF, a method that takes a collection of photos of a subject (e.g. Roger Federer) captured across multiple years with arbitrary body poses and appearances, and enables rendering the subject with arbitrary novel combinatio…

Mates2Motion: Learning How Mechanical CAD Assemblies Work Open

James Noeckel, Benjamin T. Jones, Karl D. D. Willis, Brian Curless, Adriana Schulz · 2022

Computer science Engineering Physics

We describe our work on inferring the degrees of freedom between mated parts in mechanical assemblies using deep learning on CAD representations. We train our model using a large dataset of real-world mechanical assemblies consisting of CA…

A device‐agnostic shape model for automated body composition estimates from 3D optical scans Open

Isaac Y. Tian, M. C. Wong, Samantha Kennedy, Nisa N. Kelly, Yong E. Liu, et al. · 2022

Computer science Mathematics

Background: Many predictors of morbidity caused by metabolic disease are associated with body shape. 3D optical (3DO) scanning captures body shape and has been shown to accurately and precisely predict body composition variables associated …

3D Moments from Near-Duplicate Photos Open

Qianqian Wang, Zhengqi Li, David Salesin, Noah Snavely, Brian Curless, et al. · 2022

Computer science Art Philosophy

We introduce 3D Moments, a new computational photography effect. As input we take a pair of near-duplicate photos, i.e., photos of moving subjects from similar viewpoints, common in people's photo collections. As output, we produce a video…

FILM: Frame Interpolation for Large Motion Open

Fitsum A. Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, et al. · 2022

Computer science Materials science Geography

We present a frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion. Recent methods use multiple networks to estimate optical flow or depth and a separate network dedi…

HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video Open

Chung-Yi Weng, Brian Curless, Pratul P. Srinivasan, Jonathan T. Barron, Ira Kemelmacher-Shlizerman · 2022

Computer science

We introduce a free-viewpoint rendering method -- HumanNeRF -- that works on a given monocular video of a human performing complex body motions, e.g. a video from YouTube. Our method enables pausing the video at any frame and rendering the…

A Light Stage on Every Desk Open

Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz · 2021

Computer science Biology Materials science

Every time you sit in front of a TV or monitor, your face is actively illuminated by time-varying patterns of light. This paper proposes to use this time-varying illumination for synthetic relighting of your face with any new illumination …

Brian Curless