Jonathan Lorraine
Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond
We introduce Audio-SDS, a generalization of Score Distillation Sampling (SDS) to text-conditioned audio diffusion models. While SDS was initially designed for text-to-3D generation using image diffusion, its core idea of distilling a power…
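At its core, SDS backpropagates a pretrained diffusion model's denoising residual through a differentiable renderer. Below is a rough sketch under toy assumptions: the "renderer" is the identity and the denoiser function is a placeholder standing in for a pretrained text-conditioned model, so the hypothetical names here are illustrative rather than the paper's code.

    import numpy as np

    rng = np.random.default_rng(0)

    def denoiser(x_t, sigma, prompt):
        # Hypothetical stand-in for a pretrained diffusion model's noise
        # prediction eps_hat(x_t; prompt, sigma); it just nudges the sample
        # toward zero so the example runs end to end.
        return x_t * 0.1

    def sds_grad(theta, prompt, sigma=0.5, weight=1.0):
        # Render the parameters into the modality the diffusion model sees.
        # In this toy the renderer is the identity, so d(render)/d(theta) = I
        # and the SDS gradient reduces to weight * (eps_hat - eps).
        x = theta
        eps = rng.standard_normal(x.shape)
        x_t = x + sigma * eps                   # forward-diffuse the rendering
        eps_hat = denoiser(x_t, sigma, prompt)  # predicted noise
        return weight * (eps_hat - eps)

    theta = rng.standard_normal(16)             # toy parameters being distilled
    for _ in range(100):
        theta -= 1e-2 * sds_grad(theta, "a dog barking")

In the audio setting, the rendering step would instead map synthesizer, separation, or source parameters to the waveform or spectrogram representation the diffusion model was trained on.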
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
This work explores expanding the capabilities of large language models (LLMs) pretrained on text to generate 3D meshes within a unified model. This offers key advantages of (1) leveraging spatial knowledge already embedded in LLMs, derived…
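The enabling observation is that a mesh can be written as plain text an LLM can read and emit. A minimal sketch of that idea, assuming an OBJ-style serialization with coordinates quantized to a small integer grid (the paper's exact tokenization may differ):

    def mesh_to_text(vertices, faces, bins=64):
        # Quantize vertex coordinates (assumed in [-1, 1]) to a small integer
        # grid so the mesh fits a short token budget, then emit OBJ-style lines.
        lines = []
        for x, y, z in vertices:
            q = [int(round((c + 1) / 2 * (bins - 1))) for c in (x, y, z)]
            lines.append("v {} {} {}".format(*q))
        for a, b, c in faces:
            lines.append(f"f {a + 1} {b + 1} {c + 1}")  # OBJ faces are 1-indexed
        return "\n".join(lines)

    # A single triangle, expressed as text an LLM could ingest or generate.
    print(mesh_to_text([(-0.5, 0.0, 0.0), (0.5, 0.0, 0.0), (0.0, 0.8, 0.0)],
                       [(0, 1, 2)]))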
Multi-student Diffusion Distillation for Better One-step Generators
Diffusion models achieve high-quality sample generation at the cost of a lengthy multistep inference procedure. To overcome this, diffusion distillation techniques produce student generators capable of matching or surpassing the teacher in…
Scalable Nested Optimization for Deep Learning
Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this, where we have a bilevel or …
Improving Hyperparameter Optimization with Checkpointed Model Weights
When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a…
Training Data Attribution via Approximate Unrolled Differentiation
Many training data attribution (TDA) methods aim to estimate how a model's behavior would change if one or more data points were removed from the training set. Methods based on implicit differentiation, such as influence functions, can be …
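As a rough illustration of unrolled differentiation for attribution (generic forward-mode differentiation through an SGD unroll, not the paper's specific approximation), the sketch below trains a one-parameter regression with per-example weights and carries dw/d(example weight) forward, so the validation loss's sensitivity to each training example falls out at the end.

    import numpy as np

    # Toy data: 1-parameter linear regression, y ~ w * x.
    x = np.array([1.0, 2.0, 3.0])
    y = np.array([1.1, 1.9, 3.2])
    x_val, y_val = 2.5, 2.4
    eps = np.ones_like(x)          # per-example weights (all 1 at the nominal run)
    eta, steps = 0.05, 50

    # Unroll SGD on the eps-weighted training loss while carrying dw/deps_i
    # forward (forward-mode differentiation through the unroll).
    w = 0.0
    dw_deps = np.zeros_like(x)
    for _ in range(steps):
        resid = w * x - y                      # per-example residuals
        grad = np.sum(eps * resid * x)         # d(train loss)/dw
        # Differentiate the update w <- w - eta * grad w.r.t. each eps_i.
        dgrad_dw = np.sum(eps * x * x)
        dw_deps = dw_deps * (1 - eta * dgrad_dw) - eta * resid * x
        w -= eta * grad

    # Attribution: how the validation loss would respond to up/down-weighting
    # each training example.
    dval_dw = (w * x_val - y_val) * x_val
    print("influence estimates:", dval_dw * dw_deps)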
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficien…
Graph Metanetworks for Processing Diverse Neural Architectures
Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of acco…
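A minimal sketch of the underlying representation, assuming the simplest possible construction (neurons as nodes, weights as directed edge features); the paper's graph construction also handles biases, normalization layers, and other architecture components.

    import numpy as np

    def mlp_to_graph(weight_matrices):
        # Nodes are neurons (one per input/hidden/output unit); each weight
        # becomes a directed edge carrying the weight value as its feature.
        sizes = [weight_matrices[0].shape[1]] + [W.shape[0] for W in weight_matrices]
        offsets = np.cumsum([0] + sizes)           # node-id offset of each layer
        edges, feats = [], []
        for layer, W in enumerate(weight_matrices):
            out_dim, in_dim = W.shape
            for i in range(out_dim):
                for j in range(in_dim):
                    edges.append((offsets[layer] + j, offsets[layer + 1] + i))
                    feats.append(W[i, j])
        return int(offsets[-1]), np.array(edges), np.array(feats)

    # A tiny 2-3-1 MLP becomes 6 neuron nodes and 9 weight edges, which a graph
    # neural network could then process as ordinary graph-structured input.
    rng = np.random.default_rng(0)
    n_nodes, edge_index, edge_feat = mlp_to_graph([rng.standard_normal((3, 2)),
                                                   rng.standard_normal((1, 3))])
    print(n_nodes, edge_index.shape, edge_feat.shape)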
Using Large Language Models for Hyperparameter Optimization
This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on…
ATT3D: Amortized Text-to-3D Object Synthesis
Text-to-3D modelling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimi…
On Implicit Bias in Overparameterized Bilevel Optimization
Many problems in machine learning involve bilevel optimization (BLO), including hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems consist of two nested sub-problems, called the outer and inner problems,…
Task Selection for AutoML System Evaluation
Our goal is to assess if AutoML system changes - i.e., to the search space or hyperparameter optimization - will improve the final model's performance on production tasks. However, we cannot test the changes on production tasks. Instead, w…
Lyapunov Exponents for Diversity in Differentiable Games
Ridge Rider (RR) is an algorithm for finding diverse solutions to optimization problems by following eigenvectors of the Hessian ("ridges"). RR is designed for conservative gradient systems (i.e., settings involving a single loss function)…
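A toy sketch of the ridge-following idea: branch from a saddle along Hessian eigenvectors computed with numpy, then descend, and observe that different branches can land in different solutions. The actual RR algorithm, and its Lyapunov-exponent extension to games, track ridges far more carefully than this.

    import numpy as np

    def loss(p):
        # A toy loss with a saddle at the origin and multiple descent directions.
        x, y = p
        return 0.25 * (x ** 2 - 1) ** 2 + 0.5 * y ** 2 - 0.1 * x * y

    def grad(p):
        x, y = p
        return np.array([x * (x ** 2 - 1) - 0.1 * y, y - 0.1 * x])

    def hessian(p):
        x, y = p
        return np.array([[3 * x ** 2 - 1, -0.1],
                         [-0.1, 1.0]])

    # Start near the saddle, branch along each Hessian eigenvector ("ridge"),
    # then run gradient descent from each branch point.
    start = np.array([0.0, 0.0])
    eigvals, eigvecs = np.linalg.eigh(hessian(start))
    for k in range(len(eigvals)):
        for sign in (+1.0, -1.0):
            p = start + 0.5 * sign * eigvecs[:, k]
            for _ in range(200):
                p -= 0.05 * grad(p)
            print(f"ridge {k}, sign {sign:+}: solution {np.round(p, 3)}")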
Input Convex Gradient Networks
The gradients of convex functions are expressive models of non-trivial vector fields. For example, Brenier's theorem yields that the optimal transport map between any two measures on Euclidean space under the squared distance is realized a…
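As a small illustration of why such gradient maps are interesting (not the paper's architecture, which parameterizes the gradient field directly), the sketch below evaluates the analytic gradient of a simple convex potential and checks the monotonicity that convexity guarantees.

    import numpy as np

    def softmax(z):
        z = z - z.max()
        e = np.exp(z)
        return e / e.sum()

    # Convex potential f(x) = 0.5 * ||A x||^2 + logsumexp(W x): a quadratic plus
    # a convex log-sum-exp term, so its gradient is a monotone vector field.
    rng = np.random.default_rng(0)
    A = rng.standard_normal((3, 3))
    W = rng.standard_normal((4, 3))

    def grad_f(x):
        # Analytic gradient of the potential: A^T A x + W^T softmax(W x).
        return A.T @ (A @ x) + W.T @ softmax(W @ x)

    # Monotonicity check: (grad_f(x) - grad_f(y)) . (x - y) >= 0 for convex f.
    x, y = rng.standard_normal(3), rng.standard_normal(3)
    print(np.dot(grad_f(x) - grad_f(y), x - y))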
Meta-Learning to Improve Pre-Training
Pre-training (PT) followed by fine-tuning (FT) is an effective method for training neural networks, and has led to significant performance improvements in many domains. PT can incorporate various design choices such as task and data reweig…
Complex Momentum for Learning in Games
We generalize gradient descent with momentum for learning in differentiable games to have complex-valued momentum. We give theoretical motivation for our method by proving convergence on bilinear zero-sum games for simultaneous and alterna…
Complex Momentum for Optimization in Games
We generalize gradient descent with momentum for optimization in differentiable games to have complex-valued momentum. We give theoretical motivation for our method by proving convergence on bilinear zero-sum games for simultaneous and alt…
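One plausible reading of the update, sketched on the bilinear zero-sum game min_x max_y x*y with simultaneous steps: the momentum buffer is complex-valued and only its real part is applied to the real parameters. The constants below are illustrative, not the paper's recommended settings, and plain real momentum would cycle or diverge on this game.

    import numpy as np

    # Bilinear zero-sum game: x minimizes f(x, y) = x * y, y maximizes it.
    x, y = 1.0, 1.0
    mx, my = 0j, 0j                 # complex momentum buffers
    alpha, beta = 0.3, 0.9j         # step size; purely imaginary momentum

    for step in range(301):
        gx, gy = y, -x              # simultaneous gradients for each player
        mx = beta * mx - gx
        my = beta * my - gy
        x += alpha * mx.real        # parameters stay real: apply the real part
        y += alpha * my.real
        if step % 100 == 0:
            print(step, round(abs(x) + abs(y), 4))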
Optimizing Millions of Hyperparameters by Implicit Differentiation
We propose an algorithm for inexpensive gradient-based hyperparameter optimization that combines the implicit function theorem (IFT) with efficient inverse Hessian approximations. We present results about the relationship between the IFT a…
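A minimal sketch of the core computation on a one-parameter toy problem with a weight-decay hyperparameter, where the implicit-function-theorem hypergradient and a truncated Neumann-series approximation of the inverse Hessian can be checked against the exact answer. The function names below are illustrative, not the paper's code.

    import numpy as np

    # Toy problem: train loss L_T(w, lam) = 0.5*(w - 2)^2 + 0.5*exp(lam)*w^2
    # (exp(lam) is a weight-decay strength), val loss L_V(w) = 0.5*(w - 1)^2.
    lam = 0.5
    d, v = 2.0, 1.0

    def train_grad_w(w, lam):      # dL_T/dw
        return (w - d) + np.exp(lam) * w

    def train_hess_ww(w, lam):     # d^2 L_T / dw^2
        return 1.0 + np.exp(lam)

    def train_grad_wlam(w, lam):   # d^2 L_T / (dw dlam)
        return np.exp(lam) * w

    def val_grad_w(w):             # dL_V/dw
        return w - v

    # Inner optimization: for this toy the optimum is available in closed form.
    w_star = d / (1.0 + np.exp(lam))

    # IFT hypergradient: dL_V/dlam = - dL_V/dw * H^{-1} * d^2 L_T/(dw dlam),
    # with H^{-1} approximated by a truncated Neumann series
    # H^{-1} ~= eta * sum_j (1 - eta*H)^j.
    H = train_hess_ww(w_star, lam)
    eta, terms = 0.1, 50
    inv_H_approx = eta * sum((1 - eta * H) ** j for j in range(terms))
    hypergrad = -val_grad_w(w_star) * inv_H_approx * train_grad_wlam(w_star, lam)

    print("Neumann estimate:", hypergrad,
          " exact:", -val_grad_w(w_star) / H * train_grad_wlam(w_star, lam))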
Understanding Neural Architecture Search Techniques
Automatic methods for generating state-of-the-art neural network architectures without human experts have generated significant attention recently. This is because of the potential to remove human experts from the design loop which can red…
Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions
Hyperparameter optimization can be formulated as a bilevel optimization problem, where the optimal parameters on the training set depend on the hyperparameters. We aim to adapt regularization hyperparameters for neural networks by fitting …
Stochastic Hyperparameter Optimization through Hypernetworks
Machine learning models are often tuned by nesting optimization of model weights inside the optimization of hyperparameters. We give a method to collapse this nested optimization into joint stochastic optimization of weights and hyperparam…
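A rough sketch of that collapse, assuming the simplest possible setup: a linear "hypernetwork" w(lam) = a + b*lam is fit to the training loss at hyperparameters sampled near the current lam, alternating with validation-driven updates to lam. It reuses the toy problem from the implicit-differentiation sketch above, whose best hyperparameter is lam = 0; constants are illustrative and untuned.

    import numpy as np

    # Toy nested problem: train loss L_T(w, lam) = 0.5*(w - 2)^2 + 0.5*exp(lam)*w^2,
    # val loss L_V(w) = 0.5*(w - 1)^2.
    rng = np.random.default_rng(0)
    a, b = 0.0, 0.0        # linear "hypernetwork" w(lam) = a + b*lam
    lam = 1.0
    inner_lr, outer_lr, sigma = 0.05, 0.3, 0.5

    for outer in range(300):
        # Fit the hypernetwork on training loss at hyperparameters sampled near lam.
        for _ in range(10):
            lam_s = lam + sigma * rng.standard_normal()
            w = a + b * lam_s
            dLdw = (w - 2.0) + np.exp(lam_s) * w
            a -= inner_lr * dLdw            # chain rule: dw/da = 1
            b -= inner_lr * dLdw * lam_s    # chain rule: dw/db = lam_s
        # Update the hyperparameter on validation loss through the hypernetwork.
        w = a + b * lam
        lam -= outer_lr * (w - 1.0) * b     # dL_V/dlam = dL_V/dw * dw/dlam
        if outer % 100 == 0:
            print(outer, round(lam, 3))     # lam should drift toward 0 here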