Jimmy Ba
Neuromodulatory Control Networks (NCNs): A Biologically Inspired Architecture for Dynamic LLM Processing
Large Language Models (LLMs) based on the Transformer architecture have achieved remarkable success, yet their core processing mechanisms remain largely static after training. While powerful, this static nature limits their ability to dyna…
Mastering diverse control tasks through world models
Developing a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement-learning algorithms can be readily applied to tasks s…
Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries
The rapid development and dynamic nature of large language models (LLMs) make it difficult for conventional quantitative benchmarks to accurately assess their capabilities. We propose report cards, which are human-interpretable, natural la…
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, gov…
Using Large Language Models for Hyperparameter Optimization
This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on…
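A rough sketch of the idea, as a hedged illustration rather than the paper's exact protocol: show the LLM the search space and past trials, ask it to propose the next configuration, evaluate, and repeat. `query_llm` below is a hypothetical stand-in for any chat-completion client (here a toy random proposer so the snippet runs), and the prompt format is an assumption.

```python
# Illustrative LLM-driven hyperparameter search loop; the prompt format and
# `query_llm` are assumptions, not the authors' exact method.
import json, random

def query_llm(prompt: str) -> str:
    # Hypothetical stand-in for a chat-completion API call. For a runnable
    # demo it proposes a random config; a real LLM would condition on the
    # trial history embedded in `prompt`.
    return json.dumps({"lr": 10 ** random.uniform(-5, -1),
                       "batch_size": random.choice([32, 64, 128])})

def llm_hpo(train_and_eval, search_space: dict, num_rounds: int = 10):
    history = []  # (config, validation score) pairs shown back to the model
    for _ in range(num_rounds):
        prompt = (
            "You are tuning a machine learning model.\n"
            f"Search space: {json.dumps(search_space)}\n"
            f"Past trials (config, val_score): {json.dumps(history)}\n"
            "Propose the next config as a JSON object."
        )
        config = json.loads(query_llm(prompt))
        score = train_and_eval(config)  # user-supplied objective
        history.append((config, score))
    return max(history, key=lambda t: t[1])
```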
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
There is growing evidence that pretraining on high-quality, carefully thought-out tokens such as code or mathematics plays an important role in improving the reasoning abilities of large language models. For example, Minerva, a PaLM model …
Identifying the Risks of LM Agents with an LM-Emulated Sandbox
Recent advances in Language Model (LM) agents and tool use, exemplified by applications like ChatGPT Plugins, enable a rich set of capabilities but also amplify potential risks - such as leaking private data or causing financial losses. Id…
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Constructing AI models that respond to text instructions is challenging, especially for sequential decision-making tasks. This work introduces a methodology, inspired by unCLIP, for instruction-tuning generative models of behavior without …
Training on Thin Air: Improve Image Classification with Generated Data
Acquiring high-quality data for training discriminative models is a crucial yet challenging aspect of building effective predictive systems. In this paper, we present Diffusion Inversion, a simple yet effective method that leverages the pr…
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Large language models (LLMs) such as ChatGPT have seen widespread adoption due to their strong instruction-following abilities. Developing these LLMs involves a complex yet poorly understood workflow requiring training with human feedback.…
Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
We present Clinical Camel, an open large language model (LLM) explicitly tailored for clinical research. Fine-tuned from LLaMA-2 using QLoRA, Clinical Camel achieves state-of-the-art performance across medical benchmarks among openly avail…
Residual Prompt Tuning: Improving Prompt Tuning with Residual Reparameterization
Prompt tuning is one of the successful approaches for parameter-efficient tuning of pre-trained language models. Despite being arguably the most parameter-efficient (tuned soft prompts constitute <0.1% of total parameters), it typically pe…
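A minimal sketch of the residual reparameterization the title refers to: the trainable soft prompt is passed through a shallow MLP whose output is added back to the raw prompt embeddings. The backbone dimension, MLP width, and initialization below are illustrative assumptions, not the paper's exact settings.

```python
# Residual reparameterization of a soft prompt for a frozen language model.
import torch
import torch.nn as nn

class ResidualPrompt(nn.Module):
    def __init__(self, num_tokens: int = 10, d_model: int = 768, d_hidden: int = 256):
        super().__init__()
        self.prompt = nn.Parameter(torch.randn(num_tokens, d_model) * 0.02)
        # Shallow bottleneck MLP whose output is added back to the raw
        # prompt embeddings (the residual connection).
        self.mlp = nn.Sequential(
            nn.Linear(d_model, d_hidden),
            nn.ReLU(),
            nn.Linear(d_hidden, d_model),
        )

    def forward(self) -> torch.Tensor:
        # Reparameterized prompt: MLP(P) + P. After training, the MLP can be
        # discarded and the final prompt stored directly.
        return self.mlp(self.prompt) + self.prompt

# The returned (num_tokens, d_model) tensor is prepended to the input
# embeddings of the frozen backbone.
```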
TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation
We propose TR0N, a highly general framework to turn pre-trained unconditional generative models, such as GANs and VAEs, into conditional models. The conditioning can be highly arbitrary, and requires only a pre-trained auxiliary model. For…
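A hedged sketch of the translator idea: learn a small network that maps auxiliary-model embeddings (e.g., CLIP features) into the latent space of a frozen generator, trained entirely on pairs the two frozen models produce themselves. Both `generator` and `embedder` below are toy stand-ins, and the dimensions and training loop are illustrative assumptions.

```python
# Train a translator from embedding space to generator latent space using
# self-generated (latent, embedding) pairs; no labeled data required.
import torch
import torch.nn as nn

z_dim, e_dim = 128, 512
generator = nn.Sequential(nn.Linear(z_dim, 1024), nn.Tanh())  # stand-in for a frozen GAN/VAE decoder
embedder = nn.Sequential(nn.Linear(1024, e_dim))              # stand-in for e.g. a CLIP image encoder
translator = nn.Sequential(nn.Linear(e_dim, 256), nn.ReLU(), nn.Linear(256, z_dim))

opt = torch.optim.Adam(translator.parameters(), lr=1e-3)
for step in range(1000):
    z = torch.randn(64, z_dim)                       # sample latents
    with torch.no_grad():
        e = embedder(generator(z))                   # embed the generated samples
    loss = nn.functional.mse_loss(translator(e), z)  # regress embedding -> latent
    opt.zero_grad(); loss.backward(); opt.step()

# At test time, embed a condition (e.g., a caption via the auxiliary model's
# text tower), translate it to a latent, and decode with the frozen generator.
```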
Boosted Prompt Ensembles for Large Language Models
Methods such as chain-of-thought prompting and self-consistency have pushed the frontier of language model reasoning performance with no additional training. To further improve performance, we propose a prompt ensembling method for large l…
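A minimal sketch of what a boosted prompt ensemble could look like: greedily grow a set of few-shot prompts, each chosen to perform well on the examples the current ensemble still gets wrong, then answer by majority vote. `answer_with_prompt` is a hypothetical LLM call (replaced here by a toy deterministic function so the snippet runs); the selection rule is an illustrative assumption.

```python
# Greedy boosting over a pool of candidate prompts.
from collections import Counter

def answer_with_prompt(prompt, question):
    # Hypothetical stand-in for an LLM call; replace with a real client.
    return hash((prompt, question)) % 2  # toy binary "answer"

def ensemble_answer(prompts, question):
    votes = Counter(answer_with_prompt(p, question) for p in prompts)
    return votes.most_common(1)[0][0]

def boost_prompts(candidates, dataset, rounds=5):
    ensemble = []
    for _ in range(rounds):
        # Examples the current ensemble answers incorrectly.
        hard = [(q, y) for q, y in dataset
                if not ensemble or ensemble_answer(ensemble, q) != y]
        if not hard:
            break
        # Add the candidate prompt that fixes the most hard examples.
        best = max(candidates,
                   key=lambda p: sum(answer_with_prompt(p, q) == y for q, y in hard))
        ensemble.append(best)
    return ensemble
```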
PIFiA: Self-supervised Approach for Protein Functional Annotation from Single-Cell Imaging Data
Fluorescence microscopy data describe protein localization patterns at single-cell resolution and have the potential to reveal whole-proteome functional information with remarkable precision. Yet, extracting biologically meaningful represe…
Mastering Diverse Domains through World Models
Developing a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement learning algorithms can be readily applied to tasks s…
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Variational autoencoders (VAEs) are powerful tools for learning latent representations of data used in a wide range of applications. In practice, VAEs usually require multiple training rounds to choose the amount of information the latent …
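A hedged sketch of the train-once idea: sample the rate weight β afresh each batch and condition the VAE on it, so that a single trained model can be queried at any point on the rate-distortion curve. The feature-wise modulation on log β below is an illustrative conditioning mechanism, not necessarily the paper's; sizes and the sampling range are assumptions.

```python
# A beta-conditioned VAE trained across a range of rate weights at once.
import torch
import torch.nn as nn

class BetaConditionedVAE(nn.Module):
    def __init__(self, d_in=784, d_z=32, d_h=256):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(d_in, d_h), nn.ReLU())
        self.mu, self.logvar = nn.Linear(d_h, d_z), nn.Linear(d_h, d_z)
        self.dec = nn.Sequential(nn.Linear(d_z, d_h), nn.ReLU(), nn.Linear(d_h, d_in))
        self.film = nn.Linear(1, d_h)  # modulates encoder features by log(beta)

    def forward(self, x, log_beta):
        h = self.enc(x) * torch.sigmoid(self.film(log_beta))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()
        return self.dec(z), mu, logvar

def loss_fn(model, x):
    log_beta = torch.empty(x.size(0), 1).uniform_(-4, 2)  # sample a rate per example
    recon, mu, logvar = model(x, log_beta)
    rec = ((recon - x) ** 2).sum(dim=1)
    kl = 0.5 * (mu ** 2 + logvar.exp() - 1 - logvar).sum(dim=1)
    return (rec + log_beta.exp().squeeze(1) * kl).mean()  # beta-weighted ELBO
```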
Large Language Models Are Human-Level Prompt Engineers
By conditioning on natural language instructions, large language models (LLMs) have displayed impressive capabilities as general-purpose computers. However, task performance depends significantly on the quality of the prompt used to steer …
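A minimal sketch of searching over LLM-generated instructions: ask the model to infer the instruction behind a few input-output demonstrations, then score each candidate on the demos and keep the best. `llm` is a hypothetical completion function (stubbed here so the snippet runs), and both templates are illustrative assumptions rather than the paper's exact ones.

```python
# Propose-and-score search over candidate instructions.
def llm(prompt: str) -> str:
    # Hypothetical stand-in for a completion API; replace with a real client.
    return "Translate the word to French."

def propose_candidates(demos, n=8):
    examples = "\n".join(f"Input: {x} Output: {y}" for x, y in demos)
    return [llm("I gave a friend an instruction. Based on these examples,\n"
                f"{examples}\nthe instruction was:") for _ in range(n)]

def score(instruction, demos):
    # Execution accuracy of the candidate instruction on the demos.
    return sum(llm(f"{instruction}\nInput: {x}\nOutput:").strip() == y
               for x, y in demos)

def best_instruction(demos):
    return max(propose_candidates(demos), key=lambda c: score(c, demos))
```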
Exploring Low Rank Training of Deep Neural Networks
Training deep neural networks in low rank, i.e. with factorised layers, is of particular interest to the community: it offers efficiency over unfactorised training in terms of both memory consumption and training time. Prior work has focus…
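A minimal sketch of a factorised layer of the kind such work trains: the weight matrix W (d_out × d_in) is replaced by a product U V with inner rank r, cutting the parameter count from d_out·d_in to r·(d_in + d_out) when r is small. The initialization scaling below is an illustrative choice.

```python
# A low-rank (factorised) linear layer for end-to-end low-rank training.
import torch
import torch.nn as nn

class LowRankLinear(nn.Module):
    def __init__(self, d_in: int, d_out: int, rank: int):
        super().__init__()
        self.U = nn.Parameter(torch.randn(d_out, rank) / rank ** 0.5)
        self.V = nn.Parameter(torch.randn(rank, d_in) / d_in ** 0.5)
        self.bias = nn.Parameter(torch.zeros(d_out))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Two skinny matmuls instead of one full-rank one.
        return (x @ self.V.T) @ self.U.T + self.bias

# e.g. a 1024x1024 layer at rank 64 uses ~8x fewer weight parameters.
```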
Dataset Distillation using Neural Feature Regression
Dataset distillation aims to learn a small synthetic dataset that preserves most of the information from the original dataset. Dataset distillation can be formulated as a bi-level meta-learning problem where the outer loop optimizes the me…
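A hedged sketch of the feature-regression approach to this bi-level problem: solve the inner problem in closed form (ridge regression of labels on neural features of the synthetic set), then backpropagate the outer loss on real data into the synthetic examples. The tiny frozen feature net and single-network setup are simplifying assumptions of this sketch, not the paper's full method.

```python
# Bi-level dataset distillation with a closed-form (ridge) inner solve.
import torch
import torch.nn as nn

d, k, n_syn = 64, 10, 100
feat = nn.Sequential(nn.Linear(d, 128), nn.ReLU())  # feature extractor
feat.requires_grad_(False)                          # frozen here for simplicity
x_syn = nn.Parameter(torch.randn(n_syn, d))         # learnable synthetic data
y_syn = torch.eye(k).repeat(n_syn // k, 1)          # fixed balanced one-hot labels
opt = torch.optim.Adam([x_syn], lr=1e-2)

def outer_step(x_real, y_real, lam=1e-3):
    f_syn = feat(x_syn)                              # (n_syn, 128)
    # Inner problem: ridge-regression head fit on synthetic features.
    # torch.linalg.solve is differentiable, so gradients reach x_syn.
    A = f_syn.T @ f_syn + lam * torch.eye(f_syn.size(1))
    w = torch.linalg.solve(A, f_syn.T @ y_syn)       # (128, k)
    # Outer problem: how well that head classifies real data.
    logits = feat(x_real) @ w
    loss = nn.functional.cross_entropy(logits, y_real)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```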
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
Recently, methods such as Decision Transformer that reduce reinforcement learning to a prediction task and solve it via supervised learning (RvS) have become popular due to their simplicity, robustness to hyperparameters, and strong overal…
High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation
We study the first gradient descent step on the first-layer parameters $\boldsymbol{W}$ in a two-layer neural network: $f(\boldsymbol{x}) = \frac{1}{\sqrt{N}}\boldsymbol{a}^\top \sigma(\boldsymbol{W}^\top\boldsymbol{x})$, where $\boldsymbol{W}\in…
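For concreteness, a worked form of the setting under standard assumptions: one gradient step on the first-layer weights with the second layer fixed. The squared loss and the step size $\eta$ (and its scaling) are assumptions of this sketch, since the truncated abstract does not specify them.

```latex
% Two-layer network with first-layer weights W updated by a single gradient
% step; loss and step-size scaling are illustrative assumptions.
f(\boldsymbol{x}) = \tfrac{1}{\sqrt{N}}\,\boldsymbol{a}^\top \sigma(\boldsymbol{W}^\top \boldsymbol{x}),
\qquad
\boldsymbol{W}^{+} = \boldsymbol{W} - \eta\, \nabla_{\boldsymbol{W}}
\frac{1}{n}\sum_{i=1}^{n} \big(f(\boldsymbol{x}_i) - y_i\big)^2 .
```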
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Deep Reinforcement Learning (RL) is successful in solving many complex Markov Decision Process (MDP) problems. However, agents often face unanticipated environmental changes after deployment in the real world. These changes are often sp…
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
In learning-assisted theorem proving, one of the most critical challenges is to generalize to theorems unlike those seen at training time. In this paper, we introduce INT, an INequality Theorem proving benchmark designed to test agents’ ge…
View article: Clockwork Variational Autoencoders
Clockwork Variational Autoencoders Open
Deep learning has enabled algorithms to generate realistic images. However, accurately predicting long video sequences requires understanding long-term dependencies and remains an open challenge. While existing video prediction models succ…