Michael Carbin
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
We introduce FreshStack, a holistic framework for automatically building information retrieval (IR) evaluation benchmarks by incorporating challenging questions and answers. FreshStack conducts the following steps: (1) automatic corpus col…
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding
Decoding with autoregressive large language models (LLMs) traditionally occurs sequentially, generating one token after another. An emerging line of work explored parallel decoding by identifying and simultaneously generating semantically …
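As a toy illustration of the contrast the abstract draws (the `next_token` function below is a hypothetical deterministic stand-in for an LLM forward pass, not the paper's method), sequential decoding conditions every token on all previous ones, while semantically independent branches can be decoded without waiting on each other:

```python
# Toy contrast between sequential and parallel decoding. `next_token`
# is a hypothetical stand-in for an LLM forward pass: it "predicts"
# the length of the prefix it is given.
def next_token(prefix):
    return len(prefix)

def decode_sequential(prompt, n):
    """Autoregressive decoding: each new token conditions on all prior ones."""
    out = list(prompt)
    for _ in range(n):
        out.append(next_token(out))
    return out[len(prompt):]

def decode_two_branches(prompt, branch_a, branch_b):
    """Parallel decoding sketch: two semantically independent continuations
    are decoded without conditioning on each other, then concatenated."""
    return (decode_sequential(prompt + branch_a, 2)
            + decode_sequential(prompt + branch_b, 2))

print(decode_sequential([1, 2, 3], 3))  # → [3, 4, 5]
```

In the sequential case each call to `next_token` must wait for the previous token; the two branches in `decode_two_branches` have no such dependence and could run concurrently.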
Inference Plans for Hybrid Particle Filtering
Advanced probabilistic programming languages (PPLs) using hybrid particle filtering combine symbolic exact inference and Monte Carlo methods to improve inference performance. These systems use heuristics to partition random variables withi…
Drowning in Documents: Consequences of Scaling Reranker Inference
Rerankers, typically cross-encoders, are computationally intensive but are frequently used because they are widely assumed to outperform cheaper initial IR systems. We challenge this assumption by measuring reranker performance for full re…
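The two-stage pipeline the abstract refers to can be sketched with toy scoring functions (real systems use BM25 or embeddings for the first stage and a learned cross-encoder for reranking; everything below is illustrative only):

```python
# Two-stage retrieval with toy scorers: a cheap first stage selects
# candidates, an expensive "reranker" reorders them.
def first_stage(query, corpus, k):
    """Cheap scorer: rank by raw word overlap with the query."""
    q = set(query.split())
    return sorted(corpus, key=lambda d: len(q & set(d.split())),
                  reverse=True)[:k]

def rerank(query, candidates):
    """Toy 'cross-encoder': scores query and document jointly,
    here by Jaccard similarity rather than a learned model."""
    def score(d):
        q, w = set(query.split()), set(d.split())
        return len(q & w) / len(q | w)
    return sorted(candidates, key=score, reverse=True)

corpus = ["deep learning for retrieval",
          "retrieval of deep sea fish",
          "cooking with fish"]
top = rerank("deep learning retrieval",
             first_stage("deep learning retrieval", corpus, 2))
print(top[0])  # → deep learning for retrieval
```

The paper's question is what happens when the reranking stage is scaled to ever-larger candidate sets from the first stage.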
Long Context RAG Performance of Large Language Models
Retrieval Augmented Generation (RAG) has emerged as a crucial technique for enhancing the accuracy of Large Language Models (LLMs) by incorporating external information. With the advent of LLMs that support increasingly longer context leng…
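A minimal sketch of the RAG pattern, assuming a toy word-overlap retriever in place of a real embedding model, with the prompt assembly left explicit and the generator itself omitted:

```python
# Minimal RAG sketch: retrieve supporting documents, then prepend
# them to the prompt before generation.
def retrieve(query, corpus, k=2):
    """Rank documents by word overlap with the query; return the top k."""
    q = set(query.lower().split())
    return sorted(corpus,
                  key=lambda d: len(q & set(d.lower().split())),
                  reverse=True)[:k]

def build_prompt(query, corpus, k=2):
    """Longer context windows allow larger k, the regime the paper studies."""
    context = "\n".join(retrieve(query, corpus, k))
    return f"Context:\n{context}\n\nQuestion: {query}"

docs = ["Paris is the capital of France.",
        "The Nile is a river in Africa.",
        "France borders Spain."]
print(build_prompt("what is the capital of france", docs, k=1))
```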
Learning to Compile Programs to Neural Networks
A neural surrogate of a program is a neural network that mimics the behavior of a program. Researchers have used these neural surrogates to automatically tune program inputs, adapt programs to new settings, and accelerate comput…
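As a toy illustration of the idea (a one-weight linear "surrogate" of a trivial program; real surrogates are neural networks trained on measurements of large programs), one can fit a model to a program's input/output behavior:

```python
# Toy surrogate: fit a linear model w*x + b to measurements of a
# (here trivial) program via gradient descent. Real neural surrogates
# replace the linear model with a network and the toy program with,
# e.g., a large-scale simulator.
def program(x):
    return 2.0 * x + 1.0   # hypothetical stand-in for an expensive program

data = [(x, program(x)) for x in range(-5, 6)]  # input/output measurements
w, b, lr = 0.0, 0.0, 0.01
for _ in range(2000):
    for x, y in data:
        err = (w * x + b) - y   # surrogate's error on one measurement
        w -= lr * err * x       # gradient step for the weight
        b -= lr * err           # gradient step for the bias
print(round(w, 2), round(b, 2))  # converges near the program's 2 and 1
```

Once trained, the surrogate can answer queries about the program cheaply, and because it is differentiable it can also be used to tune program inputs by gradient descent.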
The T-Complexity Costs of Error Correction for Control Flow in Quantum Computation
Numerous quantum algorithms require the use of quantum error correction to overcome the intrinsic unreliability of physical qubits. However, quantum error correction imposes a unique performance bottleneck, known as T-complexity, that can make an…
Distributions for Compositionally Differentiating Parametric Discontinuities
Computations in physical simulation, computer graphics, and probabilistic inference often require the differentiation of discontinuous processes due to contact, occlusion, and changes at a point in time. Popular differentiable programming …
Quantum Control Machine: The Limits of Control Flow in Quantum Programming
Quantum algorithms for tasks such as factorization, search, and simulation rely on control flow such as branching and iteration that depends on the value of data in superposition. High-level programming abstractions for control flow, such …
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Models such as GPT-4 and Med-PaLM 2 have demonstrated impressive performance on a wide variety of biomedical NLP tasks. However, these models have hundreds of billions of parameters, are computationally expensive to run, require users to s…
Turaco: Complexity-Guided Data Sampling for Training Neural Surrogates of Programs
Programmers and researchers are increasingly developing surrogates of programs, models of a subset of the observable behavior of a given program, to solve a variety of software development challenges. Programmers train surrogates from meas…
The Cost of Down-Scaling Language Models: Fact Recall Deteriorates before In-Context Learning
How does scaling the number of parameters in large language models (LLMs) affect their core capabilities? We study two natural scaling techniques -- weight pruning and simply training a smaller or larger model, which we refer to as dense s…
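The first of the two techniques, weight pruning, can be sketched as global magnitude pruning (a standard formulation, not necessarily the exact variant the paper studies):

```python
# Global magnitude pruning sketch: zero the fraction `sparsity` of
# weights with smallest absolute value (ties may drop slightly more).
def prune(weights, sparsity):
    n_drop = int(len(weights) * sparsity)
    if n_drop == 0:
        return list(weights)
    threshold = sorted(abs(w) for w in weights)[n_drop - 1]
    return [0.0 if abs(w) <= threshold else w for w in weights]

w = [0.9, -0.05, 0.4, 0.01, -0.7]
print(prune(w, 0.4))  # → [0.9, 0.0, 0.4, 0.0, -0.7]
```

Dense scaling, by contrast, simply trains a model with fewer (or more) parameters from the start; the paper compares how the two affect fact recall versus in-context learning.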
Verifying Performance Properties of Probabilistic Inference
In this extended abstract, we discuss the opportunity to formally verify that inference systems for probabilistic programming guarantee good performance. In particular, we focus on hybrid inference systems that combine exact and approximat…
Computably Continuous Reinforcement-Learning Objectives Are PAC-Learnable
In reinforcement learning, the classic objectives of maximizing discounted and finite-horizon cumulative rewards are PAC-learnable: There are algorithms that learn a near-optimal policy with high probability using a finite amount of sample…
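The two classic objectives the abstract names have standard definitions (written here from textbook convention, not taken from the paper), for a policy \pi, per-step rewards r_t, discount factor \gamma, and horizon H:

```latex
J_{\gamma}(\pi) = \mathbb{E}_{\pi}\left[\sum_{t=0}^{\infty} \gamma^{t} r_{t}\right], \quad 0 \le \gamma < 1,
\qquad
J_{H}(\pi) = \mathbb{E}_{\pi}\left[\sum_{t=0}^{H-1} r_{t}\right].
```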
Acela: Predictable Datacenter-level Maintenance Job Scheduling
Datacenter operators ensure fair and regular server maintenance by using automated processes to schedule maintenance jobs to complete within a strict time budget. Automating this scheduling problem is challenging because maintenance job du…
Tower: data structures in quantum superposition
Emerging quantum algorithms for problems such as element distinctness, subset sum, and closest pair demonstrate computational advantages by relying on abstract data structures. Practically realizing such an algorithm as a program for a qua…
Semi-symbolic inference for efficient streaming probabilistic programming
A streaming probabilistic program receives a stream of observations and produces a stream of distributions that are conditioned on these observations. Efficient inference is often possible in a streaming context using Rao-Blackwellized particle filters (RBPFs), which exactly solve inference problems when possible and fall back on sampling approximations when necessary. While RBPFs can be…
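The core RBPF idea, solving exactly in closed form when possible and falling back to sampling otherwise, can be sketched with a single Gaussian state (a toy, not the paper's semi-symbolic system):

```python
import random

# Toy of the RBPF / semi-symbolic idea: keep a distribution in closed
# form while updates stay conjugate (a Gaussian mean under Gaussian
# observation noise), and collapse it to a sampled particle only when
# an operation forces an approximation.
class Gaussian:
    def __init__(self, mean, var):
        self.mean, self.var = mean, var

    def observe(self, y, noise_var):
        """Exact conjugate update for an observation y ~ Normal(x, noise_var)."""
        k = self.var / (self.var + noise_var)   # Kalman gain
        return Gaussian(self.mean + k * (y - self.mean), (1 - k) * self.var)

    def sample(self, rng):
        """Fallback: approximate the symbolic form with a concrete particle."""
        return rng.gauss(self.mean, self.var ** 0.5)

prior = Gaussian(0.0, 1.0)
post = prior.observe(2.0, 1.0)   # stays exact: no sampling needed
print(post.mean, post.var)       # → 1.0 0.5
particle = post.sample(random.Random(0))  # used only when exactness fails
```

Keeping the state symbolic for as long as possible reduces sampling variance, which is why inference systems prefer the exact path whenever it applies.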
Pruning's Effect on Generalization Through the Lens of Training and Regularization
Practitioners frequently observe that pruning improves model generalization. A long-standing hypothesis based on bias-variance trade-off attributes this generalization improvement to model size reduction. However, recent studies on over-pa…
On the (In)Tractability of Reinforcement Learning for LTL Objectives
In recent years, researchers have made significant progress in devising reinforcement-learning algorithms for optimizing linear temporal logic (LTL) objectives and LTL-like objectives. Despite these advancements, there are fundamental limi…
SCOPE: Safe Exploration for Dynamic Computer Systems Optimization
Modern computer systems need to execute under strict safety constraints (e.g., a power limit), but doing so often conflicts with their ability to deliver high performance (i.e. minimal latency). Prior work uses machine learning to automati…
Cello: Efficient Computer Systems Optimization with Predictive Early Termination and Censored Regression
Sample-efficient machine learning (SEML) has been widely applied to find optimal latency and power tradeoffs for configurable computer systems. Instead of randomly sampling from the configuration space, SEML reduces the search cost by dram…
Twist: sound reasoning for purity and entanglement in quantum programs
Quantum programming languages enable developers to implement algorithms for quantum computers that promise computational breakthroughs in classically intractable tasks. Programming quantum computers requires awareness of entanglement, the…
Checking Bounded-Memory Execution for Delayed Sampling on Probabilistic Streams
Programming with neural surrogates of programs
Surrogates, models that mimic the behavior of programs, form the basis of a variety of development workflows. We study three surrogate-based design patterns, evaluating each in case studies on a large-scale CPU simulator. With surrogat…
Generalizable and interpretable learning for configuration extrapolation
Modern software applications are increasingly configurable, which puts a burden on users to tune these configurations for their target hardware and workloads. To help users, machine learning techniques can model the complex relationships b…