Explanipedia

Quantum Probe Tomography Open

Sitan Chen, Jordan Cotler, Hsin-Yuan Huang · 2025

Characterizing quantum many-body systems is a fundamental problem across physics, chemistry, and materials science. While significant progress has been made, many existing Hamiltonian learning protocols demand digital quantum control over …

Efficient Pauli Channel Estimation with Logarithmic Quantum Memory Open

Sitan Chen, Weiyuan Gong · 2025

In this work, we consider one of the prototypical tasks for characterizing the structure of noise in quantum devices: estimating eigenvalues of an n-qubit Pauli noise channel. Prior work [Chen , Phys. Rev. A 105, 032435 (2022)] has proved …

Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions Open

Jaeyeon Kim, Kulin Shah, Vasilis Kontonis, Sham M. Kakade, Sitan Chen · 2025

Computer science Geography

In recent years, masked diffusion models (MDMs) have emerged as a promising alternative approach for generative modeling over discrete domains. Compared to autoregressive models (ARMs), MDMs trade off complexity at training time with flexi…

Blink of an eye: a simple theory for feature localization in generative models Open

Man Li, Aayush Karan, Sitan Chen · 2025

Computer science Psychology Philosophy

Large language models can exhibit unexpected behavior in the blink of an eye. In a recent computer use demo, a language model switched from coding to Googling pictures of Yellowstone, and these sudden shifts in behavior have also been obse…

Gradient dynamics for low-rank fine-tuning beyond kernels Open

Arif Kerem Dayi, Sitan Chen · 2024

Mathematics Computer science Physics

LoRA has emerged as one of the de facto methods for fine-tuning foundation models with low computational cost and memory footprint. The idea is to only train a low-rank perturbation to the weights of a pre-trained model, given supervised d…

Unrolled denoising networks provably learn optimal Bayesian inference Open

Aayush Karan, Kulin Shah, Sitan Chen, Yonina C. Eldar · 2024

Computer science

Much of Bayesian inference centers around the design of estimators for inverse problems which are optimal assuming the data comes from a known prior. But what do these optimality guarantees mean if the prior is unknown? In recent years, al…

What does guidance do? A fine-grained analysis in a simple setting Open

M. Chidambaram, Khashayar Gatmiry, Sitan Chen, Holden Lee, Jianfeng Lu · 2024

Computer science Philosophy

The use of guidance in diffusion models was originally motivated by the premise that the guidance-modified score is that of the data distribution tilted by a conditional likelihood raised to some power. In this work we clarify this misconc…

Predicting quantum channels over general product distributions Open

Sitan Chen, J de Pont, Jun-Ting Hsieh, Hsin-Yuan Huang, Jane Lange , et al. · 2024

Computer science Mathematics Physics

We investigate the problem of predicting the output behavior of unknown quantum channels. Given query access to an $n$-qubit channel $E$ and an observable $O$, we aim to learn the mapping \begin{equation*} ρ\mapsto \mathrm{Tr}(O E[ρ]) \end…

Faster Diffusion Sampling with Randomized Midpoints: Sequential and Parallel Open

Shivam Gupta, Linda Cai, Sitan Chen · 2024

Computer science Mathematics Physics

Sampling algorithms play an important role in controlling the quality and runtime of diffusion model inference. In recent years, a number of works~\cite{chen2023sampling,chen2023ode,benton2023error,lee2022convergence} have proposed schemes…

Optimal tradeoffs for estimating Pauli observables Open

Sitan Chen, Weiyuan Gong, Qi Ye · 2024

Physics Economics

We revisit the problem of Pauli shadow tomography: given copies of an unknown $n$-qubit quantum state $ρ$, estimate $\text{tr}(Pρ)$ for some set of Pauli operators $P$ to within additive error $ε$. This has been a popular testbed for explo…

Learning general Gaussian mixtures with efficient score matching Open

Sitan Chen, Vasilis Kontonis, Kulin Shah · 2024

Mathematics Computer science Chemistry

We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions. We make no separation assumptions on the underlying mixture components: we only require that the covariance matrices have bounded condition number and that the m…

Critical windows: non-asymptotic theory for feature emergence in diffusion models Open

M.H. Li, Sitan Chen · 2024

Computer science Mathematics Physics

We develop theory to understand an intriguing property of diffusion models for image generation that we term critical windows. Empirically, it has been observed that there are narrow time intervals in sampling during which particular featu…

An optimal tradeoff between entanglement and copy complexity for state tomography Open

Sitan Chen, Jerry Li, Allen Liu · 2024

Computer science Physics

There has been significant interest in understanding how practical constraints on contemporary quantum devices impact the complexity of quantum learning. For the classic question of tomography, recent work tightly characterized the copy co…

Provably learning a multi-head attention layer Open

Sitan Chen, Yuanzhi Li · 2024

Computer science Psychology Materials science

The multi-head attention layer is one of the key components of the transformer architecture that sets it apart from traditional feed-forward models. Given a sequence length $k$, attention matrices $\mathbfΘ_1,\ldots,\mathbfΘ_m\in\mathbb{R}…

Learning to Predict Arbitrary Quantum Processes Open

Hsin-Yuan Huang, Sitan Chen, John Preskill · 2023

Computer science Mathematics Physics

We present an efficient machine-learning (ML) algorithm for predicting any unknown quantum process E over n qubits. For a wide range of distributions D on arbitrary n-qubit states, we show that this ML algorithm can learn to predict any lo…

Efficient Pauli channel estimation with logarithmic quantum memory Open

Sitan Chen, Weiyuan Gong · 2023

Physics Computer science Mathematics

Here we revisit one of the prototypical tasks for characterizing the structure of noise in quantum devices: estimating every eigenvalue of an $n$-qubit Pauli noise channel to error $ε$. Prior work [14] proved no-go theorems for this task i…

A faster and simpler algorithm for learning shallow networks Open

Sitan Chen, Shyam Narayanan · 2023

Computer science Mathematics Physics

We revisit the well-studied problem of learning a linear combination of $k$ ReLU activations given labeled examples drawn from the standard $d$-dimensional Gaussian measure. Chen et al. [CDG+23] recently gave the first algorithm for this p…

Learning Mixtures of Gaussians Using the DDPM Objective Open

Kulin Shah, Sitan Chen, Adam R. Klivans · 2023

Computer science Mathematics Physics

Recent works have shown that diffusion models can learn essentially any distribution provided one can perform score estimation. Yet it remains poorly understood under what settings score estimation is possible, let alone when practical gra…

Learning Polynomial Transformations via Generalized Tensor Decompositions Open

Sitan Chen, Jerry Li, Yuanzhi Li, Anru R. Zhang · 2023

Mathematics Computer science Philosophy

We consider the problem of learning high dimensional polynomial transformations of Gaussians. Given samples of the form f(x), where x∼N(0,Ir) is hidden and f: ℝr → ℝd is a function where every output coordinate is a low-degree polynomial, …

Learning Narrow One-Hidden-Layer ReLU Networks Open

Sitan Chen, Zehao Dou, Surbhi Goel, Adam R Klivans, Raghu Meka · 2023

Computer science Mathematics Physics

We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the first polynomial-time algorithm that succeeds whenever $k$ is a …

Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers Open

Sitan Chen, Giannis Daras, Alexandros G. Dimakis · 2023

Mathematics Computer science Economics

We develop a framework for non-asymptotic analysis of deterministic samplers used for diffusion generative modeling. Several recent works have analyzed stochastic samplers using tools like Girsanov's theorem and a chain rule variant of the…

Flowers: precious food and medicine resources Open

Xuqiang Liu, Senye Wang, Lili Cui, Huihui Zhou, Yuhang Liu , et al. · 2022

Biology Chemistry Medicine

Flower plants are popular all over the world and important sources of ornamental plants, bioactive molecules and nutrients. Flowers have a wide range of biological activities and beneficial pharmacological effects. Flowers and their active…

Learning to predict arbitrary quantum processes Open

Hsin-Yuan Huang, Sitan Chen, John Preskill · 2022

Mathematics Computer science Physics

We present an efficient machine learning (ML) algorithm for predicting any unknown quantum process $\mathcal{E}$ over $n$ qubits. For a wide range of distributions $\mathcal{D}$ on arbitrary $n$-qubit states, we show that this ML algorithm…

The Complexity of NISQ Open

Sitan Chen, Jordan Cotler, Hsin-Yuan Huang, Jerry Li · 2022

Computer science Mathematics Physics

The recent proliferation of NISQ devices has made it imperative to understand their computational power. In this work, we define and study the complexity class $\textsf{NISQ} $, which is intended to encapsulate problems that can be efficie…

When Does Adaptivity Help for Quantum State Learning? Open

Sitan Chen, Brice Huang, Jerry Li, Allen Liu, Mark Sellke · 2022

Computer science Mathematics Physics

We consider the classic question of state tomography: given copies of an unknown quantum state $ρ\in\mathbb{C}^{d\times d}$, output $\widehatρ$ which is close to $ρ$ in some sense, e.g. trace distance or fidelity. When one is allowed to ma…

Quantum advantage in learning from experiments Open

Hsin-Yuan Huang, Michael Broughton, Jordan Cotler, Sitan Chen, Jerry Li , et al. · 2022

Computer science Physics

Quantum technology promises to revolutionize how we learn about the physical world. An experiment that processes quantum data with a quantum computer could have substantial advantages over conventional experiments in which quantum states a…

Kalman filtering with adversarial corruptions Open

Sitan Chen, Frederic Koehler, Ankur Moitra, Morris Yau · 2022

Computer science Mathematics Physics

Here we revisit the classic problem of linear quadratic estimation, i.e. estimating the trajectory of a linear dynamical system from noisy measurements. The celebrated Kalman filter gives an optimal estimator when the measurement noise is …

Sitan Chen YOU? Author Swipe