Frederic Koehler
Efficiently learning and sampling multimodal distributions with data-based initialization
We consider the problem of sampling a multimodal distribution with a Markov chain given a small number of samples from the stationary measure. Although mixing can be arbitrarily slow, we show that if the Markov chain has a $k$th order spec…
Trickle-Down in Localization Schemes and Applications
Trickle-down is a phenomenon in high-dimensional expanders with many important applications — for example, it is a key ingredient in various constructions of high-dimensional expanders or the proof of rapid mixing for the basis exchange wa…
Influences in Mixing Measures
The theory of influences in product measures has profound applications in theoretical computer science, combinatorics, and discrete probability. This deep theory is intimately connected to functional inequalities and to the Fourier analysi…
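As background, for the uniform measure on $\{\pm 1\}^n$ the influence of coordinate $i$ on a Boolean function $f$ is standardly defined as

    $$\mathrm{Inf}_i(f) = \Pr_{x}\big[f(x) \neq f(x^{\oplus i})\big],$$

where $x^{\oplus i}$ denotes $x$ with its $i$th coordinate flipped; the question above is how this theory, developed for product measures, extends to general mixing measures.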
Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting
A common network inference problem, arising from real-world data constraints, is how to infer a dynamic network from its time-aggregated adjacency matrix and time-varying marginals (i.e., row and column sums). Prior approaches to this prob…
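As a reminder of the basic primitive, classical IPF alternately rescales the rows and columns of a nonnegative matrix so its marginals match the targets; a minimal NumPy sketch (names and the fixed iteration count are illustrative choices, not from the paper):

    import numpy as np

    def ipf(A, row_sums, col_sums, n_iters=100):
        # Iterative proportional fitting: alternately rescale rows and columns of a
        # nonnegative matrix A so its row/column sums match the given marginals.
        # Assumes every row and column of A carries positive mass.
        X = A.astype(float).copy()
        for _ in range(n_iters):
            X *= (row_sums / X.sum(axis=1))[:, None]   # match row marginals
            X *= (col_sums / X.sum(axis=0))[None, :]   # match column marginals
        return X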
Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps
It is well-known that the statistical performance of Lasso can suffer significantly when the covariates of interest have strong correlations. In particular, the prediction error of Lasso becomes much worse than computationally inefficient …
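For orientation, the baseline estimator is the ordinary Lasso on a correlated Gaussian design; a toy sketch in which correlation is induced by a shared latent factor (this construction and all parameter values are illustrative assumptions, not the paper's setting):

    import numpy as np
    from sklearn.linear_model import Lasso

    rng = np.random.default_rng(0)
    n, d, rho = 200, 400, 0.9
    # Equicorrelated covariates: each coordinate mixes a shared latent z with fresh noise.
    z = rng.standard_normal((n, 1))
    X = np.sqrt(rho) * z + np.sqrt(1 - rho) * rng.standard_normal((n, d))
    beta = np.zeros(d); beta[:5] = 1.0                  # 5-sparse ground-truth signal
    y = X @ beta + rng.standard_normal(n)

    model = Lasso(alpha=0.1).fit(X, y)
    print("in-sample prediction error:",
          np.mean((X @ model.coef_ + model.intercept_ - X @ beta) ** 2))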
Optimistic Rates: A Unifying Theory for Interpolation Learning and Regularization in Linear Regression
We study a localized notion of uniform convergence known as an “optimistic rate” (Panchenko 2002; Srebro et al. 2010) for linear regression with Gaussian data. Our refined analysis avoids the hidden constant and logarithmic factor in existing results, which are kn…
Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization
There is a long history, as well as a recent explosion of interest, in statistical and generative modeling approaches based on score functions -- derivatives of the log-likelihood of a distribution. In seminal works, Hyvärinen proposed van…
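The procedure being analyzed is, at its core, Langevin dynamics driven by a score function and started from training samples rather than a generic initialization; a minimal sketch assuming a score oracle score(x) (the step size and iteration count are arbitrary illustrative choices):

    import numpy as np

    def langevin_from_data(score, data, step=1e-3, n_steps=5000, rng=None):
        # Unadjusted Langevin dynamics x <- x + step * score(x) + sqrt(2*step) * noise,
        # run with one chain per training point (data-based initialization).
        rng = np.random.default_rng() if rng is None else rng
        x = np.array(data, dtype=float)
        for _ in range(n_steps):
            x = x + step * score(x) + np.sqrt(2 * step) * rng.standard_normal(x.shape)
        return x

    # Toy usage: for a standard Gaussian target, the score is simply -x.
    samples = langevin_from_data(lambda x: -x, np.random.default_rng(0).standard_normal((100, 2)))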
Universality of Spectral Independence with Applications to Fast Mixing in Spin Glasses
We study Glauber dynamics for sampling from discrete distributions $\mu$ on the hypercube $\{\pm 1\}^n$. Recently, techniques based on spectral independence have successfully yielded optimal $O(n)$ relaxation times for a host of different di…
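Concretely, a single step of single-site Glauber dynamics for $\mu(x) \propto \exp(\tfrac{1}{2} x^\top J x + h^\top x)$ resamples one uniformly random spin from its conditional law; a sketch (the function name is made up for illustration):

    import numpy as np

    def glauber_step(x, J, h, rng):
        # Resample one uniformly random spin of x in {-1,+1}^n from its conditional
        # distribution under mu(x) ∝ exp(x^T J x / 2 + h^T x), with J symmetric.
        i = rng.integers(len(x))
        field = J[i] @ x - J[i, i] * x[i] + h[i]          # local field from the other spins
        p_plus = 1.0 / (1.0 + np.exp(-2.0 * field))       # P(x_i = +1 | rest)
        x[i] = 1 if rng.random() < p_plus else -1
        return x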
Uniform Convergence with Square-Root Lipschitz Loss
We establish generic uniform convergence guarantees for Gaussian data in terms of the Rademacher complexity of the hypothesis class and the Lipschitz constant of the square root of the scalar loss function. We show how these guarantees sub…
Feature Adaptation for Sparse Linear Regression
Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small excess r…
A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models
We prove a new generalization bound that shows for any class of linear predictors in Gaussian space, the Rademacher complexity of the class and the training error under any continuous loss $\ell$ can control the test error under all Moreau…
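For reference, the Moreau envelope of a loss $\ell$ at smoothing level $\lambda > 0$ is the standard inf-convolution

    $$\ell_\lambda(t) = \inf_{u}\Big\{ \ell(u) + \tfrac{1}{2\lambda}(t-u)^2 \Big\},$$

a smoothed lower bound on $\ell$; the bound above controls the test error simultaneously over this family of smoothed losses.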
Statistical Efficiency of Score Matching: The View from Isoperimetry
Deep generative models parametrized up to a normalizing constant (e.g. energy-based models) are difficult to train by maximizing the likelihood of the data because the likelihood and/or gradients thereof cannot be explicitly or efficiently…
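The objective in question is Hyvärinen's score matching loss, which after integration by parts can be written, up to an additive constant independent of $\theta$, as

    $$J(\theta) = \mathbb{E}_{x \sim p_{\mathrm{data}}}\Big[ \tfrac{1}{2}\,\|s_\theta(x)\|^2 + \operatorname{tr}\big(\nabla_x s_\theta(x)\big) \Big],$$

where $s_\theta = \nabla_x \log p_\theta$ is the model score; this is why such models can be trained without ever computing the normalizing constant.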
Kalman Filtering with Adversarial Corruptions
Here we revisit the classic problem of linear quadratic estimation, i.e. estimating the trajectory of a linear dynamical system from noisy measurements. The celebrated Kalman filter gives an optimal estimator when the measurement noise is …
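For reference, the estimator being robustified is the classical Kalman filter for $x_{t+1} = A x_t + w_t$, $y_t = C x_t + v_t$ with Gaussian $w_t, v_t$; a textbook sketch (matrix names are generic, not tied to the paper):

    import numpy as np

    def kalman_filter(ys, A, C, Q, R, x0, P0):
        # Standard Kalman filter: predict with the dynamics, then correct with each
        # observation y_t; returns the sequence of filtered state estimates.
        x, P = x0, P0
        estimates = []
        for y in ys:
            # predict step
            x = A @ x
            P = A @ P @ A.T + Q
            # update step
            S = C @ P @ C.T + R
            K = P @ C.T @ np.linalg.inv(S)
            x = x + K @ (y - C @ x)
            P = (np.eye(len(x)) - K @ C) @ P
            estimates.append(x)
        return np.array(estimates)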
Distributional Hardness Against Preconditioned Lasso via Erasure-Robust Designs
Sparse linear regression with ill-conditioned Gaussian random designs is widely believed to exhibit a statistical/computational gap, but there is surprisingly little formal evidence for this belief, even in the form of examples that are ha…
Sampling Approximately Low-Rank Ising Models: MCMC meets Variational Methods
We consider Ising models on the hypercube with a general interaction matrix $J$, and give a polynomial time sampling algorithm when all but $O(1)$ eigenvalues of $J$ lie in an interval of length one, a situation which occurs in many models…
Double Balanced Sets in High Dimensional Expanders
Recent works have shown that expansion of pseudorandom sets is of great importance. However, all current works on pseudorandom sets are limited only to product (or approximate product) spaces, where Fourier Analysis methods could be applie…
Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Variational Autoencoders are one of the most commonly used generative models, particularly for image data. A prominent difficulty in training VAEs is data that is supported on a lower-dimensional manifold. Recent work by Dai and Wipf (2020…
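For background, VAE training maximizes the evidence lower bound (ELBO)

    $$\mathcal{L}(x) = \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big] - \mathrm{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big),$$

and the landscape and implicit-bias questions above concern what this objective does when the data concentrate near a low-dimensional manifold.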
Entropic Independence II: Optimal Sampling and Concentration via Restricted Modified Log-Sobolev Inequalities
We introduce a framework for obtaining tight mixing times for Markov chains based on what we call restricted modified log-Sobolev inequalities. Modified log-Sobolev inequalities (MLSI) quantify the rate of relative entropy contraction for …
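As a reminder of the unrestricted notion (one standard continuous-time formulation): a reversible chain with Dirichlet form $\mathcal{E}$ and stationary measure $\mu$ satisfies an MLSI with constant $\rho$ if

    $$\rho \,\operatorname{Ent}_\mu(f) \le \mathcal{E}\big(f, \log f\big) \qquad \text{for all } f \ge 0,$$

which gives exponential decay of relative entropy, $\operatorname{Ent}_\mu(P_t f) \le e^{-\rho t}\operatorname{Ent}_\mu(f)$, and hence mixing-time bounds; roughly speaking, the restricted variant studied here demands this only for a restricted class of densities $f$.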
Multidimensional Scaling: Approximation and Complexity
Metric Multidimensional scaling (MDS) is a classical method for generating meaningful (non-linear) low-dimensional embeddings of high-dimensional data. MDS has a long history in the statistics, machine learning, and graph drawing communiti…
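Concretely, metric MDS looks for an embedding $z_1,\dots,z_n$ minimizing a stress objective such as $\sum_{i<j}\big(\lVert z_i - z_j\rVert - D_{ij}\big)^2$ for a given dissimilarity matrix $D$; a minimal scikit-learn usage sketch (the random input data is purely illustrative):

    import numpy as np
    from sklearn.manifold import MDS

    rng = np.random.default_rng(0)
    points = rng.standard_normal((50, 10))                                  # toy high-dimensional data
    D = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)    # pairwise distances

    # Metric MDS on the precomputed dissimilarity matrix, embedding into the plane.
    embedding = MDS(n_components=2, dissimilarity="precomputed", random_state=0).fit_transform(D)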
Reconstruction on Trees and Low-Degree Polynomials
The study of Markov processes and broadcasting on trees has deep connections to a variety of areas including statistical physics, graphical models, phylogenetic reconstruction, Markov Chain Monte Carlo, and community detection in random gr…
On the Power of Preconditioning in Sparse Linear Regression
Sparse linear regression is a fundamental problem in high-dimensional statistics, but strikingly little is known about how to efficiently solve it without restrictive conditions on the design matrix. We consider the (correlated) random des…
Uniform Convergence of Interpolators: Gaussian Width, Norm Bounds, and Benign Overfitting
We consider interpolation learning in high-dimensional linear regression with Gaussian data, and prove a generic uniform convergence guarantee on the generalization error of interpolators in an arbitrary hypothesis class in terms of the cl…
Entropic Independence I: Modified Log-Sobolev Inequalities for Fractionally Log-Concave Distributions and High-Temperature Ising Models
We introduce a notion called entropic independence that is an entropic analog of spectral notions of high-dimensional expansion. Informally, entropic independence of a background distribution $\mu$ on $k$-sized subsets of a ground set of ele…
Entropic Independence in High-Dimensional Expanders: Modified Log-Sobolev Inequalities for Fractionally Log-Concave Polynomials and the Ising Model
We introduce a notion called entropic independence for distributions $\mu$ defined on pure simplicial complexes, i.e., subsets of size $k$ of a ground set of elements. Informally, we call a background measure $\mu$ entropically independent…
Chow-Liu++: Optimal Prediction-Centric Learning of Tree Ising Models
We consider the problem of learning a tree-structured Ising model from data, such that subsequent predictions computed using the model are accurate. Concretely, we aim to learn a model such that posteriors $P(X_i|X_S)$ for small sets of va…
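For context, the classical Chow-Liu algorithm that Chow-Liu++ builds on fits the tree as a maximum-weight spanning tree of the empirical pairwise mutual informations; a sketch for $\pm 1$-valued data (this is the classical baseline, not the new algorithm in the paper):

    import numpy as np
    import networkx as nx
    from itertools import combinations

    def chow_liu_tree(X):
        # X: (n_samples, d) array with entries in {-1, +1}.
        # Estimate pairwise mutual informations and return the edges of a
        # maximum-weight spanning tree on the mutual-information graph.
        n, d = X.shape
        G = nx.Graph()
        for i, j in combinations(range(d), 2):
            mi = 0.0
            for a in (-1, 1):
                for b in (-1, 1):
                    p_ab = np.mean((X[:, i] == a) & (X[:, j] == b))
                    p_a, p_b = np.mean(X[:, i] == a), np.mean(X[:, j] == b)
                    if p_ab > 0 and p_a > 0 and p_b > 0:
                        mi += p_ab * np.log(p_ab / (p_a * p_b))
            G.add_edge(i, j, weight=mi)
        return list(nx.maximum_spanning_tree(G).edges())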
Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination
In this work we revisit two classic high-dimensional online learning problems, namely linear regression and contextual bandits, from the perspective of adversarial robustness. Existing works in algorithmic robust statistics make strong dis…