Guy Blanc
The power of quantum circuits in sampling
We give new evidence that quantum circuits are substantially more powerful than classical circuits. We show, relative to a random oracle, that polynomial-size quantum circuits can sample distributions that subexponential-size classical cir…
Computational-Statistical Tradeoffs from NP-hardness
A central question in computer science and statistics is whether efficient algorithms can achieve the information-theoretic limits of statistical problems. Many computational-statistical tradeoffs have been shown under average-case assumpt…
Adaptive and oblivious statistical adversaries are equivalent
We resolve a fundamental question about the ability to perform a statistical task, such as learning, when an adversary corrupts the sample. Such adversaries are specified by the types of corruption they can make and their level of knowledg…
The Sample Complexity of Smooth Boosting and the Tightness of the Hardcore Theorem
Smooth boosters generate distributions that do not place too much weight on any given example. Originally introduced for their noise-tolerant properties, such boosters have also found applications in differential privacy, reproducibility, …
A Strong Direct Sum Theorem for Distributional Query Complexity
Consider the expected query complexity of computing the $k$-fold direct product $f^{\otimes k}$ of a function $f$ to error $\varepsilon$ with respect to a distribution $\mu^k$. One strategy is to sequentially compute each of the $k$ copies t…
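The naive strategy alluded to above presumably proceeds along the following standard lines (this is a generic calculation, not necessarily the exact analysis in the paper): compute each copy to error $\varepsilon/k$, so that a union bound gives overall error
\[
\Pr_{\boldsymbol{x} \sim \mu^k}\bigl[\text{some copy is answered incorrectly}\bigr] \;\le\; \sum_{i=1}^{k} \frac{\varepsilon}{k} \;=\; \varepsilon,
\]
at a query cost of roughly $k$ times the per-copy complexity at the smaller error $\varepsilon/k$.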
Harnessing the Power of Choices in Decision Tree Learning
We propose a simple generalization of standard and empirically successful decision tree learning algorithms such as ID3, C4.5, and CART. These algorithms, which have been central to machine learning for decades, are greedy in nature: they …
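For orientation, here is a minimal sketch of the greedy, impurity-based split step that ID3, C4.5, and CART are built around, with a top-$k$ ranking of candidate splits standing in for the idea of keeping several choices open. The impurity function and interface are illustrative only; this is not the paper's generalized algorithm.

    import numpy as np

    def gini(labels):
        """Gini impurity of a 0/1 label vector."""
        if len(labels) == 0:
            return 0.0
        p = labels.mean()
        return 2 * p * (1 - p)

    def top_k_splits(X, y, k=1):
        """Rank features by impurity reduction on a binary dataset.

        X: (m, d) 0/1 feature matrix, y: (m,) 0/1 labels.
        Standard greedy learners commit to the single best split (k = 1);
        returning k > 1 candidates is only a sketch of the 'choices' idea.
        """
        base = gini(y)
        gains = []
        for j in range(X.shape[1]):
            left, right = y[X[:, j] == 0], y[X[:, j] == 1]
            gain = base - (len(left) / len(y)) * gini(left) - (len(right) / len(y)) * gini(right)
            gains.append(gain)
        return np.argsort(gains)[::-1][:k]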
A Strong Composition Theorem for Junta Complexity and the Boosting of Property Testers
We prove a strong composition theorem for junta complexity and show how such theorems can be used to generically boost the performance of property testers. The $\varepsilon$-approximate junta complexity of a function $f$ is the smallest in…
Lifting uniform learners via distributional decomposition
We show how any PAC learning algorithm that works under the uniform distribution can be transformed, in a blackbox fashion, into one that works under an arbitrary and unknown distribution $\mathcal{D}$. The efficiency of our transformation…
Subsampling Suffices for Adaptive Data Analysis
Ensuring that analyses performed on a dataset are representative of the entire population is one of the central problems in statistics. Most classical techniques assume that the dataset is independent of the analyst's query and break down …
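In its simplest form, the subsampling idea can be sketched as follows: answer each statistical query on a small, fresh random subsample of the dataset rather than on the full dataset. The interface and parameters below are illustrative only and are not the paper's mechanism or its guarantees.

    import random

    def answer_query(dataset, query, subsample_size):
        """Estimate the mean of `query` (a function into [0, 1]) over the
        dataset using only a fresh random subsample."""
        sample = random.sample(dataset, subsample_size)
        return sum(query(x) for x in sample) / subsample_size

    # Illustrative usage: estimate the fraction of points exceeding 1.0.
    data = [random.gauss(0, 1) for _ in range(10_000)]
    print(answer_query(data, lambda x: float(x > 1.0), subsample_size=200))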
Certification with an NP Oracle
In the certification problem, the algorithm is given a function $f$ with certificate complexity $k$ and an input $x^\star$, and the goal is to find a certificate of size $\le \text{poly}(k)$ for $f$'s value at $x^\star$. This problem is in…
Multitask Learning via Shared Features: Algorithms and Hardness
We investigate the computational efficiency of multitask learning of Boolean functions over the $d$-dimensional hypercube, that are related by means of a feature representation of size $k \ll d$ shared across all tasks. We present a polyno…
A Query-Optimal Algorithm for Finding Counterfactuals
We design an algorithm for finding counterfactuals with strong theoretical guarantees on its performance. For any monotone model $f : X^d \to \{0,1\}$ and instance $x^\star$, our algorithm makes \[ {S(f)^{O(\Delta_f(x^\star))}\cdot \log d}\] qu…
Open Problem: Properly learning decision trees in polynomial time?
The authors recently gave an $n^{O(\log\log n)}$ time membership query algorithm for properly learning decision trees under the uniform distribution (Blanc et al., 2021). The previous fastest algorithm for this problem ran in $n^{O(\log n)…
Popular decision tree algorithms are provably noise tolerant
Using the framework of boosting, we prove that all impurity-based decision tree learning algorithms, including the classic ID3, C4.5, and CART, are highly noise tolerant. Our guarantees hold under the strongest noise model of nasty noise, …
Multiway Online Correlated Selection
We give a $0.5368$-competitive algorithm for edge-weighted online bipartite matching. Prior to our work, the best competitive ratio was $0.5086$ due to Fahrbach, Huang, Tao, and Zadimoghaddam (FOCS 2020). They achieved their breakthrough r…
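As background for the competitive ratios quoted above, the classical baseline for unweighted online bipartite matching is the greedy algorithm, which is $1/2$-competitive. The sketch below shows only this simple baseline; the paper's multiway online correlated selection is a far more involved randomized primitive for the edge-weighted problem.

    def greedy_online_matching(offline_neighbors):
        """Match each arriving online vertex to an arbitrary free offline
        neighbor; 1/2-competitive for unweighted online bipartite matching."""
        matched = set()
        matching = []
        for online_v, neighbors in enumerate(offline_neighbors):
            for u in neighbors:
                if u not in matched:
                    matched.add(u)
                    matching.append((online_v, u))
                    break
        return matching

    # Online vertices 0, 1, 2 arrive with these offline neighbor lists.
    print(greedy_online_matching([[0, 1], [0], [1, 2]]))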
The Query Complexity of Certification
We study the problem of {\sl certification}: given queries to a function $f : \{0,1\}^n \to \{0,1\}$ with certificate complexity $\le k$ and an input $x^\star$, output a size-$k$ certificate for $f$'s value on $x^\star$. This abstractly mo…
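For concreteness, a standard illustration of these notions (not specific to the paper's results): for $f = \mathrm{OR}_n$, any input $x^\star$ with $x^\star_i = 1$ has the size-$1$ certificate $\{i\}$, since every $y$ with $y_i = 1$ satisfies $f(y) = 1$; at $x^\star = 0^n$, the only certificate is the full set of $n$ coordinates, so the certificate complexity of $\mathrm{OR}_n$ is $n$.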
On Testing Decision Tree
In this paper, we study testing decision trees of size and depth that are significantly smaller than the number of attributes $n$. Our main result addresses the problem of $\mathrm{poly}(n, 1/\varepsilon)$ time algorithms with $\mathrm{poly}(s, 1/\varepsilon)$ query complexity (indepen…
On the power of adaptivity in statistical adversaries
We study a fundamental question concerning adversarial noise models in statistical problems where the algorithm receives i.i.d. draws from a distribution $\mathcal{D}$. The definitions of these adversaries specify the type of allowable cor…
Provably efficient, succinct, and precise explanations
We consider the problem of explaining the predictions of an arbitrary blackbox model $f$: given query access to $f$ and an instance $x$, output a small set of $x$'s features that in conjunction essentially determines $f(x)$. We design an e…
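A naive baseline for this task, included only to fix ideas (it is not the paper's algorithm and comes with no guarantees): greedily drop features whose values can be re-randomized without changing the prediction on $x$, as estimated from random completions.

    import random

    def sufficient_features(f, x, n_checks=200):
        """Greedy deletion heuristic: keep only features that appear needed
        to pin down f's prediction on x.  `f` maps a feature dict to 0/1,
        `x` is a dict of binary feature values.  Illustrative only."""
        keep = set(x)
        for feature in list(x):
            trial = keep - {feature}
            unchanged = all(
                f({k: (x[k] if k in trial else random.choice([0, 1])) for k in x}) == f(x)
                for _ in range(n_checks)
            )
            if unchanged:
                keep = trial
        return keep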
Decision tree heuristics can fail, even in the smoothed setting
Greedy decision tree learning heuristics are mainstays of machine learning practice, but theoretical justification for their empirical success remains elusive. In fact, it has long been known that there are simple target functions for whic…
Learning stochastic decision trees
We give a quasipolynomial-time algorithm for learning stochastic decision trees that is optimally resilient to adversarial noise. Given an $\eta$-corrupted set of uniform random samples labeled by a size-$s$ stochastic decision tree, our algo…
Testing and reconstruction via decision trees
We study sublinear and local computation algorithms for decision trees, focusing on testing and reconstruction. Our first result is a tester that runs in $\mathrm{poly}(\log s, 1/\varepsilon)\cdot n\log n$ time, makes $\mathrm{poly}(\log s…
Reconstructing decision trees
We give the first {\sl reconstruction algorithm} for decision trees: given queries to a function $f$ that is $\mathrm{opt}$-close to a size-$s$ decision tree, our algorithm provides query access to a decision tree $T$ where: $\circ$ $T$ ha…
Estimating decision tree learnability with polylogarithmic sample complexity
We show that top-down decision tree learning heuristics are amenable to highly efficient learnability estimation: for monotone target functions, the error of the decision tree hypothesis constructed by these heuristics can be estimated wit…
Query strategies for priced information, revisited
We consider the problem of designing query strategies for priced information, introduced by Charikar et al. In this problem the algorithm designer is given a function $f : \{0,1\}^n \to \{-1,1\}$ and a price associated with each of the $n$…
Universal guarantees for decision tree induction via a higher-order splitting criterion
We propose a simple extension of top-down decision tree learning heuristics such as ID3, C4.5, and CART. Our algorithm achieves provable guarantees for all target functions $f: \{-1,1\}^n \to \{-1,1\}$ with respect to the uniform distribut…
Efficient hyperparameter optimization by way of PAC-Bayes bound minimization
Identifying optimal values for a high-dimensional set of hyperparameters is a problem that has received growing attention given its importance to large-scale machine learning applications such as neural architecture search. Recently develo…
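As a toy illustration of the general idea of selecting hyperparameters by minimizing a generalization bound rather than held-out validation error, the sketch below scores candidates with a generic PAC-Bayes-flavored objective (training error plus a KL-type penalty). The objective, numbers, and interface are invented for illustration and are not the bound or method developed in the paper.

    import numpy as np

    def bound_style_objective(train_err, kl, n, delta=0.05):
        """PAC-Bayes-flavored bound-style score:
        train_err + sqrt((kl + log(1/delta)) / (2 n)).  Purely illustrative."""
        return train_err + np.sqrt((kl + np.log(1 / delta)) / (2 * n))

    # Hypothetical candidates: hyperparameter setting -> (training error, KL term).
    candidates = {"lam=0.1": (0.02, 400.0), "lam=1.0": (0.06, 60.0), "lam=10": (0.15, 5.0)}
    n = 5_000
    # The penalty can overturn the ranking by raw training error.
    best = min(candidates, key=lambda name: bound_style_objective(*candidates[name], n))
    print(best)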