Fanny Yang
ROC-n-reroll: How verifier imperfection affects test-time scaling
Test-time scaling aims to improve language model performance by leveraging additional compute during inference. Many works have empirically studied techniques such as Best-of-N (BoN) and Rejection Sampling (RS) that make use of a verifier …
Learning Pareto manifolds in high dimensions: How can regularization help?
Simultaneously addressing multiple objectives is becoming increasingly important in modern machine learning. At the same time, data is often high-dimensional and costly to label. For a single objective such as prediction risk, conventional…
Efficient Randomized Experiments Using Foundation Models
Randomized experiments are the preferred approach for evaluating the effects of interventions, but they are costly and often yield estimates with substantial uncertainty. On the other hand, in silico experiments leveraging foundation model…
Achievable distributional robustness when the robust risk is only partially identified
In safety-critical applications, machine learning models should generalize well under worst-case distribution shifts, that is, have a small robust risk. Invariance-based algorithms can provably take advantage of structural assumptions on t…
Atmospheric Transport Modeling of CO$_2$ with Neural Networks
Accurately describing the distribution of CO$_2$ in the atmosphere with atmospheric tracer transport models is essential for greenhouse gas monitoring and verification support systems to aid implementation of international climate agreements. Lar…
Copyright-Protected Language Generation via Adaptive Model Fusion
The risk of language models reproducing copyrighted material from their training data has led to the development of various protective measures. Among these, inference-time strategies that impose constraints via post-processing have shown …
Strong Copyright Protection for Language Models via Adaptive Model Fusion
The risk of language models unintentionally reproducing copyrighted material from their training data has led to the development of various protective measures. In this paper, we propose model fusion as an effective solution to safeguard a…
Detecting critical treatment effect bias in small subgroups
Randomized trials are considered the gold standard for making informed decisions in medicine, yet they often lack generalizability to the patient populations in clinical practice. Observational studies, on the other hand, cover a broader p…
Mini-Workshop: Interpolation and Over-parameterization in Statistics and Machine Learning
In recent years it has become clear that, contrary to traditional statistical beliefs, methods that interpolate (fit exactly) the noisy training data can still be statistically optimal. In particular, this phenomenon of “benign overfitt…
Graph Neural Networks for Atmospheric Transport Modeling of CO$_2$
Large deep neural network emulators are poised to revolutionize numerical weather prediction (NWP). Recent models like GraphCast or NeuralGCM can now compete and sometimes outperform traditional NWP systems, all at much lower computational…
Privacy-preserving data release leveraging optimal transport and particle gradient descent
We present a novel approach for differentially private data synthesis of protected tabular datasets, a relevant task in highly sensitive domains such as healthcare and government. Current state-of-the-art methods predominantly use marginal…
Robust Mixture Learning when Outliers Overwhelm Small Groups
We study the problem of estimating the means of well-separated mixtures when an adversary may add arbitrary outliers. While strong guarantees are available when the outlier fraction is significantly smaller than the minimum mixing weight, …
Hidden yet quantifiable: A lower bound for confounding strength using randomized trials
In the era of fast-paced precision medicine, observational studies play a major role in properly evaluating new treatments in clinical practice. Yet, unobserved confounding can significantly compromise causal conclusions drawn from non-ran…
Can semi-supervised learning use all the data effectively? A lower bound perspective
Prior works have shown that semi-supervised learning algorithms can leverage unlabeled data to improve over the labeled sample complexity of supervised learning (SL) algorithms. However, existing theoretical analyses focus on regimes where…
How robust accuracy suffers from certified training with convex relaxations
Adversarial attacks pose significant threats to deploying state-of-the-art classifiers in safety-critical applications. Two classes of methods have emerged to address this issue: empirical defences and certified defences. Although certifie…
PILLAR: How to make semi-private learning more effective
In Semi-Supervised Semi-Private (SP) learning, the learner has access to both public unlabelled and private labelled data. We propose a computationally efficient algorithm that, under mild assumptions on the data, provably achieves signifi…
Certified private data release for sparse Lipschitz functions
As machine learning has become more relevant for everyday applications, a natural requirement is the protection of the privacy of the training data. When the relevant learning questions are unknown in advance, or hyper-parameter tuning pla…
Strong inductive biases provably prevent harmless interpolation
Classical wisdom suggests that estimators should avoid fitting noise to achieve good generalization. In contrast, modern overparameterized models can yield small test error despite interpolating noise -- a phenomenon often called "benign o…
Tight bounds for maximum $\ell_1$-margin classifiers
Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum $\ell_1$-margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separab…
Margin-based sampling in high dimensions: When being active is less efficient than staying passive
It is widely believed that given the same labeling budget, active learning (AL) algorithms like margin-based active learning achieve better predictive performance than passive learning (PL), albeit at a higher computational cost. Recent em…
How unfair is private learning?
As machine learning algorithms are deployed on sensitive data in critical decision making processes, it is becoming increasingly important that they are also private and fair. In this paper, we show that, when the data has a long-tailed st…
Provable concept learning for interpretable predictions using variational autoencoders
In safety-critical applications, practitioners are reluctant to trust neural networks when no interpretable explanations are available. Many attempts to provide such explanations revolve around pixel-based attributions or use previously kn…
Fast Rates for Noisy Interpolation Require Rethinking the Effects of Inductive Bias
Good generalization performance on high-dimensional data crucially hinges on a simple structure of the ground truth and a corresponding strong inductive bias of the estimator. Even though this intuition is valid for regularized models, in …
Why adversarial training can hurt robust accuracy
Machine learning classifiers with high test accuracy often perform poorly under adversarial attacks. It is commonly believed that adversarial training alleviates this issue. In this paper, we demonstrate that, surprisingly, the opposite ma…
Tight bounds for minimum $\ell_1$-norm interpolation of noisy data
We provide matching upper and lower bounds of order $\sigma^2/\log(d/n)$ for the prediction error of the minimum $\ell_1$-norm interpolator, a.k.a. basis pursuit. Our result is tight up to negligible terms when $d \gg n$, and is the first to im…
Self-supervised Reinforcement Learning with Independently Controllable Subgoals
To successfully tackle challenging manipulation tasks, autonomous agents must learn a diverse set of skills and how to combine them. Recently, self-supervised agents that set their own abstract goals by exploiting the discovered structure …
Interpolation can hurt robust generalization even when there is no noise
Numerous recent works show that overparameterization implicitly reduces variance for min-norm interpolators and max-margin classifiers. These findings suggest that ridge regularization has vanishing benefits in high dimensions. We challeng…
How rotational invariance of common kernels prevents generalization in high dimensions
Kernel ridge regression is well-known to achieve minimax optimal rates in low-dimensional settings. However, its behavior in high dimensions is much less understood. Recent work establishes consistency for kernel regression under certain a…