Krishna Pillutla
InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy
As major progress in LLM-based long-form text generation enables paradigms such as retrieval-augmented generation (RAG) and inference-time scaling, safely incorporating private information into the generation remains a critical open questi…
Correlated Noise Mechanisms for Differentially Private Learning
This monograph explores the design and analysis of correlated noise mechanisms for differential privacy (DP), focusing on their application to private training of AI and machine learning models via the core primitive of estimation of weigh…
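As a rough illustration of the core idea (with illustrative names and notation, not the monograph's exact algorithms): rather than adding independent Gaussian noise at each step of DP-SGD, a correlated noise mechanism draws i.i.d. seed noise and linearly mixes it across iterations, which can cancel much of the accumulated error in the learned weights.

    import numpy as np

    def dp_sgd_correlated(grads, C, sigma, lr=0.1):
        """Sketch: noisy SGD where the injected noise is correlated across steps.

        grads: (T, d) array of already-clipped per-step gradients.
        C: (T, T) lower-triangular "encoder" matrix; privatizing C @ grads and
        decoding with C^{-1} means step t effectively sees noise C^{-1} @ z,
        correlated across iterations (C = identity recovers plain DP-SGD).
        """
        T, d = grads.shape
        z = sigma * np.random.randn(T, d)   # i.i.d. Gaussian seed noise
        corr_noise = np.linalg.inv(C) @ z   # correlated across iterations
        w = np.zeros(d)
        iterates = []
        for t in range(T):
            w = w - lr * (grads[t] + corr_noise[t])
            iterates.append(w.copy())
        return np.array(iterates)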
An Inversion Theorem for Buffered Linear Toeplitz (BLT) Matrices and Applications to Streaming Differential Privacy
Buffered Linear Toeplitz (BLT) matrices are a family of parameterized lower-triangular matrices that play an important role in streaming differential privacy with correlated noise. Our main result is a BLT inversion theorem: the inverse of…
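A minimal numerical illustration, assuming one common BLT parameterization (unit diagonal, and entry (i, j) below the diagonal equal to sum_k omega_k * theta_k^(i-j-1)); the inversion theorem predicts the inverse is again of buffered Toeplitz form, which the Toeplitz structure check below reflects.

    import numpy as np

    def blt(theta, omega, n):
        # Assumed parameterization of an n x n BLT matrix: diagonal 1,
        # entry (i, j) for i > j given by sum_k omega[k] * theta[k]**(i-j-1).
        M = np.eye(n)
        for i in range(n):
            for j in range(i):
                M[i, j] = sum(w * t ** (i - j - 1) for t, w in zip(theta, omega))
        return M

    M = blt(theta=[0.9, 0.5], omega=[0.3, 0.2], n=6)
    M_inv = np.linalg.inv(M)
    # Lower-triangular Toeplitz matrices are closed under inversion, so each
    # subdiagonal of the inverse is constant:
    print(np.allclose(np.diag(M_inv, -1), M_inv[1, 0]))  # True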
Fine-Tuning Large Language Models with User-Level Differential Privacy
We investigate practical and scalable algorithms for training large language models (LLMs) with user-level differential privacy (DP) in order to provably safeguard all the examples contributed by each user. We study two variants of DP-SGD …
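A minimal sketch of the user-level idea (illustrative, not the paper's exact variants): clip each user's aggregated contribution rather than each example's, so that the added noise masks everything a single user contributed.

    import numpy as np

    def user_level_noisy_update(user_grads, clip_norm, sigma):
        """One illustrative user-level DP-SGD step.

        user_grads: list of (n_u, d) arrays, one per sampled user.
        Each user's gradients are averaged into a single vector, which is
        then clipped, so the Gaussian noise hides the presence or absence
        of *all* of that user's examples at once.
        """
        clipped = []
        for g in user_grads:
            u = g.mean(axis=0)  # one contribution vector per user
            u = u * min(1.0, clip_norm / (np.linalg.norm(u) + 1e-12))
            clipped.append(u)
        noise = sigma * clip_norm * np.random.randn(len(clipped[0]))
        return (np.sum(clipped, axis=0) + noise) / len(clipped)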
Efficient and Near-Optimal Noise Generation for Streaming Differential Privacy
In the task of differentially private (DP) continual counting, we receive a stream of increments and our goal is to output an approximate running total of these increments, without revealing too much about any specific increment. Despite i…
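For context, a minimal sketch of the classic binary-tree baseline that this line of work improves on: each node of a tree over the stream holds one noisy partial sum, and every prefix sum is assembled from O(log n) nodes, so per-step error grows only polylogarithmically.

    import numpy as np
    from functools import lru_cache

    def tree_private_prefix_sums(x, sigma):
        """Illustrative binary-tree mechanism for DP continual counting."""
        x = np.asarray(x, dtype=float)
        n = len(x)

        @lru_cache(maxsize=None)
        def noisy(lo, hi):  # one noisy sum per dyadic interval [lo, hi)
            return x[lo:hi].sum() + sigma * np.random.randn()

        def prefix(t):  # assemble the prefix [0, t) from dyadic pieces
            total, lo = 0.0, 0
            for k in reversed(range(n.bit_length() + 1)):
                if lo + 2 ** k <= t:
                    total += noisy(lo, lo + 2 ** k)
                    lo += 2 ** k
            return total

        return np.array([prefix(t) for t in range(1, n + 1)])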
Distributionally Robust Optimization with Bias and Variance Reduction
We consider the distributionally robust optimization (DRO) problem with spectral risk-based uncertainty set and $f$-divergence penalty. This formulation includes common risk-sensitive learning objectives such as regularized condition value…
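As a small, simplified illustration of the objective family: a spectral risk weights the *sorted* losses by a nondecreasing spectrum, so a uniform spectrum recovers empirical risk minimization while a spectrum concentrated on the largest losses recovers CVaR-like or worst-case risk.

    import numpy as np

    def spectral_risk(losses, spectrum):
        # Weighted average of sorted losses; the spectrum is nondecreasing
        # and sums to 1, putting more weight on the largest losses.
        losses = np.sort(losses)  # ascending
        spectrum = np.asarray(spectrum, dtype=float)
        assert np.all(np.diff(spectrum) >= 0) and np.isclose(spectrum.sum(), 1.0)
        return float(losses @ spectrum)

    # Example: CVaR at level 0.5 over 4 losses weights each of the top 2 by 1/2.
    print(spectral_risk([0.1, 0.9, 0.4, 0.7], [0.0, 0.0, 0.5, 0.5]))  # 0.8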
User Inference Attacks on Large Language Models
Fine-tuning is a common and effective method for tailoring large language models (LLMs) to specialized tasks and applications. In this paper, we study the privacy implications of fine-tuning LLMs on user data. To this end, we consider a re…
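A hedged sketch of a likelihood-ratio style test statistic in this spirit (illustrative; the paper's exact attack may differ): aggregate, over a candidate user's held-out documents, how much more likely the fine-tuned model finds them than a reference model.

    import numpy as np

    def user_inference_score(user_docs, log_p_finetuned, log_p_reference):
        # log_p_* are callables returning a document's total log-likelihood
        # under the fine-tuned and reference models, respectively. A large
        # average log-likelihood ratio suggests the user's data was in the
        # fine-tuning set.
        scores = [log_p_finetuned(d) - log_p_reference(d) for d in user_docs]
        return float(np.mean(scores))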
Correlated Noise Provably Beats Independent Noise for Differentially Private Learning
Differentially private learning algorithms inject noise into the learning process. While the most common private learning algorithm, DP-SGD, adds independent Gaussian noise in each iteration, recent work on matrix factorization mechanisms …
Towards Federated Foundation Models: Scalable Dataset Pipelines for Group-Structured Learning
We introduce Dataset Grouper, a library to create large-scale group-structured (e.g., federated) datasets, enabling federated learning simulation at the scale of foundation models. This library facilitates the creation of group-structured …
Unleashing the Power of Randomization in Auditing Differentially Private ML
We present a rigorous methodology for auditing differentially private machine learning algorithms by adding multiple carefully designed examples called canaries. We take a first principles approach based on three key components. First, we …
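A minimal sketch of how canary-based guesses translate into an empirical privacy bound (illustrative; a real audit needs confidence intervals, e.g. Clopper-Pearson, over many trials): for an (eps, delta)-DP mechanism, TPR <= exp(eps) * FPR + delta, so observed rates imply a lower bound on eps.

    import numpy as np

    def empirical_epsilon(scores_in, scores_out, threshold, delta=0.0):
        # scores_in: attack scores when the canary was included;
        # scores_out: scores when it was excluded. The DP constraint
        # TPR <= exp(eps) * FPR + delta gives eps >= log((TPR - delta) / FPR).
        tpr = np.mean(np.asarray(scores_in) > threshold)
        fpr = np.mean(np.asarray(scores_out) > threshold)
        if fpr == 0 or tpr <= delta:
            return 0.0
        return float(np.log((tpr - delta) / fpr))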
Modified Gauss-Newton Algorithms under Noise
Gauss-Newton methods and their stochastic version have been widely used in machine learning and signal processing. Their nonsmooth counterparts, modified Gauss-Newton or prox-linear algorithms, can lead to contrasting outcomes when compare…
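As a small illustration of the prox-linear (modified Gauss-Newton) template for min_x ||c(x)||^2: linearize c around the current point and add a proximal term, which reduces each step to a damped, Levenberg-Marquardt-like linear solve. This is a generic sketch, not the paper's specific algorithm.

    import numpy as np

    def prox_linear_step(c, J, x, t):
        # One step: x+ = argmin_y ||c(x) + J(x)(y - x)||^2 + ||y - x||^2 / (2t),
        # whose optimality condition is the damped normal equation below.
        cx, Jx = c(x), J(x)
        d = np.linalg.solve(Jx.T @ Jx + np.eye(len(x)) / (2 * t), -Jx.T @ cx)
        return x + d

    # Example: residuals c(x) = A x - b converge to the least-squares fit.
    A = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
    b = np.array([1.0, 2.0, 3.0])
    x = np.zeros(2)
    for _ in range(20):
        x = prox_linear_step(lambda x: A @ x - b, lambda x: A, x, t=1.0)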
MAUVE Scores for Generative Models: Theory and Practice
Generative artificial intelligence has made significant strides, producing text indistinguishable from human prose and remarkably photorealistic images. Automatically measuring how close the generated data distribution is to the target dis…
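A hedged sketch of the divergence-frontier computation behind MAUVE-style scores (the quantizer, scaling constant, and summary below are illustrative choices, not the official implementation): jointly quantize model and human embeddings, then trace KL divergences to mixtures of the two induced histograms.

    import numpy as np
    from scipy.special import rel_entr
    from sklearn.cluster import KMeans

    def mauve_like(p_feats, q_feats, k=16, c=5.0, n_lambda=50):
        # Quantize both samples jointly, form histograms p and q over the
        # clusters, trace (KL(q||r), KL(p||r)) for mixtures r = lam*p + (1-lam)*q,
        # and summarize the curve by an area in exp(-c * KL) coordinates.
        km = KMeans(n_clusters=k, n_init=10).fit(np.vstack([p_feats, q_feats]))
        p = np.bincount(km.predict(p_feats), minlength=k) / len(p_feats)
        q = np.bincount(km.predict(q_feats), minlength=k) / len(q_feats)
        xs, ys = [], []
        for lam in np.linspace(1e-3, 1 - 1e-3, n_lambda):
            r = lam * p + (1 - lam) * q
            xs.append(np.exp(-c * rel_entr(q, r).sum()))
            ys.append(np.exp(-c * rel_entr(p, r).sum()))
        return float(abs(np.trapz(ys, xs)))  # area under the frontier curve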
Stochastic Optimization for Spectral Risk Measures
Spectral risk objectives (also called $L$-risks) allow learning systems to interpolate between optimizing average-case performance (as in empirical risk minimization) and worst-case performance on a task. We develop stochastic algori…
Statistical and Computational Guarantees for Influence Diagnostics
Influence diagnostics such as influence functions and approximate maximum influence perturbations are popular in machine learning and in AI domain applications. Influence diagnostics are powerful statistical tools to identify influential d…
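A one-line sketch of the classical influence-function diagnostic: up-weighting training point z_i by an infinitesimal amount changes a test loss by roughly -grad_test^T H^{-1} grad_train_i, where H is the Hessian of the training objective at the fitted parameters.

    import numpy as np

    def influence_on_test_loss(grad_train_i, hessian, grad_test):
        # Classical first-order influence of training point z_i on a test
        # loss; solving with H directly (no explicit inverse) for stability.
        return float(-grad_test @ np.linalg.solve(hessian, grad_train_i))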
Tackling Distribution Shifts in Federated Learning with Superquantile Aggregation
Differentially Private Federated Quantiles with the Distributed Discrete Gaussian Mechanism
Federated Learning with Heterogeneous Data: A Superquantile Optimization Approach
We present a federated learning framework that is designed to robustly deliver good predictive performance across individual clients with heterogeneous data. The proposed approach hinges upon a superquantile-based learning objective that c…
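A small illustration of the aggregate at the heart of this line of work (simplified: it ignores the fractional boundary term that an exact CVaR computation needs when the tail fraction is not an integer number of clients): the superquantile at level theta averages the worst (1 - theta)-fraction of per-client losses.

    import numpy as np

    def superquantile(losses, theta):
        # theta = 0 recovers the plain mean over clients; theta -> 1
        # approaches the single worst client's loss.
        losses = np.sort(np.asarray(losses, dtype=float))
        k = int(np.ceil((1 - theta) * len(losses)))
        return float(losses[-k:].mean())

    print(superquantile([0.1, 0.2, 0.9, 1.1], theta=0.5))  # mean of top 2 = 1.0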
Federated Learning with Partial Model Personalization
We consider two federated learning algorithms for training partially personalized models, where the shared and personal parameters are updated either simultaneously or alternately on the devices. Both algorithms have been proposed in the l…
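A minimal sketch of the alternating variant of one local round (illustrative; gradient callables and learning rate are stand-ins): refresh the personal parameters with the shared ones frozen, then step on the shared parameters, which are later aggregated across devices. The simultaneous variant updates both blocks at once.

    def alternating_personalized_step(shared, personal,
                                      grad_shared, grad_personal, lr):
        # First the device-specific block, then the globally shared block.
        personal = personal - lr * grad_personal(shared, personal)
        shared = shared - lr * grad_shared(shared, personal)
        return shared, personal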
Superquantiles at Work: Machine Learning Applications and Efficient Subgradient Computation
R. Tyrrell Rockafellar and collaborators introduced, in a series of works, new regression modeling methods based on the notion of superquantile (or conditional value-at-risk). These methods have been influential in economics, finance, manag…
Robust Aggregation for Federated Learning
Federated learning is the centralized training of statistical models from decentralized data on mobile devices while preserving the privacy of each device. We present a robust aggregation approach to make federated learning robust to setti…
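A minimal sketch of the geometric-median aggregation this approach builds on, computed by a smoothed Weiszfeld-style iteration (illustrative constants; the smoothing parameter nu guards against division by zero near a client's point).

    import numpy as np

    def smoothed_weiszfeld(points, weights, nu=1e-6, iters=100):
        # points: (m, d) client updates; weights: (m,) client weights.
        # The geometric median is robust to a minority of corrupted updates,
        # unlike the weighted mean used by standard federated averaging.
        points = np.asarray(points, dtype=float)
        z = np.average(points, axis=0, weights=weights)  # start at the mean
        for _ in range(iters):
            dists = np.maximum(np.linalg.norm(points - z, axis=1), nu)
            w = weights / dists
            z = (w[:, None] * points).sum(axis=0) / w.sum()
        return z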
Federated Learning with Superquantile Aggregation for Heterogeneous Data
We present a federated learning framework that is designed to robustly deliver good predictive performance across individual clients with heterogeneous data. The proposed approach hinges upon a superquantile-based learning objective that c…
LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes
Learning binary representations of instances and classes is a classical problem with several high potential applications. In modern settings, the compression of high-dimensional neural representations to low-dimensional binary codes is a c…
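As a generic illustration of the binary-code setting (the random projection below is a stand-in for the paper's learned codebook): compress real-valued embeddings to short binary codes via the sign of a projection, then retrieve by Hamming distance.

    import numpy as np

    def binary_codes(embeddings, W):
        # Sign of a (here random, in the paper learned) projection.
        return (embeddings @ W > 0).astype(np.uint8)

    def hamming_nn(query_code, db_codes):
        # Nearest neighbor under Hamming distance over the binary codes.
        return int(np.argmin((db_codes != query_code).sum(axis=1)))

    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 64))   # 64-d embeddings
    W = rng.normal(size=(64, 16))    # project to 16-bit codes
    db = binary_codes(X, W)
    print(hamming_nn(binary_codes(X[:1], W)[0], db))  # 0: nearest is itself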
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Level, and Frontier Integral
The spectacular success of deep generative models calls for quantitative tools to measure their statistical performance. Divergence frontiers have recently been proposed as an evaluation framework for generative models, due to their abilit…
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals
The spectacular success of deep generative models calls for quantitative tools to measure their statistical performance. Divergence frontiers have recently been proposed as an evaluation framework for generative models, due to their abilit…
A Superquantile Approach to Federated Learning with Heterogeneous Devices