Proceedings of the ACM on Programming Languages • Vol 8 • No PLDI
Program Analysis for Adaptive Data Analysis
June 2024 • Jiawen Liu, Weihao Qu, Marco Gaboardi, Deepak Garg, Jonathan Ullman
Data analyses are usually designed to identify some property of the population from which the data are drawn, generalizing beyond the specific data sample. For this reason, data analyses are often designed in a way that guarantees that they produce a low generalization error. That is, they are designed so that the result of a data analysis run on a sample data does not differ too much from the result one would achieve by running the analysis over the entire population. An adaptive data analysis can be seen as a pr…