Ronan Perry
Inference on the Proportion of Variance Explained in Principal Component Analysis
Principal component analysis (PCA) is a longstanding and well-studied approach for dimension reduction. It rests upon the assumption that the underlying signal in the data has low rank, and thus can be well-summarized using a small number …
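The quantity in the title, the proportion of variance explained (PVE), has a standard plug-in estimate. The sketch below (plain NumPy/scikit-learn on a simulated low-rank-plus-noise data set; it shows the point estimate only, not the paper's inference procedure) illustrates what is being estimated.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# Low-rank signal plus noise: 200 samples, 10 features, rank-2 signal
signal = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 10))
X = signal + 0.3 * rng.normal(size=(200, 10))

pca = PCA().fit(X)
pve = pca.explained_variance_ratio_   # per-component proportion of variance explained
cum_pve = np.cumsum(pve)              # cumulative PVE of the first k components
```

With a rank-2 signal dominating the noise, the first two components capture most of the variance; the paper concerns valid confidence statements for such quantities.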
On the minimum strength of (unobserved) covariates to overturn an insignificant result
We study conditions under which the addition of variables to a regression equation can turn a previously statistically insignificant result into a significant one. Specifically, we characterize the minimum strength of association required …
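The phenomenon described can be illustrated with a small simulation (hypothetical variable names; a generic OLS sketch, not the paper's characterization): omitting a strong covariate z inflates the residual variance, leaving the coefficient on x insignificant, while including z sharpens the estimate.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n = 200
x = rng.normal(size=n)   # variable of interest
z = rng.normal(size=n)   # covariate initially omitted
y = 0.1 * x + 3.0 * z + 0.05 * rng.normal(size=n)

def coef_pvalue(y, X):
    """Two-sided p-value for the first column's coefficient in an OLS fit with intercept."""
    X = np.column_stack([X, np.ones(len(y))])
    beta, _, _, _ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = len(y) - X.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(X.T @ X)
    t = beta[0] / np.sqrt(cov[0, 0])
    return 2 * stats.t.sf(abs(t), dof)

p_without = coef_pvalue(y, x[:, None])            # z omitted: x looks weak
p_with = coef_pvalue(y, np.column_stack([x, z]))  # z included: x is clearly significant
```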
Infer-and-widen, or not?
In recent years, there has been substantial interest in the task of selective inference: inference on a parameter that is selected from the data. Many of the existing proposals fall into what we refer to as the infer-and-widen frame…
Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks
Decision forests, in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. Forests dominate other methods in tabular data, that is…
Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis
Machine learning approaches commonly rely on the assumption of independent and identically distributed (i.i.d.) data. In reality, however, this assumption is almost always violated due to distribution shifts between environments. Although …
neurodata/graspy: GraSPy 0.3
We're happy to announce the release of GraSPy 0.3! GraSPy is a Python package for understanding the properties of random graphs that arise from modern datasets, such as social networks and brain networks. For more …
mvlearn: Multiview Machine Learning in Python
As data are increasingly generated from multiple disparate sources, multiview data sets, in which each sample has features in distinct views, have ballooned in recent years. However, no comprehensive package exists that enables non-specialis…
Nonpar MANOVA via Independence Testing
The $k$-sample testing problem tests whether or not $k$ groups of data points are sampled from the same distribution. Multivariate analysis of variance (MANOVA) is currently the gold standard for $k$-sample testing but makes strong, often …
Universally Consistent K-Sample Tests via Dependence Measures
The K-sample testing problem involves determining whether K groups of data points are each drawn from the same distribution. Analysis of variance is arguably the most classical method to test mean differences, along with several recent met…
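As a generic illustration of detecting distributional differences between groups with a distance-based statistic (a sketch of the general idea only, not the authors' dependence-measure construction), a two-sample energy-distance permutation test can be written as:

```python
import numpy as np
from scipy.spatial.distance import cdist

def energy_stat(x, y):
    """Energy distance between two samples (Euclidean; diagonal zeros kept for simplicity)."""
    return 2 * cdist(x, y).mean() - cdist(x, x).mean() - cdist(y, y).mean()

def perm_test(x, y, n_perm=200, seed=0):
    """Permutation p-value: shuffle group labels and recompute the statistic."""
    rng = np.random.default_rng(seed)
    pooled = np.vstack([x, y])
    n, obs, count = len(x), energy_stat(x, y), 0
    for _ in range(n_perm):
        idx = rng.permutation(len(pooled))
        if energy_stat(pooled[idx[:n]], pooled[idx[n:]]) >= obs:
            count += 1
    return (count + 1) / (n_perm + 1)

rng = np.random.default_rng(2)
x = rng.normal(0, 1, size=(100, 2))
y = rng.normal(2, 1, size=(100, 2))  # mean-shifted group
p = perm_test(x, y)
```

The same shuffle-the-labels recipe extends to K groups by summing pairwise statistics.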
Nonparametric MANOVA via Independence Testing
The $k$-sample testing problem tests whether or not $k$ groups of data points are sampled from the same distribution. Multivariate analysis of variance (MANOVA) is currently the gold standard for $k$-sample testing but makes strong, often …
Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks
Decision forests (Forests), in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. In particular, Forests dominate other method…
MANIFOLD FORESTS: CLOSING THE GAP ON NEURAL NETWORKS
Decision forests (DFs), in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. In particular, DFs dominate other methods in tab…
Random Forests for Adaptive Nearest Neighbor Estimation of Information-Theoretic Quantities
Information-theoretic quantities, such as conditional entropy and mutual information, are critical data summaries for quantifying uncertainty. Current widely used approaches for computing such quantities rely on nearest neighbor methods an…
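As background for the nearest-neighbor approaches the abstract mentions, a minimal Kozachenko-Leonenko kNN entropy estimator (a textbook sketch, not the paper's forest-based method) looks like:

```python
import numpy as np
from math import lgamma, pi, log
from scipy.spatial import cKDTree
from scipy.special import digamma

def knn_entropy(x, k=3):
    """Kozachenko-Leonenko kNN estimate of differential entropy, in nats."""
    x = np.asarray(x, dtype=float)
    n, d = x.shape
    # Distance to the k-th nearest neighbor, excluding the point itself
    eps = cKDTree(x).query(x, k=k + 1)[0][:, k]
    # Log volume of the d-dimensional unit ball
    log_vd = (d / 2) * log(pi) - lgamma(d / 2 + 1)
    return digamma(n) - digamma(k) + log_vd + d * np.mean(np.log(eps))

rng = np.random.default_rng(3)
h = knn_entropy(rng.normal(size=(5000, 1)))  # true value: 0.5*ln(2*pi*e) ≈ 1.419
```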