Pradeep Ravikumar
Contextures: Representations from Contexts
Despite the empirical success of foundation models, we do not have a systematic characterization of the representations that these models learn. In this paper, we establish the contexture theory. It shows that a large class of representati…
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Recent years have witnessed the rapid development of Neuro-Symbolic (NeSy) AI systems, which integrate symbolic reasoning into deep neural networks. However, most of the existing benchmarks for NeSy AI fail to provide long-horizon reasonin…
Identifying General Mechanism Shifts in Linear Causal Representations
We consider the linear causal representation learning setting where we observe a linear mixing of $d$ unknown latent factors, which follow a linear structural causal model. Recent work has shown that it is possible to recover the latent fa…
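To make this setting concrete, the following is a minimal sketch (with hypothetical dimensions and variable names, not code from the paper) of the data-generating process the abstract describes: latent factors that follow a linear structural causal model, observed only through an unknown linear mixing.

```python
import numpy as np

rng = np.random.default_rng(0)
d, p, n = 3, 10, 1000                       # latent dim, observed dim, samples

# Linear SCM over the latents: z = B^T z + eps, with B strictly upper
# triangular so the induced graph is acyclic, hence z = (I - B^T)^{-1} eps.
B = np.triu(rng.normal(size=(d, d)), k=1)
eps = rng.normal(size=(n, d))
Z = eps @ np.linalg.inv(np.eye(d) - B)      # rows are latent samples

# Unknown linear mixing into the observed space: x = G^T z.
G = rng.normal(size=(d, p))
X = Z @ G

print(X.shape)   # (n, p); only X is observed, while Z, B, G are unknown
```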
Markov Equivalence and Consistency in Differentiable Structure Learning
Existing approaches to differentiable structure learning of directed acyclic graphs (DAGs) rely on strong identifiability assumptions in order to guarantee that global minimizers of the acyclicity-constrained optimization problem identifie…
Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers
Large Language Models (LLMs) have the capacity to store and recall facts. Through experimentation with open-source models, we observe that this ability to retrieve facts can be easily manipulated by changing contexts, even without altering…
On the Origins of Linear Representations in Large Language Models
Recent works have argued that high-level semantic concepts are encoded "linearly" in the representation space of large language models. In this work, we study the origins of such linear representations. To that end, we introduce a simple l…
Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models
To build intelligent machine learning systems, there are two broad approaches. One approach is to build inherently interpretable models, as endeavored by the growing field of causal representation learning. The other approach is to build h…
Spectrally Transformed Kernel Regression
Unlabeled data is a key component of modern machine learning. In general, the role of unlabeled data is to impose a form of smoothness, usually from the similarity information encoded in a base kernel, such as the $ε$-neighbor kernel or th…
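As a point of reference for the base-kernel smoothness the abstract alludes to, here is a small sketch of plain kernel smoothing with an ε-neighbor kernel. It illustrates the base kernel only, not the spectral transformation proposed in the paper; all names are illustrative.

```python
import numpy as np

def eps_neighbor_kernel(X1, X2, eps=0.5):
    """Base similarity kernel: k(x, x') = 1 if ||x - x'|| <= eps, else 0."""
    dist = np.linalg.norm(X1[:, None, :] - X2[None, :, :], axis=-1)
    return (dist <= eps).astype(float)

def kernel_regression(X_train, y_train, X_test, eps=0.5):
    """Nadaraya-Watson smoothing with the epsilon-neighbor kernel.
    Test points with no neighbors within eps get a prediction of 0."""
    K = eps_neighbor_kernel(X_test, X_train, eps)
    weights = K / np.clip(K.sum(axis=1, keepdims=True), 1e-12, None)
    return weights @ y_train

rng = np.random.default_rng(0)
X_train = rng.uniform(-3, 3, size=(200, 1))
y_train = np.sin(X_train[:, 0]) + 0.1 * rng.normal(size=200)
X_test = np.linspace(-3, 3, 50)[:, None]
print(kernel_regression(X_train, y_train, X_test, eps=0.5)[:5])
```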
An Interventional Perspective on Identifiability in Gaussian LTI Systems with Independent Component Analysis
We investigate the relationship between system identification and intervention design in dynamical systems. While previous research demonstrated how identifiable representation learning methods, such as Independent Component Analysis (ICA)…
Responsible AI (RAI) Games and Ensembles
Several recent works have studied the societal effects of AI; these include issues such as fairness, robustness, and safety. In many of these objectives, a learner seeks to minimize its worst-case loss over a set of predefined distribution…
Sample based Explanations via Generalized Representers
We propose a general class of sample-based explanations of machine learning models, which we term generalized representers. To measure the effect of a training sample on a model's test prediction, generalized representers use two componen…
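For intuition, the classical representer decomposition that this line of work generalizes can be written down directly for kernel ridge regression: a test prediction splits into per-training-sample contributions given by a global importance weight times a similarity to the test point. The sketch below shows only this textbook special case (names are illustrative), not the paper's construction.

```python
import numpy as np

def rbf_kernel(X1, X2, gamma=1.0):
    sq = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * sq)

rng = np.random.default_rng(0)
X, y = rng.normal(size=(100, 5)), rng.normal(size=100)
lam = 1e-1

# Kernel ridge regression: f(x) = sum_i alpha_i * k(x_i, x)
K = rbf_kernel(X, X)
alpha = np.linalg.solve(K + lam * np.eye(len(X)), y)

# Sample-based explanation of a test prediction: the contribution of training
# point i is alpha_i * k(x_i, x_test), i.e. global importance times similarity.
x_test = rng.normal(size=(1, 5))
contributions = alpha * rbf_kernel(X, x_test)[:, 0]
print("prediction:", contributions.sum())
print("most influential training samples:", np.argsort(-np.abs(contributions))[:3])
```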
Identifying Representations for Intervention Extrapolation
The premise of identifiable and causal representation learning is to improve the current representation learning paradigm in terms of generalizability or robustness. Despite recent progress in questions of identifiability, more theoretical…
Individual Fairness Under Uncertainty
Algorithmic fairness, the research field of making machine learning (ML) algorithms fair, is an established area in ML. As ML technologies expand their application domains, including ones with high societal impact, it becomes essential to …
iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models
Structural causal models (SCMs) are widely used in various disciplines to represent causal relationships among variables in complex systems. Unfortunately, the underlying causal structure is often unknown, and estimating it from data remai…
Global Optimality in Bivariate Gradient-based DAG Learning
Recently, a new class of non-convex optimization problems motivated by the statistical problem of learning an acyclic directed graphical model from data has attracted significant interest. While existing work uses standard first-order opti…
Learning Linear Causal Representations from Interventions under General Nonlinear Mixing
We study the problem of learning causal representations from unknown, latent interventions in a general setting, where the latent distribution is Gaussian but the mixing function is completely general. We prove strong identifiability resul…
Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression
Data augmentation is critical to the empirical success of modern self-supervised representation learning, such as contrastive learning and masked language modeling. However, a theoretical understanding of the exact role of augmentation rem…
Representer Point Selection for Explaining Regularized High-dimensional Models
We introduce a novel class of sample-based explanations we term high-dimensional representers, which can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training samp…
Optimizing NOTEARS Objectives via Topological Swaps
Recently, an intriguing class of non-convex optimization problems has emerged in the context of learning directed acyclic graphs (DAGs). These problems involve minimizing a given loss or score function, subject to a non-convex continuous c…
Learning with Explanation Constraints
As larger deep learning models are hard to interpret, there has been a recent focus on generating explanations of these black-box models. In contrast, we may have a priori explanations of how models should behave. In this paper, we formaliz…
Nash Equilibria and Pitfalls of Adversarial Training in Adversarial Robustness Games
Adversarial training is a standard technique for training adversarially robust models. In this paper, we study adversarial training as an alternating best-response strategy in a 2-player zero-sum game. We prove that even in a simple scenar…
Label Propagation with Weak Supervision
Semi-supervised learning and weakly supervised learning are important paradigms that aim to reduce the growing demand for labeled data in current machine learning applications. In this paper, we introduce a novel analysis of the classical …
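For reference, the classical label propagation algorithm that the abstract refers to analyzing looks roughly as follows; this is a minimal sketch on a toy graph, and the variable names and example are illustrative rather than taken from the paper.

```python
import numpy as np

def label_propagation(W, labels, labeled_mask, alpha=0.9, n_iter=100):
    """Classical label propagation on a similarity graph with adjacency W:
    iterate F <- alpha * S @ F + (1 - alpha) * Y, with S = D^{-1/2} W D^{-1/2}."""
    deg = W.sum(axis=1)
    S = W / np.sqrt(np.outer(deg, deg))
    n_classes = int(labels.max()) + 1
    Y = np.zeros((len(labels), n_classes))
    Y[labeled_mask, labels[labeled_mask]] = 1.0   # one-hot seeds on labeled nodes
    F = Y.copy()
    for _ in range(n_iter):
        F = alpha * S @ F + (1 - alpha) * Y
    return F.argmax(axis=1)

# Toy graph: two tight clusters joined by weak edges, one labeled node in each.
W = np.full((6, 6), 0.01)
W[:3, :3] = 1.0
W[3:, 3:] = 1.0
np.fill_diagonal(W, 0.0)
labels = np.array([0, 0, 0, 1, 0, 0])              # only masked entries matter
labeled_mask = np.array([True, False, False, True, False, False])
print(label_propagation(W, labels, labeled_mask))  # -> [0 0 0 1 1 1]
```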
DAGMA: Learning DAGs via M-matrices and a Log-Determinant Acyclicity Characterization
The combinatorial problem of learning directed acyclic graphs (DAGs) from data was recently framed as a purely continuous optimization problem by leveraging a differentiable acyclicity characterization of DAGs based on the trace of a matri…
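As context, the log-determinant acyclicity characterization in the title and the earlier trace-of-matrix-exponential characterization the abstract refers to can both be written as smooth functions that vanish exactly on weighted adjacency matrices of DAGs. A small sketch of the two functions only, not the paper's full optimization procedure:

```python
import numpy as np
from scipy.linalg import expm

def h_logdet(W, s=1.0):
    """Log-determinant acyclicity: h_s(W) = -log det(sI - W*W) + d log s.
    Zero iff W has no directed cycles (for s > 0, with sI - W*W an M-matrix)."""
    d = W.shape[0]
    _, logabsdet = np.linalg.slogdet(s * np.eye(d) - W * W)
    return -logabsdet + d * np.log(s)

def h_trace_expm(W):
    """Earlier trace-based characterization: h(W) = tr(exp(W*W)) - d."""
    return np.trace(expm(W * W)) - W.shape[0]

W_dag = np.array([[0.0, 1.5, 0.0],
                  [0.0, 0.0, -0.7],
                  [0.0, 0.0, 0.0]])        # acyclic: 1 -> 2 -> 3
W_cyc = W_dag.copy()
W_cyc[2, 0] = 0.5                          # closes the cycle 1 -> 2 -> 3 -> 1

print(h_logdet(W_dag), h_trace_expm(W_dag))   # both ~0.0
print(h_logdet(W_cyc), h_trace_expm(W_cyc))   # both strictly positive
```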
Concept Gradient: Concept-based Interpretation Without Linear Assumption
Concept-based interpretations of black-box models are often more intuitive for humans to understand. The most widely adopted approach for concept-based interpretation is Concept Activation Vector (CAV). CAV relies on learning a linear rela…
Identifiability of deep generative models without auxiliary information
We prove identifiability of a broad class of deep latent variable models that (a) have universal approximation capabilities and (b) are the decoders of variational autoencoders that are commonly used in practice. Unlike existing work, our …
Building Robust Ensembles via Margin Boosting
In the context of adversarial robustness, a single model does not usually have enough power to defend against all possible adversarial attacks, and as a result, has sub-optimal robustness. Consequently, an emerging line of work has focused…
Faith-Shap: The Faithful Shapley Interaction Index
Shapley values, which were originally designed to assign attributions to individual players in coalition games, have become a commonly used approach in explainable machine learning to provide attributions to input features for black-box ma…
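As background for where these attributions come from, the classical Shapley value of a player in a coalition game can be computed exactly by enumerating coalitions. The sketch below does this for a toy value function; it shows only the standard Shapley value, not the interaction index introduced in the paper, and all names are illustrative.

```python
import itertools
import math

def shapley_values(value_fn, n):
    """Exact Shapley values for a set function v over players {0, ..., n-1}:
    phi_i = sum_{S not containing i} |S|!(n-|S|-1)!/n! * [v(S u {i}) - v(S)]."""
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for r in range(len(others) + 1):
            for S in itertools.combinations(others, r):
                w = math.factorial(r) * math.factorial(n - r - 1) / math.factorial(n)
                phi[i] += w * (value_fn(set(S) | {i}) - value_fn(set(S)))
    return phi

# Toy value function: output on a coalition of features is the sum of the
# included feature values plus a pairwise interaction between features 0 and 1.
x = [1.0, 2.0, 3.0]
def v(S):
    out = sum(x[i] for i in S)
    if 0 in S and 1 in S:
        out += 1.0
    return out

print(shapley_values(v, n=3))   # interaction split evenly: [1.5, 2.5, 3.0]
```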
Human-Centered Concept Explanations for Neural Networks
Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not effectively communicate why the mod…
Threading the Needle of On and Off-Manifold Value Functions for Shapley Explanations
A popular explainable AI (XAI) approach to quantify feature importance of a given model is via Shapley values. These Shapley values arose in cooperative games, and hence a critical ingredient to compute these in an XAI context is a so-call…
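The truncated sentence refers to the value function that lifts a model to coalitions of features; the distinction in the title is between off-manifold (marginal) and on-manifold (conditional) value functions. Below is a small sketch of that distinction on correlated synthetic data, using a crude nearest-neighbor stand-in for the conditional expectation; it is illustrative only, not the paper's proposal.

```python
import numpy as np

def off_manifold_value(f, x, S, X_background):
    """Marginal (off-manifold) value: fix the features in S to x_S and
    sample the remaining features from the data marginal."""
    Xb = X_background.copy()
    Xb[:, list(S)] = x[list(S)]
    return f(Xb).mean()

def on_manifold_value(f, x, S, X_background, k=50):
    """Conditional (on-manifold) value: average f over background points whose
    S-features are closest to x_S (a crude stand-in for E[f(X) | X_S = x_S])."""
    if not S:
        return f(X_background).mean()
    dist = np.linalg.norm(X_background[:, list(S)] - x[list(S)], axis=1)
    return f(X_background[np.argsort(dist)[:k]]).mean()

rng = np.random.default_rng(0)
X = rng.multivariate_normal([0, 0], [[1.0, 0.95], [0.95, 1.0]], size=2000)
f = lambda Z: Z[:, 0] + Z[:, 1]              # toy model on two correlated features
x = np.array([2.0, 2.0])

print(off_manifold_value(f, x, {0}, X))      # ~2.0: ignores the correlation
print(on_manifold_value(f, x, {0}, X))       # ~3.8: respects the correlation
```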