Keyon Vafa
Potemkin Understanding in Large Language Models
Large language models (LLMs) are regularly evaluated using benchmark datasets. But what justifies making inferences about an LLM's capabilities based on its answers to a curated set of questions? This paper first introduces a formal framew…
Estimating Wage Disparities Using Foundation Models
The rise of foundation models marks a paradigm shift in machine learning: instead of training specialized models from scratch, foundation models are first trained on massive datasets before being adapted or fine-tuned to make predictions o…
LABOR-LLM: Language-Based Occupational Representations with Large Language Models
Vafa et al. (2024) introduced a transformer-based econometric model, CAREER, that predicts a worker's next job as a function of career history (an "occupation model"). CAREER was initially estimated ("pre-trained") using a large, unreprese…
Evaluating the World Model Implicit in a Generative Model
Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton.…
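As background for the kind of ground truth this setup assumes, the sketch below (illustrative Python, not the paper's code or its evaluation metrics) shows a toy deterministic finite automaton; a generative model whose sequence predictions are consistent with such an automaton could be said to have recovered its world model.

class DFA:
    """Minimal deterministic finite automaton."""

    def __init__(self, transitions, start, accepting):
        self.transitions = transitions  # (state, token) -> next state
        self.start = start
        self.accepting = accepting

    def run(self, tokens):
        # Return the final state, or None if a transition is undefined,
        # i.e., the sequence is impossible in this world.
        state = self.start
        for token in tokens:
            state = self.transitions.get((state, token))
            if state is None:
                return None
        return state

    def accepts(self, tokens):
        return self.run(tokens) in self.accepting


# Toy world: a binary string is valid iff it contains an even number of 1s.
parity = DFA(
    transitions={("even", "0"): "even", ("even", "1"): "odd",
                 ("odd", "0"): "odd", ("odd", "1"): "even"},
    start="even",
    accepting={"even"},
)
print(parity.accepts(list("101")))  # True  (two 1s)
print(parity.accepts(list("100")))  # False (one 1)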
Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function
What makes large language models (LLMs) impressive is also what makes them hard to evaluate: their diversity of uses. To evaluate these models, we must understand the purposes they will be used for. We consider a setting where these deploy…
Revisiting Topic-Guided Language Models
A recent line of work in natural language processing has aimed to combine language models and topic models. These topic-guided language models augment neural language models with topic models, unsupervised learning methods that can discove…
An Invariant Learning Characterization of Controlled Text Generation
Controlled generation refers to the problem of creating text that contains stylistic or semantic attributes of interest. Many approaches reduce this problem to training a predictor of the desired attribute. For example, researchers hoping …
CAREER: A Foundation Model for Labor Sequence Data
Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too …
Assessing the Effects of Friend-to-Friend Texting on Turnout in the 2018 US Midterm Elections
Recent mobile app technology lets people systematize the process of messaging their friends to urge them to vote. Prior to the most recent US midterm elections in 2018, the mobile app Outvote randomized an aspect of their system, hoping to…
Rationales for Sequential Predictions
Sequence models are a critical component of modern NLP systems, but their predictions are difficult to explain. We consider model explanations through rationales, subsets of context that can explain individual model predictions. We find seq…
Text-Based Ideal Points
Ideal point models analyze lawmakers' votes to quantify their political positions, or ideal points. But votes are not the only way to express a political position. Lawmakers also give speeches, release press statements, and post tweets. In…
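For context, one standard formulation of the vote-based ideal point model that this line of work starts from (the notation below is illustrative, not necessarily the paper's): lawmaker i has ideal point x_i, and bill j has a popularity term \alpha_j and a polarity term \eta_j.

\[
    v_{ij} \sim \mathrm{Bernoulli}\big(\sigma(\alpha_j + x_i \eta_j)\big),
    \qquad \sigma(z) = \frac{1}{1 + e^{-z}}.
\]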
Discrete Flows: Invertible Generative Models of Discrete Data
While normalizing flows have led to significant advances in modeling high-dimensional continuous distributions, their applicability to discrete distributions remains unknown. In this paper, we show that flows can in fact be extended to dis…
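For intuition, the change-of-variables identity behind flows on discrete spaces (stated generically here, not as this paper's exact development): a bijection f simply relabels outcomes, so probability mass transfers with no Jacobian term, unlike the continuous case.

\[
    p_Y(y) = p_Z\big(f^{-1}(y)\big) \quad \text{(discrete)},
    \qquad
    p_Y(y) = p_Z\big(f^{-1}(y)\big)\left|\det \frac{\partial f^{-1}(y)}{\partial y}\right| \quad \text{(continuous)}.
\]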
Training and Inference for Deep Gaussian Processes
An ideal model for regression is not only accurate, but also computationally efficient, easy to tune without overfitting, and able to provide certainty estimates. In this thesis, we explore deep Gaussian processes (deep GPs), a class of mo…
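As a point of reference, a deep GP is commonly defined as a composition of layers, each drawn from a Gaussian process (a generic definition, not specific to this thesis's constructions):

\[
    f(x) = f_L\big(f_{L-1}(\cdots f_1(x)\cdots)\big),
    \qquad f_\ell \sim \mathcal{GP}(0, k_\ell), \quad \ell = 1, \dots, L.
\]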
Replication Data for: Price Discrimination in The Princeton Review’s Online SAT Tutoring Service
This dataset accompanies a paper published on September 1, 2015, in Technology Science: http://techscience.org/a/2015090102/