Kevin Gimpel
Discriminative Feature-Rich Modeling for Syntax-Based Machine Translation
Fully automated, high-quality machine translation promises to revolutionize human communication. But as anyone who has used a machine translation system knows, we are not there yet. In this thesis, we address four areas in which we believe…
A baseline for detecting misclassified and out-of-distribution examples in neural networks
We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greate…
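The maximum-softmax-probability idea described in this abstract can be pictured in a few lines. The toy logits and the rejection threshold below are illustrative assumptions, not the paper's experimental setup:

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def max_softmax_score(logits):
    """Confidence score: the maximum softmax probability.
    Low scores flag likely misclassified or out-of-distribution inputs."""
    return max(softmax(logits))

# Toy logits: the first input is confidently classified, the second is
# near-uniform (a candidate error / out-of-distribution case).
confident = max_softmax_score([6.0, 0.5, 0.2])
uncertain = max_softmax_score([1.1, 1.0, 0.9])
threshold = 0.5  # hypothetical rejection threshold
flagged = uncertain < threshold
```

Ranking inputs by this score (rather than hard-thresholding) is what the detection evaluation measures.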
Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing
We present the structured average intersection-over-union ratio (STRUCT-IOU), a similarity metric between constituency parse trees motivated by the problem of evaluating speech parsers. STRUCT-IOU enables comparison between a constituency …
GEE! Grammar Error Explanation with Large Language Models
Grammatical error correction tools are effective at correcting grammatical errors in users' input sentences but do not provide users with natural language explanations about their errors. Such explanations are essential for helpin…
MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy
It has been widely observed that exact or approximate MAP (mode-seeking) decoding from natural language generation (NLG) models consistently leads to degenerate outputs (Holtzman et al., 2019; Stahlberg and Byrne, 2019). Prior work has att…
Audio-Visual Neural Syntax Acquisition
We study phrase structure induction from visually-grounded speech. The core idea is to first segment the speech waveform into sequences of word segments, and subsequently induce phrase structure using the inferred segment-level continuous …
The Benefits of Label-Description Training for Zero-Shot Text Classification
Pretrained language models have improved zero-shot text classification by allowing the transfer of semantic knowledge from the training data in order to classify among specific label sets in downstream tasks. We propose a simple way to fur…
Deep Clustering of Text Representations for Supervision-Free Probing of Syntax
We explore deep clustering of multilingual text representations for unsupervised model interpretation and induction of syntax. As these representations are high-dimensional, out-of-the-box methods like K-means do not work well. Thus, our a…
Chess as a Testbed for Language Model State Tracking
Transformer language models have made tremendous strides in natural language understanding tasks. However, the complexity of natural language makes it challenging to ascertain how accurately these models are tracking the world state underl…
"What makes a question inquisitive?" A Study on Type-Controlled Inquisitive Question Generation
We propose a type-controlled framework for inquisitive question generation. We annotate an inquisitive question dataset with question types, train question type classifiers, and finetune models for type-controlled question generation. Empi…
Baked-in State Probing
Neural language models have been analyzed for their linguistic and extra-linguistic knowledge via probing. Of particular interest has been the following question: how much can a language model trained only on form learn about meaning? Rece…
Paraphrastic Representations at Scale
We present a system that allows users to train their own state-of-the-art paraphrastic sentence representations in a variety of languages. We release trained models for English, Arabic, German, Spanish, French, Russian, Turkish, and Chines…
Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing
We present substructure distribution projection (SubDP), a technique that projects a distribution over structures in one domain to another, by projecting substructure distributions separately. Models for the target domain can then be train…
SummScreen: A Dataset for Abstractive Screenplay Summarization
We introduce SummScreen, a summarization dataset comprised of pairs of TV series transcripts and human-written recaps. The dataset provides a challenging testbed for abstractive summarization for several reasons. Plot details are often exp…
Reconsidering the Past: Optimizing Hidden States in Language Models
We present Hidden-State Optimization (HSO), a gradient-based method for improving the performance of transformer language models at inference time. Similar to dynamic evaluation (Krause et al., 2018), HSO computes the gradient of the log-p…
TVStoryGen: A Dataset for Generating Stories with Character Descriptions
We introduce TVStoryGen, a story generation dataset that requires generating detailed TV show episode recaps from a brief summary and a set of documents describing the characters involved. Unlike other story generation datasets, TVStoryGen…
TVRecap: A Dataset for Generating Stories with Character Descriptions
We introduce TVRecap, a story generation dataset that requires generating detailed TV show episode recaps from a brief summary and a set of documents describing the characters involved. Unlike other story generation datasets, TVRecap conta…
Learning Chess Blindfolded: Evaluating Language Models on State Tracking
Transformer language models have made tremendous strides in natural language understanding tasks. However, the complexity of natural language makes it challenging to ascertain how accurately these models are tracking the world state underl…
Substructure Substitution: Structured Data Augmentation for NLP
We study a family of data augmentation methods, substructure substitution (SUB2), for natural language processing (NLP) tasks. SUB2 generates new examples by substituting substructures (e.g., subtrees or subsequences) with ones with the sa…
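A toy sketch of the subsequence flavor of SUB2: swap a labeled span for another span carrying the same label, drawn from other examples. The data, helper names, and span-based setup here are illustrative assumptions:

```python
import random

def sub2_augment(example, pool, rng=random.Random(0)):
    """Create a new example by replacing one labeled span with a span
    of the same label taken from a pool of other examples.

    example: (tokens, spans), spans being (start, end, label) with end
    exclusive and spans non-overlapping.
    pool: list of (span_tokens, label) candidate replacements.
    """
    tokens, spans = example
    start, end, label = rng.choice(spans)
    candidates = [toks for toks, lab in pool if lab == label]
    replacement = rng.choice(candidates)
    new_tokens = tokens[:start] + replacement + tokens[end:]
    shift = len(replacement) - (end - start)
    new_spans = []
    for s, e, lab in spans:
        if (s, e) == (start, end):
            new_spans.append((start, start + len(replacement), lab))
        elif s >= end:  # spans after the edit move by the length change
            new_spans.append((s + shift, e + shift, lab))
        else:
            new_spans.append((s, e, lab))
    return new_tokens, new_spans

example = (["Kim", "visited", "Paris"], [(0, 1, "PER"), (2, 3, "LOC")])
pool = [(["New", "York"], "LOC"), (["Alex"], "PER")]
new_tokens, new_spans = sub2_augment(example, pool)
```

Because the replacement carries the same label, the augmented example keeps a valid labeling by construction, which is the property SUB2 relies on.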
Unsupervised Label Refinement Improves Dataless Text Classification
Dataless text classification is capable of classifying documents into previously unseen labels by assigning a score to any document paired with a label description. While promising, it crucially relies on accurate descriptions of the label …
On Generalization in Coreference Resolution
While coreference resolution is defined independently of dataset domain, most models for performing coreference resolution do not transfer well to unseen domains. We consolidate a set of 8 coreference resolution datasets targeting differen…
FlowPrior: Learning Expressive Priors for Latent Variable Sentence Models
Variational autoencoders (VAEs) are widely used for latent variable modeling of text. We focus on variations that learn expressive prior distributions over the latent variable. We find that existing training strategies are not effective fo…
WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections
Datasets for data-to-text generation typically focus either on multi-domain, single-sentence generation or on single-domain, long-form generation. In this work, we cast generating Wikipedia sections as a data-to-text generation task and cre…