Davis Liang
From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes
AI-generated clinical notes are increasingly used in healthcare, but evaluating their quality remains a challenge due to high subjectivity and limited scalability of expert review. Existing automated metrics often fail to align with real-w…
The Curious Language Model: Strategic Test-Time Information Acquisition
Decision-makers often possess insufficient information to render a confident decision. In these cases, the decision-maker can often undertake actions to acquire the necessary information about the problem at hand, e.g., by consulting knowl…
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Fine-tuning pre-trained language models (LMs) has become the de facto standard in many NLP tasks. Nevertheless, fine-tuned LMs are still prone to robustness issues, such as adversarial robustness and model calibration. Several perspectives…
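As a rough illustration of the underlying idea, adversarial perturbation in embedding space during fine-tuning, here is an FGSM-style sketch. The model callable returning logits, the epsilon value, and the clean-plus-perturbed objective are assumptions on my part; RoAST's selective parameter-update rule is not reproduced.

import torch

def roast_style_loss(model, embeds, labels, loss_fn, epsilon=1e-2):
    # Perturb input embeddings along the gradient sign of the task loss
    # (FGSM-style), then train on clean + perturbed views.
    embeds = embeds.detach().requires_grad_(True)
    clean_loss = loss_fn(model(embeds), labels)
    (grad,) = torch.autograd.grad(clean_loss, embeds, retain_graph=True)
    perturbed = (embeds + epsilon * grad.sign()).detach()
    adv_loss = loss_fn(model(perturbed), labels)
    return clean_loss + adv_loss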
Co-training and Co-distillation for Quality Improvement and Compression of Language Models
Knowledge Distillation (KD) compresses computationally expensive pre-trained language models (PLMs) by transferring their knowledge to smaller models, allowing their use in resource-constrained or real-time settings. However, most smaller …
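For context, a minimal sketch of the vanilla knowledge-distillation objective this line of work builds on. The temperature T, the mixing weight alpha, and the shared label space are illustrative assumptions; the paper's co-training/co-distillation framework is not reproduced here.

import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft-target term: KL between temperature-scaled teacher and student
    # distributions, scaled by T^2 as in standard KD.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target term: ordinary cross-entropy on the gold labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard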
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the e…
A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models
Distillation from Weak Teacher (DWT) is a method of transferring knowledge from a smaller, weaker teacher model to a larger student model to improve its performance. Previous studies have shown that DWT can be effective in the vision domai…
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models
Large multilingual language models typically rely on a single vocabulary shared across 100+ languages. As these models have increased in parameter count and depth, vocabulary size has remained largely unchanged. This vocabulary bot…
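As a loose illustration of the kind of intervention studied here (XLM-V scales the shared vocabulary to roughly one million tokens), this is how one might train a much larger SentencePiece vocabulary. The corpus path and all settings below are placeholders, not the paper's recipe.

import sentencepiece as spm

# Placeholder corpus path; XLM-V's actual training data and settings differ.
spm.SentencePieceTrainer.train(
    input="multilingual_corpus.txt",
    model_prefix="large_vocab",
    vocab_size=1_000_000,        # the 1M-token scale explored by XLM-V
    model_type="unigram",
    character_coverage=0.9995,
)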
Generating Hashtags for Short-form Videos with Guided Signals
Short-form video hashtag recommendation (SVHR) aims to recommend hashtags to content creators from videos and corresponding descriptions. Most prior studies regard SVHR as a classification or ranking problem and select hashtags from a set …
Query Rewriting for Effective Misinformation Discovery
We propose a novel system to help fact-checkers formulate search queries for known misinformation claims and effectively search across multiple social media platforms. We introduce an adaptable rewriting strategy, where editing actions for…
Attention-guided Generative Models for Extractive Question Answering
We propose a novel method for applying Transformer models to extractive question answering (QA) tasks. Recently, pretrained generative sequence-to-sequence (seq2seq) models have achieved great success in question answering. Contributing to…
Multiplicative Position-aware Transformer Models for Language Understanding
Transformer models, which leverage architectural improvements like self-attention, perform remarkably well on Natural Language Processing (NLP) tasks. The self-attention mechanism is position agnostic. In order to capture positional orderi…
Decoding and Diversity in Machine Translation
Neural Machine Translation (NMT) systems are typically evaluated using automated metrics that assess the agreement between generated translations and ground truth candidates. To improve systems with respect to these metrics, NLP researcher…
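To make the decoding-strategy contrast concrete, a small example using a public MT checkpoint (the model name is just an example, not one used in the paper): beam search tends to maximize metric scores, while sampling trades some of that score for diversity.

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

name = "Helsinki-NLP/opus-mt-en-de"          # example checkpoint only
tok = AutoTokenizer.from_pretrained(name)
mt = AutoModelForSeq2SeqLM.from_pretrained(name)
inputs = tok("The weather is nice today.", return_tensors="pt")

# Beam search: a single high-scoring hypothesis.
beam = mt.generate(**inputs, num_beams=5)
# Top-k sampling: several more diverse hypotheses.
sampled = mt.generate(**inputs, do_sample=True, top_k=50, num_return_sequences=3)

print(tok.batch_decode(beam, skip_special_tokens=True))
print(tok.batch_decode(sampled, skip_special_tokens=True))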
Improve Transformer Models with Better Relative Position Embeddings
Transformer architectures rely on explicit position encodings in order to preserve a notion of word order. In this paper, we argue that existing work does not fully utilize position information. For example, the initial proposal of a sinus…
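For orientation, a generic sketch of additive relative-position biasing in self-attention; the paper argues for richer interactions between queries, keys, and relative-position embeddings than this baseline, which is all that is shown here.

import torch

def attention_with_relative_bias(q, k, v, rel_bias):
    # q, k, v: (batch, heads, seq, dim); rel_bias: (heads, seq, seq).
    scores = torch.matmul(q, k.transpose(-2, -1)) / q.size(-1) ** 0.5
    scores = scores + rel_bias                 # additive relative-position term
    return torch.matmul(torch.softmax(scores, dim=-1), v)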
Embedding-based Zero-shot Retrieval through Query Generation
Passage retrieval addresses the problem of locating relevant passages, usually from a large corpus, given a query. In practice, lexical term-matching algorithms like BM25 are popular choices for retrieval owing to their efficiency. However…
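The retrieval step itself reduces to nearest-neighbor search over embeddings. The sketch below assumes hypothetical encode_query and encode_passages functions; the paper's contribution, training the encoders on synthetically generated queries, happens upstream of this step.

import torch

def retrieve(query, passages, encode_query, encode_passages, k=5):
    # Rank passages by dot-product similarity in a shared embedding space.
    q = encode_query(query)                    # (dim,)
    p = encode_passages(passages)              # (num_passages, dim)
    scores = p @ q
    top = torch.topk(scores, k=min(k, len(passages)))
    return [(passages[i], s.item()) for i, s in zip(top.indices, top.values)]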
TRANS-BLSTM: Transformer with Bidirectional LSTM for Language Understanding
Bidirectional Encoder Representations from Transformers (BERT) has recently achieved state-of-the-art performance on a broad range of NLP tasks including sentence classification, machine translation, and question answering. The BERT model …
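A minimal sketch of the architectural idea as suggested by the title and abstract: run a bidirectional LSTM over the transformer's hidden states before the task head. All sizes are illustrative, not the paper's configuration.

import torch.nn as nn

class TransBLSTMHead(nn.Module):
    def __init__(self, hidden=768, lstm_hidden=384, num_labels=2):
        super().__init__()
        # BiLSTM over the transformer's output sequence.
        self.blstm = nn.LSTM(hidden, lstm_hidden, batch_first=True,
                             bidirectional=True)
        self.classifier = nn.Linear(2 * lstm_hidden, num_labels)

    def forward(self, hidden_states):          # (batch, seq, hidden)
        out, _ = self.blstm(hidden_states)
        return self.classifier(out)            # per-token logits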
Improve Transformer Models with Better Relative Position Embeddings
The transformer model has demonstrated superior results on NLP tasks including machine translation and question answering. In this paper, we argue that the position information is not fully utilized in existing work. For example, the initi…
Pseudolikelihood Reranking with Masked Language Models
We rerank with scores from pretrained masked language models like BERT to improve ASR and NMT performance. These log-pseudolikelihood scores (LPLs) can outperform large, autoregressive language models (GPT-2) in out-of-the-box scoring. RoB…
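The scoring idea is easy to state in code: mask each position in turn and sum the masked LM's log-probability of the true token. A naive one-forward-pass-per-token sketch follows; the checkpoint name is just an example.

import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

def pseudo_log_likelihood(sentence, name="roberta-base"):
    tok = AutoTokenizer.from_pretrained(name)
    mlm = AutoModelForMaskedLM.from_pretrained(name).eval()
    ids = tok(sentence, return_tensors="pt")["input_ids"][0]
    total = 0.0
    with torch.no_grad():
        for i in range(1, len(ids) - 1):        # skip special tokens
            masked = ids.clone()
            masked[i] = tok.mask_token_id
            logits = mlm(masked.unsqueeze(0)).logits[0, i]
            total += torch.log_softmax(logits, dim=-1)[ids[i]].item()
    return total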
Learning Noise-Invariant Representations for Robust Speech Recognition
Despite rapid advances in speech recognition, current models remain brittle to superficial perturbations to their inputs. Small amounts of noise can destroy the performance of an otherwise state-of-the-art model. To harden models against b…
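One common way to harden a model along these lines is an invariance penalty between representations of clean and perturbed inputs, added to the task loss. This is a hedged sketch of that general recipe, not the paper's exact objective.

import torch.nn.functional as F

def invariance_penalty(encoder, clean_batch, noisy_batch, weight=1.0):
    # Encourage the encoder to map clean and noise-augmented inputs to
    # nearby representations; add this term to the usual ASR training loss.
    h_clean = encoder(clean_batch)
    h_noisy = encoder(noisy_batch)
    return weight * F.mse_loss(h_noisy, h_clean.detach())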
Deep Automated Multi-task Learning
Multi-task learning (MTL) has recently contributed to learning better representations in service of various NLP tasks. MTL aims at improving the performance of a primary task, by jointly training on a secondary task. This paper introduces …
Automated Multi-task Learning
Multi-task learning (MTL) has recently contributed to learning better representations in service of various natural language processing (NLP) tasks. MTL aims at improving the performance of a primary task by jointly training on a secondary task. This…