Jaime Carbonell
XLNet: Generalized Autoregressive Pretraining for Language Understanding
With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling. However, relying on corrupting th…
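For readers skimming the contrast, the objectives at issue can be written in standard notation (my reconstruction, not text from the paper): autoregressive models factorize left to right, BERT-style denoising reconstructs masked tokens from a corrupted input, and XLNet keeps the autoregressive factorization while taking an expectation over factorization orders.

```latex
% Autoregressive LM: left-to-right factorization
\max_\theta \sum_{t=1}^{T} \log p_\theta(x_t \mid x_{<t})

% BERT-style denoising: reconstruct masked positions (m_t = 1) from corrupted input \hat{x}
\max_\theta \sum_{t=1}^{T} m_t \log p_\theta(x_t \mid \hat{x})

% Permutation objective: expectation over factorization orders z of length T
\max_\theta \; \mathbb{E}_{z \sim \mathcal{Z}_T}\Big[ \sum_{t=1}^{T} \log p_\theta(x_{z_t} \mid x_{z_{<t}}) \Big]
```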
CAWET: Context-Aware Worst-Case Execution Time Estimation Using Transformers
This paper presents CAWET, a hybrid worst-case program timing estimation technique. CAWET identifies the longest execution path using static techniques, whereas the worst-case execution time (WCET) of basic blocks is predicted using an adv…
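As a purely illustrative picture of the hybrid estimate the abstract describes: a learned per-block predictor is combined with a statically identified longest path. The block contents, the stand-in heuristic, and the cycle counts below are placeholders, not CAWET's actual model.

```python
# Illustrative sketch: a stand-in predictor scores each basic block, and the
# statically identified longest path sums the per-block estimates.

def predict_block_wcet(block_tokens: list) -> float:
    """Stand-in for a Transformer predicting one basic block's WCET in cycles."""
    return 10.0 * len(block_tokens)  # placeholder heuristic, not the real model

def path_wcet(longest_path: list) -> float:
    """Combine per-block predictions along the statically chosen longest path."""
    return sum(predict_block_wcet(block) for block in longest_path)

longest_path = [["load", "add"], ["cmp", "jmp"], ["store"]]
print(f"estimated WCET: {path_wcet(longest_path):.0f} cycles")  # 50 cycles
```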
Language Technologies for Humanitarian Aid
Humanitarian aid missions, whether emergency famine relief, establishment of medical clinics, or missions in conjunction with peace-keeping operations, require on-demand communication with the indigenous population. If such operations take…
Document Representation and Query Expansion Models for Blog Recommendation
We explore several different document representation models and two query expansion models for the task of recommending blogs to a user in response to a query. Blog relevance ranking differs from traditional document ranking in ad-hoc infor…
StructSum: Summarization via Structured Representations
Abstractive text summarization aims at compressing the information of a long source document into a rephrased, condensed summary. Despite advances in modeling techniques, abstractive summarization models still suffer from several key challenges: (…
Efficient Meta Lifelong-Learning with Limited Memory
Current natural language processing models work well on a single task, yet they often fail to continuously learn new tasks without forgetting previous ones as they are re-trained throughout their lifetime, a challenge known as lifelong lea…
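A common ingredient in memory-limited lifelong learning is an episodic memory replayed alongside new tasks. The sketch below shows a generic fixed-capacity buffer with reservoir sampling; it illustrates the setting, not the paper's specific meta-learning algorithm.

```python
import random

class EpisodicMemory:
    """Generic fixed-budget replay buffer, not the paper's method."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.buffer = []
        self.seen = 0

    def write(self, example) -> None:
        """Reservoir sampling keeps a uniform sample under a fixed budget."""
        self.seen += 1
        if len(self.buffer) < self.capacity:
            self.buffer.append(example)
        else:
            i = random.randrange(self.seen)
            if i < self.capacity:
                self.buffer[i] = example

    def sample(self, k: int) -> list:
        """Examples replayed alongside the current task to reduce forgetting."""
        return random.sample(self.buffer, min(k, len(self.buffer)))
```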
Harnessing Code Switching to Transcend the Linguistic Barrier
Code mixing (or code switching) is a common phenomenon observed in social-media content generated by a linguistically diverse user-base. Studies show that in the Indian sub-continent, a substantial fraction of social media posts exhibit co…
Voice for the Voiceless: Active Sampling to Detect Comments Supporting the Rohingyas
The Rohingya refugee crisis is one of the biggest humanitarian crises of modern times with more than 700,000 Rohingyas rendered homeless according to the United Nations High Commissioner for Refugees. While it has received sustained press …
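Active sampling in this setting typically means asking annotators to label the comments the current model is least sure about. A minimal uncertainty-sampling sketch with made-up scores, without claiming it is the paper's sampling strategy:

```python
# Toy uncertainty sampling: pick unlabeled comments whose predicted probability
# of being "supportive" is closest to 0.5, i.e., where the model is least sure.

def least_confident(scores: dict, k: int = 2) -> list:
    """scores: comment id -> P(supportive). Return the k most uncertain ids."""
    return sorted(scores, key=lambda c: abs(scores[c] - 0.5))[:k]

scores = {"c1": 0.97, "c2": 0.51, "c3": 0.05, "c4": 0.44}
print(least_confident(scores))  # ['c2', 'c4']
```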
Improving Candidate Generation for Low-resource Cross-lingual Entity Linking
Cross-lingual entity linking (XEL) is the task of finding referents in a target-language knowledge base (KB) for mentions extracted from source-language texts. The first step of (X)EL is candidate generation, which retrieves a list of plau…
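A minimal sketch of the candidate generation step: score knowledge-base entity names against a mention by character n-gram (Jaccard) overlap. The mention, KB names, and scoring choice are toy assumptions, not the paper's method.

```python
# Toy candidate generator: rank KB entity names by character-trigram Jaccard
# similarity to the mention string.

def char_ngrams(s: str, n: int = 3) -> set:
    s = f"#{s.lower()}#"
    return {s[i:i + n] for i in range(len(s) - n + 1)}

def candidates(mention: str, kb_names: list, k: int = 5) -> list:
    m = char_ngrams(mention)
    def jaccard(name: str) -> float:
        g = char_ngrams(name)
        return len(m & g) / max(len(m | g), 1)
    return sorted(kb_names, key=jaccard, reverse=True)[:k]

print(candidates("Pittsburg", ["Pittsburgh", "Pittsfield", "Paris"], k=2))
```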
StructSum: Incorporating Latent and Explicit Sentence Dependencies for Single Document Summarization
Traditional preneural approaches to single document summarization relied on modeling the intermediate structure of a document before generating the summary. In contrast, the current state of the art neural summarization models do not prese…
Soft Gazetteers for Low-Resource Named Entity Recognition
Traditional named entity recognition models use gazetteers (lists of entities) as features to improve performance. Although modern neural network models do not require such hand-crafted features for strong performance, recent work has demo…
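The "soft" part can be pictured as replacing a binary gazetteer-match flag with continuous per-type scores derived from ranked entity candidates. The types, scores, and normalization below are illustrative assumptions:

```python
# Toy soft-gazetteer feature: each span gets a normalized score per entity type,
# aggregated from its retrieved (type, score) candidates.

TYPES = ["PER", "LOC", "ORG"]

def soft_gazetteer_features(candidates: list) -> list:
    """candidates: (entity_type, retrieval_score) pairs for one span."""
    total = sum(score for _, score in candidates) or 1.0
    return [
        sum(score for typ, score in candidates if typ == t) / total
        for t in TYPES
    ]

# A span whose candidates are mostly locations yields a LOC-weighted feature.
print(soft_gazetteer_features([("LOC", 0.8), ("ORG", 0.1)]))
```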
Mining Insights from Large-Scale Corpora Using Fine-Tuned Language Models
Mining insights from large volumes of social media text with minimal supervision is a highly challenging Natural Language Processing (NLP) task. While Language Models' (LMs) efficacy in several downstream tasks is well-studied, assessing t…
Hope Speech Detection: A Computational Analysis of the Voice of Peace
The recent Pulwama terror attack (February 14, 2019, Pulwama, Kashmir) triggered a chain of escalating events between India and Pakistan, adding another episode to their 70-year-old dispute over Kashmir. The present era of ubiquitous socia…
Optimizing Data Usage via Differentiable Rewards
To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model c…
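A common differentiable reward in this line of work is gradient alignment: a training example is worth more when its gradient agrees with the gradient on held-out data. A toy numeric sketch, where the vectors stand in for real per-example and dev-set gradients:

```python
import numpy as np

def alignment_reward(example_grad: np.ndarray, dev_grad: np.ndarray) -> float:
    """Cosine similarity between an example's gradient and the dev gradient."""
    num = float(example_grad @ dev_grad)
    den = float(np.linalg.norm(example_grad) * np.linalg.norm(dev_grad)) or 1.0
    return num / den

g_example = np.array([0.2, -0.1, 0.4])  # made-up per-example gradient
g_dev = np.array([0.1, -0.2, 0.5])      # made-up dev-set gradient
print(f"reward = {alignment_reward(g_example, g_dev):.3f}")
```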
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework
Learning multilingual representations of text has proven a successful method for many cross-lingual transfer learning tasks. There are two main paradigms for learning such representations: (1) alignment, which maps different independently …
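The alignment paradigm is often instantiated with the orthogonal Procrustes solution: rotate independently trained source embeddings onto target embeddings of known translation pairs. A self-contained sketch with synthetic embeddings:

```python
import numpy as np

def procrustes(X: np.ndarray, Y: np.ndarray) -> np.ndarray:
    """Orthogonal W minimizing ||XW - Y||_F, via SVD of X^T Y."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))                # "source" embeddings
W_true, _ = np.linalg.qr(rng.normal(size=(8, 8)))
Y = X @ W_true                               # "target" embeddings, a rotation away
W = procrustes(X, Y)
print(np.allclose(X @ W, Y, atol=1e-8))      # recovered the rotation
```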
Learning Rhyming Constraints using Structured Adversaries
Existing recurrent neural language models often fail to capture higher-level structure present in text: for example, rhyming patterns present in poetry. Much prior work on poetry generation uses manually defined constraints which are satis…
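For intuition about what a rhyme-aware discriminator has to encode, here is a deliberately crude orthographic rhyme check; real systems would work from pronunciations (e.g., a dictionary like CMUdict), and this is not the paper's adversary:

```python
# Crude rhyme heuristic: two words "rhyme" if their suffixes from the last
# vowel onward match. Orthographic only; pronunciation-based checks are better.

def rhyme_key(word: str) -> str:
    w = word.lower()
    idx = max((i for i, ch in enumerate(w) if ch in "aeiou"), default=0)
    return w[idx:]

print(rhyme_key("night") == rhyme_key("light"))  # True ("ight" == "ight")
```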
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
This paper presents the submission by the CMU-01 team to the SIGMORPHON 2019 task 2 of Morphological Analysis and Lemmatization in Context. This task requires us to produce the lemma and morpho-syntactic description of each token in a sequ…
Gradient-Based Inference for Networks with Output Constraints
Practitioners apply neural networks to increasingly complex problems in natural language processing, such as syntactic parsing and semantic role labeling that have rich output structures. Many such structured-prediction problems require de…
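The core move in gradient-based inference is to treat constraint violation as a differentiable penalty and descend on it at test time. The constraint below (two heads' probabilities should sum to at most 1) is invented purely for illustration:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def penalty(logits: np.ndarray) -> float:
    p = sigmoid(logits)
    return float(np.maximum(p.sum() - 1.0, 0.0) ** 2)  # violated if probs sum > 1

def grad_penalty(logits: np.ndarray) -> np.ndarray:
    p = sigmoid(logits)
    excess = max(p.sum() - 1.0, 0.0)
    return 2.0 * excess * p * (1.0 - p)  # chain rule through the sigmoid

logits = np.array([2.0, 1.5])  # both heads confident -> constraint violated
for _ in range(200):
    logits -= 0.5 * grad_penalty(logits)  # test-time descent on the violation
print(f"final penalty: {penalty(logits):.4f}")
```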
Domain Adaptation of Neural Machine Translation by Lexicon Induction
It has been previously noted that neural machine translation (NMT) is very sensitive to domain shift. In this paper, we argue that this is a dual effect of the highly lexicalized nature of NMT, resulting in failure for sentences with large…
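One way lexicon induction feeds domain adaptation is by building pseudo-parallel in-domain data through word-for-word translation. A toy sketch, with a made-up German-English lexicon rather than an induced one:

```python
# Toy pseudo-parallel data construction from a (hypothetical) induced lexicon.

induced_lexicon = {"der": "the", "patient": "patient", "hustet": "coughs"}

def word_for_word(sentence: str, lexicon: dict) -> str:
    """Translate token by token, passing unknown words through unchanged."""
    return " ".join(lexicon.get(w, w) for w in sentence.lower().split())

src = "Der Patient hustet"
pseudo_pair = (src, word_for_word(src, induced_lexicon))
print(pseudo_pair)  # ('Der Patient hustet', 'the patient coughs')
```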
Data-Driven Approach to Multiple-Source Domain Adaptation
A key problem in domain adaptation is determining what to transfer across different domains. We propose a data-driven method to represent these changes across multiple source domains and perform unsupervised domain adaptation. We assume th…
Low-Dimensional Density Ratio Estimation for Covariate Shift Correction
Covariate shift is a prevalent setting for supervised learning in the wild when the training and test data are drawn from different time periods, different but related domains, or via different sampling strategies. This paper addresses a t…
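Density ratios are commonly estimated with the classifier trick: train a probabilistic classifier to separate train from test samples and convert its probabilities into importance weights, and doing this in a low-dimensional projection is one way to keep the estimate stable. A sketch under those assumptions (the random projection stands in for whatever low-dimensional representation is actually used):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X_train = rng.normal(0.0, 1.0, size=(500, 20))
X_test = rng.normal(0.5, 1.0, size=(500, 20))   # shifted covariates

P = rng.normal(size=(20, 3)) / np.sqrt(20)      # stand-in low-dim projection
Z = np.vstack([X_train @ P, X_test @ P])
y = np.array([0] * 500 + [1] * 500)             # 0 = train, 1 = test

clf = LogisticRegression().fit(Z, y)
p_test = clf.predict_proba(X_train @ P)[:, 1]
weights = p_test / (1.0 - p_test)               # w(x) ~ p_test(x) / p_train(x)
print(weights[:5].round(2))                     # importance weights for training
```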
A Little Annotation does a Lot of Good: A Study in Bootstrapping Low-resource Named Entity Recognizers
Most state-of-the-art models for named entity recognition (NER) rely on the availability of large amounts of labeled data, making them challenging to extend to new, lower-resourced languages. However, there are now several proposed approac…
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
Transformers have the potential to learn longer-term dependencies, but are limited by a fixed-length context in the setting of language modeling. We propose a novel neural architecture, Transformer-XL, that enables learning dependency beyond …
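The recurrence mechanism can be pictured as a per-layer cache: when processing a segment, the model attends over the previous segment's cached states concatenated with the current ones, and the cache is carried forward without gradients. A toy sketch, where the "layer" is a stand-in rather than a real attention block:

```python
import numpy as np

def toy_layer(h: np.ndarray) -> np.ndarray:
    """Placeholder for attention + FFN; any state-mixing function works here."""
    return np.tanh(h)

def process_segment(segment, memory):
    """Attend over [cached previous segment ; current segment]; refresh cache."""
    context = segment if memory is None else np.concatenate([memory, segment])
    h = toy_layer(context)[-len(segment):]  # outputs for current positions only
    new_memory = h.copy()                   # detached cache for the next segment
    return h, new_memory

memory = None
for segment in np.split(np.random.randn(12, 4), 3):  # three segments of length 4
    h, memory = process_segment(segment, memory)
print(h.shape)  # (4, 4)
```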