CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation Open

Rui Zhao, Jinyu Li, Ruchao Fan, Matt Post · 2024

Models for streaming speech translation (ST) can achieve high accuracy and low latency if they're developed with vast amounts of paired audio in the source language and written text in the target language. Yet, these text labels for the ta…

PyMarian: Fast Neural Machine Translation and Evaluation in Python Open

Thamme Gowda, Roman Grundkiewicz, Elijah Rippeth, Matt Post, Marcin Junczys-Dowmunt · 2024

Computer science Chemistry

The deep learning language of choice these days is Python; measured by factors such as available libraries and technical support, it is hard to beat. At the same time, software written in lower-level programming languages like C++ retains a…

Recovering document annotations for sentence-level bitext Open

Rachel Wicks, Matt Post, Philipp Koehn · 2024

Computer science Psychology Philosophy

Data availability limits the scope of any given task. In machine translation, historical models were incapable of handling longer contexts, so the lack of document-level datasets was less noticeable. Now, despite the emergence of long-sequ…

Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies Open

Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post · 2024

Computer science Mathematics Psychology

Ten years ago a single metric, BLEU, governed progress in machine translation research. For better or worse, there is no such consensus today, and consequently it is difficult for researchers to develop and retain the kinds of heuristic in…

Improving Word Sense Disambiguation in Neural Machine Translation with Salient Document Context Open

Elijah Rippeth, Marine Carpuat, Kevin Duh, Matt Post · 2023

Computer science Philosophy Biology

Lexical ambiguity is a challenging and pervasive problem in machine translation (MT). We introduce a simple and scalable approach to resolve translation ambiguity by incorporating a small amount of extra-sentential context in neural MT. …

Identifying Context-Dependent Translations for Evaluation Set Production Open

Rachel Wicks, Matt Post · 2023

Computer science Biology

A major impediment to the transition to context-aware machine translation is the absence of good evaluation metrics and test sets. Sentences that require context to be translated correctly are rare in test sets, reducing the utility of sta…

SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window Open

Vikas Raunak, Tom Kocmi, Matt Post · 2023

Computer science Economics Philosophy

Reference-based metrics that operate at the sentence level typically outperform quality estimation metrics, which have access only to the source and system output. This is unsurprising, since references resolve ambiguities that may be pres…

SOTASTREAM: A Streaming Approach to Machine Translation Training Open

Matt Post, Thamme Gowda, Roman Grundkiewicz, Huda Khayrallah, Rohit Jain, et al. · 2023

Computer science Sociology Mathematics

Many machine translation toolkits make use of a data preparation step wherein raw data is transformed into a tensor format that can be used directly by the trainer. This preparation step is increasingly at odds with modern research and dev…

Do GPTs Produce Less Literal Translations? Open

Vikas Raunak, Arul Menezes, Matt Post, Hany Hassan Awadallah · 2023

Computer science Mathematics Engineering

Large Language Models (LLMs) such as GPT-3 have emerged as general-purpose language models capable of addressing many natural language generation or understanding tasks. On the task of Machine Translation (MT), multiple works have investig…

Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer Open

Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post · 2023

Computer science Chemistry Philosophy

We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations. We experiment with two different data settings with a variety of language and script coverage, demonstrating improved…

Escaping the sentence-level paradigm in machine translation Open

Matt Post, Marcin Junczys-Dowmunt · 2023

Computer science Engineering Chemistry

It is well-known that document context is vital for resolving a range of translation ambiguities, and in fact the document setting is the most natural setting for nearly all translation. It is therefore unfortunate that machine translation…

Two Decades of the ACL Anthology: Development, Impact, and Open Challenges Open

Marcel Bollmann, Nathan Schneider, Arne Köhn, Matt Post · 2023

Computer science Mathematics

The ACL Anthology is a prime resource for research papers within computational linguistics and natural language processing, while continuing to be an open-source and community-driven project. Since Gildea et al. (2018) reported on its stat…

Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer Open

Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post · 2023

Computer science Chemistry Philosophy

Pretrained multilingual translation models using either pixel or subword (bpe) representations trained on the many-to-one parallel TED-59 dataset, accompanying the EMNLP'23 paper "Multilingual Pixel Representations for Translation and Effe…

Identifying Context-Dependent Translations for Evaluation Set Production Open

Rachel Wicks, Matt Post · 2023

Computer science Biology

A major impediment to the transition to contextual machine translation is the absence of good evaluation metrics and test sets. Sentences that require context to be translated correctly are rare in test sets, reducing the utility of standa…

Evaluating Metrics for Document-context Evaluation in Machine Translation Open

Vikas Raunak, Tom Kocmi, Matt Post · 2023

Computer science Engineering Biology

We describe our submission of a new metric, SLIDE (Raunak et al., 2023), to the WMT 2023 metrics task. SLIDE is a reference-free quality-estimation metric that works by constructing a fixed sentence-length window over the documents in a te…

Do GPTs Produce Less Literal Translations? Open

Vikas Raunak, Arul Menezes, Matt Post, Hany Hassan · 2023

Computer science Mathematics Engineering

Large Language Models (LLMs) such as GPT-3 have emerged as general-purpose language models capable of addressing many natural language generation or understanding tasks. On the task of Machine Translation (MT), multiple works have investig…

SOTASTREAM: A Streaming Approach to Machine Translation Training Open

Matt Post, Thamme Gowda, Roman Grundkiewicz, Huda Khayrallah, Rohit Jain, et al. · 2023

Computer science Chemistry

Matt Post, Thamme Gowda, Roman Grundkiewicz, Huda Khayrallah, Rohit Jain, Marcin Junczys-Dowmunt. Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023). 2023.

Operationalizing Specifications, In Addition to Test Sets for Evaluating Constrained Generative Models Open

Vikas Raunak, Matt Post, Arul Menezes · 2022

Computer science Engineering Mathematics

In this work, we present some recommendations on the evaluation of state-of-the-art generative models for constrained generation tasks. The progress on generative models has been rapid in recent years. These large-scale models have had thr…

Additive Interventions Yield Robust Multi-Domain Machine Translation Models Open

Elijah Rippeth, Matt Post · 2022

Computer science Mathematics Chemistry

Additive interventions are a recently-proposed mechanism for controlling target-side attributes in neural machine translation. In contrast to tag-based approaches which manipulate the raw source sequence, interventions work by directly mod…

SALTED: A Framework for SAlient Long-Tail Translation Error Detection Open

Vikas Raunak, Matt Post, Arul Menezes · 2022

Computer science Chemistry Physics

Traditional machine translation (MT) metrics provide an average measure of translation quality that is insensitive to the long tail of behavioral problems in MT. Examples include translation of numbers, physical units, dropped content and …

Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Open

Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur · 2022

Computer science Engineering

Neural transducers have been widely used in automatic speech recognition (ASR). In this paper, we introduce them to streaming end-to-end speech translation (ST), which aims to convert audio signals to texts in other languages directly. Compa…

SALTED: A Framework for SAlient Long-tail Translation Error Detection Open

Vikas Raunak, Matt Post, Arul Menezes · 2022

Computer science Physics Chemistry

Traditional machine translation (MT) metrics provide an average measure of translation quality that is insensitive to the long tail of behavioral problems. Examples include translation of numbers, physical units, dropped content and halluc…

The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task Open

Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Christian Federmann, Philipp Koehn · 2021

Computer science Engineering Philosophy

This paper presents the JHU-Microsoft joint submission for WMT 2021 quality estimation shared task. We only participate in Task 2 (post-editing effort estimation) of the shared task, focusing on the target-side word-level quality estimatio…
