Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System Open

Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, et al. · 2023

End-to-end task-oriented dialogue (TOD) systems have achieved promising performance by leveraging sophisticated natural language understanding and natural language generation capabilities of pre-trained models. This work enables the TOD sy…

Leveraging Implicit Feedback from Deployment Data in Dialogue Open

Richard Yuanzhe Pang, Stephen Roller, Kyunghyun Cho, He He, Jason Weston · 2023

Computer science Psychology Geology

We study improving social conversational agents by learning from natural dialogue between users and a deployed model, without extra annotations. To implicitly measure the quality of a machine-generated utterance, we leverage signals like u…

A Theory on Adam Instability in Large-Scale Machine Learning Open

Igor Molybog, Peter J. Albert, Moya Chen, Zachary DeVito, David Esiobu, et al. · 2023

Computer science Mathematics Geography

We present a theory for the previously unexplained divergent behavior noticed in the training of large language models. We argue that the phenomenon is an artifact of the dominant optimization algorithm used for training, called Adam. We o…

Scaling Laws for Generative Mixed-Modal Language Models Open

Armen Aghajanyan, Lili Yu, Alexis Conneau, Wei-Ning Hsu, Karen Hambardzumyan, et al. · 2023

Computer science Mathematics Sociology

Generative language models define distributions over sequences of tokens that can represent essentially any combination of data modalities (e.g., any permutation of image tokens from VQ-VAEs, speech tokens from HuBERT, BPE tokens for langu…

Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System Open

Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, et al. · 2023

Computer science History Engineering

Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong. Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue. 2023.

Human-Level Play in the Game of Diplomacy by Combining Language Models with Strategic Reasoning Open

Anton Bakhtin, Noam Brown, Emily Dinan, Gabriele Farina, Colin Flaherty, et al. · 2022

Computer science Psychology Political science

Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce CICERO, the first AI age…

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage Open

Kurt Shuster, Jing Xu, Mojtaba Komeili, Da Young Ju, Eric M. Smith, et al. · 2022

Computer science Psychology History

We present BlenderBot 3, a 175B parameter dialogue model capable of open-domain conversation with access to the internet and a long-term memory, and having been trained on a large number of user defined tasks. We release both the model wei…

Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion Open

Kurt Shuster, Mojtaba Komeili, Leonard Adolphs, Stephen Roller, Arthur Szlam, et al. · 2022

Computer science Biology

Language models (LMs) have recently been shown to generate more factual responses by employing modularity (Zhou et al., 2021) in combination with retrieval (Adolphs et al., 2021). We extend the recent approach of Adolphs et al. (2021) to i…

Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents Open

Eric M. Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, et al. · 2022

Computer science Engineering

At the heart of improving conversational AI is the open problem of how to evaluate conversations. Issues with automatic metrics are well known (Liu et al., 2016, arXiv:1603.08023), with human evaluations still considered the gold standard.…

Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents Open

Eric M. Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, et al. · 2022

Computer science Engineering

At the heart of improving conversational AI is the open problem of how to evaluate conversations. Issues with automatic metrics are well known (Liu et al., 2016), with human evaluations still considered the gold standard. Unfortunately, ho…

Analysing Off-The-Shelf Options for Question Answering with Portuguese FAQs Open

Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, et al. · 2022

Computer science Engineering History

Following the current interest in developing automatic question answering systems, we analyse alternative approaches for finding suitable answers from a list of Frequently Asked Questions (FAQs), in Portuguese. These rely on different tech…

Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion Open

Kurt Shuster, Mojtaba Komeili, Leonard Adolphs, Stephen Roller, Arthur Szlam, et al. · 2022

Computer science Biology

Language models (LMs) have recently been shown to generate more factual responses by employing modularity (Zhou et al., 2021) in combination with retrieval (Adolphs et al., 2021). We extend the recent approach of Adolphs et al. (2021) to i…

Teaching Models new APIs: Domain-Agnostic Simulators for Task Oriented Dialogue Open

Moya Chen, Paul Crook, Stephen Roller · 2021

Computer science Mathematics Philosophy

We demonstrate that large language models are able to simulate Task Oriented Dialogues in novel domains, provided only with an API implementation and a list of goals. We show these simulations can formulate online, automatic metrics that c…

Hash Layers For Large Sparse Models Open

Stephen Roller, Sainbayar Sukhbaatar, Arthur Szlam, Jason Weston · 2021

Computer science Engineering Physics

We investigate the training of sparse layers that use different parameters for different inputs based on hashing in large Transformer models. Specifically, we modify the feedforward layer to hash to different sets of weights depending on t…

Staircase Attention for Recurrent Processing of Sequences Open

Da Young Ju, Stephen Roller, Sainbayar Sukhbaatar, Jason Weston · 2021

Computer science Engineering Biology

Attention mechanisms have become a standard tool for sequence modeling tasks, in particular by stacking self-attention layers over the entire input sequence as in the Transformer architecture. In this work we introduce a novel attention pr…

Not All Memories are Created Equal: Learning to Forget by Expiring Open

Sainbayar Sukhbaatar, Da Young Ju, Spencer Poff, Stephen Roller, Arthur Szlam, et al. · 2021

Computer science Psychology Physics

Attention mechanisms have shown promising results in sequence modeling tasks that require long-term memory. Recent work investigated mechanisms to reduce the computational cost of preserving and storing memories. However, not all content i…

Adding Chit-Chat to Enhance Task-Oriented Dialogues Open

Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, et al. · 2021

Computer science Engineering Philosophy

Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Lin…

Recipes for Building an Open-Domain Chatbot Open

Stephen Roller, Emily Dinan, Naman Goyal, Da Young Ju, Mary Williamson, et al. · 2021

Computer science Psychology Mathematics

Building open-domain chatbots is a challenging area for machine learning research. While prior work has shown that scaling neural models in the number of parameters and the size of the data they are trained on gives improved results, we sh…

Adding Chit-Chats to Enhance Task-Oriented Dialogues Open

Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, et al. · 2020

Computer science Economics

Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e.g., booking hotels), open-domain chatbots aim at making socially engaging conversati…

Adding Chit-Chat to Enhance Task-Oriented Dialogues Open

Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, et al. · 2020

Computer science Economics

Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e.g., booking hotels), open-domain chatbots aim at making socially engaging conversati…

Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions Open

Stephen Roller, Y-Lan Boureau, Jason Weston, Antoine Bordes, Emily Dinan, et al. · 2020

Computer science Engineering Mathematics

We present our view of what is necessary to build an engaging open-domain conversational agent: covering the qualities of such an agent, the pieces of the puzzle that have been built so far, and the gaping holes we have not filled yet. We …

The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents Open

Kurt Shuster, Da Young Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau, et al. · 2020

Computer science Psychology Mathematics

We introduce dodecaDialogue: a set of 12 tasks that measures if a conversational agent can communicate engagingly with personality and empathy, ask questions, answer questions by utilizing knowledge resources, discuss topics and situations…

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training Open

Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, et al. · 2020

Computer science Biology

Generative dialogue models currently suffer from a number of problems which standard maximum likelihood training does not address. They tend to produce generations that (i) rely too much on copying from the context, (ii) contain repetition…

Don't Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training Open

Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, et al. · 2019

Computer science Biology

Generative dialogue models currently suffer from a number of problems which standard maximum likelihood training does not address. They tend to produce generations that (i) rely too much on copying from the context, (ii) contain repetition…

The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents Open

Kurt Shuster, Da Young Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau, et al. · 2019

Computer science Psychology Mathematics

We introduce dodecaDialogue: a set of 12 tasks that measures if a conversational agent can communicate engagingly with personality and empathy, ask questions, answer questions by utilizing knowledge resources, discuss topics and situati…

ACUTE-EVAL: Improved Dialogue Evaluation with Optimized Questions and Multi-turn Comparisons Open

Margaret Li, Jason Weston, Stephen Roller · 2019

Computer science Psychology Political science

While dialogue remains an important end-goal of natural language research, the difficulty of evaluation is an oft-quoted reason why it remains troublesome to make real progress towards its solution. Evaluation difficulties are actually two…

Neural Text Generation with Unlikelihood Training Open

Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, et al. · 2019

Computer science Psychology Geography

Neural text generation is a key tool in natural language applications, but it is well known there are major problems at its core. In particular, standard likelihood training and decoding leads to dull and repetitive outputs. While some pos…

What makes a good conversation? How controllable attributes affect human judgments Open

Abigail See, Stephen Roller, Douwe Kiela, Jason Weston · 2019

Computer science Psychology Philosophy

A good conversation requires balance -- between simplicity and detail; staying on topic and changing it; asking questions and answering them. Although dialogue agents are commonly evaluated via human judgments of overall quality, the relat…

Inferring Concept Hierarchies from Text Corpora via Hyperbolic Embeddings Open

Matthew Le, Stephen Roller, Laetitia Papaxanthos, Douwe Kiela, Maximilian Nickel · 2019

Computer science Economics

We consider the task of inferring “is-a” relationships from large text corpora. For this purpose, we propose a new method combining hyperbolic embeddings and Hearst patterns. This approach allows us to set appropriate constraints for infer…

Stephen Roller