Joe Stacey
PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback
Programmable task-oriented dialogue (TOD) agents enable language models to follow structured dialogue policies, but their effectiveness hinges on accurate state tracking. We present PyTOD, an agent that generates executable code to track d…
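The abstract's core idea, tracking dialogue state with generated code plus execution feedback, can be pictured as a generate-execute-retry loop. The sketch below is a loose interpretation only; `DialogueState` and `generate_update` are invented names for this demo, not PyTOD's actual interface:

```python
# Illustrative sketch only: code-based dialogue state tracking with
# execution feedback. All names here are hypothetical, not PyTOD's API.

class DialogueState(dict):
    """Dialogue state as a plain slot -> value mapping."""

    def update_slot(self, slot: str, value: str) -> None:
        if not isinstance(slot, str):
            raise TypeError(f"slot must be a string, got {type(slot).__name__}")
        self[slot] = value


def generate_update(utterance: str, feedback: str | None) -> str:
    """Stand-in for a language model that emits one executable update.

    A real system would prompt an LLM with the utterance, the current
    state, and any error feedback from the previous failed attempt.
    """
    # Hard-coded for the demo: map one utterance to one state update.
    return 'state.update_slot("cuisine", "Italian")'


state = DialogueState()
feedback = None
for _ in range(3):  # retry loop driven by execution feedback
    code = generate_update("I'd like Italian food", feedback)
    try:
        exec(code, {"state": state})  # run the generated update
        break
    except Exception as exc:  # surface the error back to the generator
        feedback = f"{type(exc).__name__}: {exc}"

print(state)  # {'cuisine': 'Italian'}
```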
How to Improve the Robustness of Closed-Source Models on NLI
Closed-source Large Language Models (LLMs) have become increasingly popular, with impressive performance across a wide range of natural language tasks. These models can be fine-tuned to further improve performance, but this often results i…
LUCID: LLM-Generated Utterances for Complex and Interesting Dialogues
Spurred by recent advances in Large Language Models (LLMs), virtual assistants are poised to take a leap forward in terms of their dialogue capabilities. Yet a major bottleneck to achieving genuinely transformative task-oriented dialogue c…
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Knowledge distillation optimises a smaller student model to behave similarly to a larger teacher model, retaining some of the performance benefits. While this method can improve results on in-distribution examples, it does not necessarily …
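For context on the opening sentence: a standard distillation objective (in the style of Hinton et al.) blends a soft teacher-matching term with the usual hard-label loss. This is the generic recipe, not the domain-targeted augmentation method the paper adds on top of it:

```python
# Generic knowledge-distillation loss, shown for background only.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend a soft teacher-matching term with the hard-label loss."""
    # Soft targets: student matches the teacher's tempered distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2  # rescale gradients to match the hard term
    # Hard targets: standard cross-entropy on gold labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```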
Atomic Inference for NLI with Generated Facts as Atoms
With recent advances, neural models can achieve human-level performance on various natural language tasks. However, there are no guarantees that any explanations from these models are faithful, i.e. that they reflect the inner workings of …
When and Why Does Bias Mitigation Work?
Neural models have been shown to exploit shallow surface features to perform language understanding tasks, rather than learning the deeper language understanding and reasoning skills that practitioners desire. Previous work has developed d…
Supervising Model Attention with Human Explanations for Robust Natural Language Inference
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well they generalise to other unseen datasets. Existing de-biasing approaches focus on preventing the models fro…
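The general mechanism the title points to, supervising attention with human explanations, can be sketched as an auxiliary loss that pulls the model's attention distribution towards human-highlighted tokens. Which layer or head is supervised and the exact divergence are choices made in the paper; this toy uses a KL term and assumes every example has at least one highlighted token:

```python
# Hedged sketch of attention supervision with human explanations.
import torch
import torch.nn.functional as F

def attention_supervision_loss(attention, explanation_mask):
    """KL between model attention and a human-derived target distribution.

    attention:        (batch, seq_len) attention weights summing to 1.
    explanation_mask: (batch, seq_len) 1.0 where a human marked the token.
    """
    # Normalise the binary highlights into a target distribution.
    target = explanation_mask / explanation_mask.sum(dim=-1, keepdim=True)
    return F.kl_div(attention.clamp_min(1e-9).log(), target,
                    reduction="batchmean")

# Usage: total = task_loss + lambda_attn * attention_supervision_loss(a, m)
```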
Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models
Current Natural Language Inference (NLI) models achieve impressive results, sometimes outperforming humans when evaluated on in-distribution test sets. However, as these models are known to learn from annotation artefacts and dataset bias…
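One natural reading of span-level NLI with logical aggregation is that each hypothesis span receives its own entailment label, and simple logical rules combine them into a sentence-level decision. The rule set below is illustrative and may differ from the paper's exact formulation:

```python
# Hedged sketch: combining per-span NLI labels with simple logical rules.
def aggregate_span_labels(span_labels: list[str]) -> str:
    """Combine per-span NLI labels into one sentence-level label.

    The rule mirrors standard NLI semantics: one contradicted span makes
    the whole hypothesis contradicted; otherwise one unsupported span
    makes it neutral; entailment requires every span to be entailed.
    """
    if "contradiction" in span_labels:
        return "contradiction"
    if "neutral" in span_labels:
        return "neutral"
    return "entailment"

print(aggregate_span_labels(["entailment", "neutral", "entailment"]))  # neutral
```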
Natural Language Inference with a Human Touch: Using Human Explanations to Guide Model Attention.
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well the models generalise to other unseen datasets. While previous de-biasing approaches focus on preventing mo…
There is Strength in Numbers: Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training.
Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correlations between the natural language utterances and their respective entailment classes. These artefacts are exploited by neural networks eve…
Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training
Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correlations between the natural language utterances and their respective entailment classes. These artefacts are exploited by neural networks eve…
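A common construction behind adversarial training of this kind attaches hypothesis-only classifiers to the encoder through a gradient-reversal layer, so the encoder is penalised whenever the label is recoverable from the hypothesis alone. The module sizes and ensemble wiring below are illustrative assumptions, not the paper's exact architecture:

```python
# Hedged sketch: an ensemble of hypothesis-only adversaries behind a
# gradient-reversal layer. Shapes and modules are illustrative only.
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -grad_output  # flip gradients flowing into the encoder

hidden, n_classes, n_adversaries = 256, 3, 4
encoder = nn.GRU(300, hidden, batch_first=True)        # toy hypothesis encoder
adversaries = nn.ModuleList(
    nn.Linear(hidden, n_classes) for _ in range(n_adversaries)
)

hypothesis = torch.randn(8, 12, 300)                   # (batch, tokens, emb)
_, h = encoder(hypothesis)                             # final hidden state
reversed_h = GradReverse.apply(h.squeeze(0))
# Each adversary's loss is minimised w.r.t. its own weights but, through
# the reversal, maximised w.r.t. the encoder, discouraging the encoder
# from keeping label-predictive hypothesis-only artefacts.
adv_logits = [adv(reversed_h) for adv in adversaries]
```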