Explanipedia

Reference-free Evaluation Metrics for Text Generation: A Survey Open

Takumi Ito, Kees van Deemter, Jun Suzuki · 2025

Computer science

A number of automatic evaluation metrics have been proposed for natural language generation systems. The most common approach to automatic evaluation is the use of a reference-based metric that compares the model's output with gold-standar…

Human-annotated rationales and explainable text classification: a survey Open

Elize Herrewijnen, Dong Nguyen, Floris Bex, Kees van Deemter · 2024

Computer science Biology

Asking annotators to explain “why” they labeled an instance yields annotator rationales: natural language explanations that provide reasons for classifications. In this work, we survey the collection and use of annotator rationales. Human-…

Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases Open

Yuqi Liu, Guanyi Chen, Kees van Deemter · 2024

Computer science Mathematics Philosophy

Theoretical linguists have suggested that some languages (e.g., Chinese and Japanese) are "cooler" than other languages based on the observation that the intended meaning of phrases in these languages depends more on their contexts. As a r…

Intrinsic Task-based Evaluation for Referring Expression Generation Open

Guanyi Chen, Fahime Same, Kees van Deemter · 2024

Computer science Engineering

Recently, a human evaluation study of Referring Expression Generation (REG) models had an unexpected conclusion: on \textsc{webnlg}, Referring Expressions (REs) generated by the state-of-the-art neural models were not only indistinguishabl…

Textual Summarisation of Large Sets: Towards a General Approach Open

Kittipitch Kuptavanich, Ehud Reiter, Kees van Deemter, Advaith Siddharthan · 2024

Computer science

We are developing techniques to generate summary descriptions of sets of objects. In this paper, we present and evaluate a rule-based NLG technique for summarising sets of bibliographical references in academic papers. This extends our pre…

The Pitfalls of Defining Hallucination Open

Kees van Deemter · 2024

Psychology Computer science

Despite impressive advances in Natural Language Generation (NLG) and Large Language Models (LLMs), researchers are still unclear about important aspects of NLG evaluation. To substantiate this claim, I examine current classifications of ha…

The Pitfalls of Defining Hallucination Open

Kees van Deemter · 2024

Computer science

Despite impressive advances in Natural Language Generation (NLG) and Large Language Models (LLMs), researchers are still unclear about important aspects of NLG evaluation. To substantiate this claim, I examine current classifications of ha…

Interpreting vision and language generative models with semantic visual priors Open

Michele Cafagna, Lina M. Rojas-Barahona, Kees van Deemter, Albert Gatt · 2023

Computer science Psychology Political science

When applied to Image-to-text models, explainability methods have two challenges. First, they often provide token-by-token explanations namely, they compute a visual explanation for each token of the generated sequence. This makes explanat…

Varieties of specification: Redefining over- and under-specification Open

Guanyi Chen, Kees van Deemter · 2023

Computer science Philosophy

A long tradition of research in theoretical, experimental and computational pragmatics has investigated over-specification and under-specification in referring expressions. Along broadly Gricean lines, these studies compare the amount of i…

Models of reference production: How do they withstand the test of time? Open

Fahime Same, Guanyi Chen, Kees van Deemter · 2023

Computer science Engineering Economics

In recent years, many NLP studies have focused solely on performance improvement. In this work, we focus on the linguistic and scientific aspects of NLP. We use the task of generating referring expressions in context (REG-in-context) as a …

Is Shortest Always Best? The Role of Brevity in Logic-to-Text Generation Open

Eduardo Calò, Jordi Levy, Albert Gatt, Kees van Deemter · 2023

Computer science

Some applications of artificial intelligence make it desirable that logical formulae be converted computationally to comprehensible natural language sentences. As there are many logical equivalents to a given formula, finding the most suit…

Computational Modelling of Quantifier Use: Corpus, Models, and Evaluation Open

Guanyi Chen, Kees van Deemter · 2023

Computer science Psychology Economics

A prominent strand of work in formal semantics investigates the ways in which human languages quantify the elements of a set, as when we say All A are B, Few A are B, and so on. Building on a growing body of empirical studies that shed lig…

Does ChatGPT have Theory of Mind? Open

Bart Holterman, Kees van Deemter · 2023

Psychology Computer science Philosophy

Theory of Mind (ToM) is the ability to understand human thinking and decision-making, an ability that plays a crucial role in social interaction between people, including linguistic communication. This paper investigates to what extent rec…

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP Open

Anja Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, José M. Alonso , et al. · 2023

Computer science Medicine Biology

We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which…

Interpreting Vision and Language Generative Models with Semantic Visual Priors Open

Michele Cafagna, Lina M. Rojas-Barahona, Kees van Deemter, Albert Gatt · 2023

Computer science Political science Biology

When applied to Image-to-text models, interpretability methods often provide token-by-token explanations namely, they compute a visual explanation for each token of the generated sequence. Those explanations are expensive to compute and un…

HL Dataset: Visually-grounded Description of Scenes, Actions and Rationales Open

Michele Cafagna, Kees van Deemter, Albert Gatt · 2023

Computer science Economics Philosophy

Current captioning datasets focus on object-centric captions, describing the visible objects in the image, e.g. "people eating food in a park". Although these datasets are useful to evaluate the ability of Vision & Language models to recog…

HL Dataset: Visually-grounded Description of Scenes, Actions and Rationales Open

Michele Cafagna, Kees van Deemter, Albert Gatt · 2023

Computer science Economics Philosophy

Current captioning datasets focus on object-centric captions, describing the visible objects in the image, often ending up stating the obvious (for humans), e.g. "people eating food in a park". Although these datasets are useful to evaluat…

Is Shortest Always Best? The Role of Brevity in Logic-to-Text Generation Open

Eduardo Calò, Jordi Levy, Albert Gatt, Kees van Deemter · 2023

Computer science Mathematics Physics

Some applications of artificial intelligence make it desirable that logical formulae be converted computationally to comprehensible natural language sentences. As there are many logical equivalents to a given formula, finding the most suit…

Models of reference production: How do they withstand the test of time? Open

Fahime Same, Guanyi Chen, Kees van Deemter · 2023

Computer science Engineering Biology

In recent years, many NLP studies have focused solely on performance improvement. In this work, we focus on the linguistic and scientific aspects of NLP. We use the task of generating referring expressions in context (REG-in-context) as a …

Dimensions of Explanatory Value in NLP Models Open

Kees van Deemter · 2023

Computer science Philosophy

Performance on a dataset is often regarded as the key criterion for assessing NLP models. I argue for a broader perspective, which emphasizes scientific explanation. I draw on a long tradition in the philosophy of science, and on the Bayes…

Understanding Cross-modal Interactions in V&L Models that Generate Scene Descriptions Open

Michele Cafagna, Kees van Deemter, Albert Gatt · 2022

Computer science Materials science

Image captioning models tend to describe images in an object-centric way, emphasising visible objects. But image descriptions can also abstract away from objects and describe the type of scene depicted. In this paper, we explore the potent…

Enhancing and Evaluating the Grammatical Framework Approach to Logic-to-Text Generation Open

Eduardo Calò, Elze van der Werf, Albert Gatt, Kees van Deemter · 2022

Computer science

Logic-to-text generation is an important yet underrepresented area of natural language generation (NLG). In particular, most previous works on this topic lack sound evaluation. We address this limitation by building and evaluating a system…

Neural referential form selection: Generalisability and interpretability Open

Guanyi Chen, Fahime Same, Kees van Deemter · 2022

Computer science Economics Psychology

In recent years, a range of Neural Referring Expression Generation (REG) systems have been built and they have often achieved encouraging results. However, these models are often thought to lack transparency and generality. Firstly, it is …

Understanding Cross-modal Interactions in V&L Models that Generate Scene Descriptions Open

Michele Cafagna, Kees van Deemter, Albert Gatt · 2022

Computer science Psychology Engineering

Image captioning models tend to describe images in an object-centric way, emphasising visible objects. But image descriptions can also abstract away from objects and describe the type of scene depicted. In this paper, we explore the potent…

Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset Open

Guanyi Chen, Fahime Same, Kees van Deemter · 2022

Computer science Mathematics Geography

Previous work on Neural Referring Expression Generation (REG) all uses WebNLG, an English dataset that has been shown to reflect a very limited range of referring expression (RE) use. To tackle this issue, we build a dataset based on the O…

Understanding the Use of Quantifiers in Mandarin Open

Guanyi Chen, Kees van Deemter · 2022

Computer science Psychology Philosophy

We introduce a corpus of short texts in Mandarin, in which quantified expressions figure prominently. We illustrate the significance of the corpus by examining the hypothesis (known as Huang's "coolness" hypothesis) that speakers of East A…

The Role of Explanatory Value in Natural Language Processing Open

Kees van Deemter · 2022

Computer science History Philosophy

A key aim of science is explanation, yet the idea of explaining language phenomena has taken a backseat in mainstream Natural Language Processing (NLP) and many other areas of Artificial Intelligence. I argue that explanation of linguistic…

Semeval-2022 Task 1: CODWOE -- Comparing Dictionaries and Word Embeddings Open

Timothee Mickus, Kees van Deemter, Laurence Mathieu, Denis Paperno · 2022

Computer science Philosophy Physics

Word embeddings have advanced the state of the art in NLP across numerous tasks. Understanding the contents of dense neural representations is of utmost interest to the computational semantics community. We propose to focus on relating the…

Evaluating Automatic Difficulty Estimation of Logic Formalization Exercises Open

Alexandra Mayn, Kees van Deemter · 2022

Computer science Political science Economics

Teaching logic effectively requires an understanding of the factors which cause logic students to struggle. Formalization exercises, which require the student to produce a formula corresponding to the natural language sentence, are a good …

Kees van Deemter YOU? Author Swipe