Huiyuan Lai
Multidimensional Consistency Improves Reasoning in Language Models
While large language models (LLMs) have proved able to address some complex reasoning tasks, we also know that they are highly sensitive to input variation, which can lead to different solution paths and final answers. Answer consistency a…
Multi-perspective Alignment for Increasing Naturalness in Neural Machine Translation
Neural machine translation (NMT) systems amplify lexical biases present in their training data, leading to artificially impoverished language in output translations. These language-level characteristics render automatic translations differ…
ContraMAE: Contrastive alignment masked autoencoder framework for cancer survival prediction
With the rapid advancement in multimodal fusion technology, the integration of pathological images with genomics data has achieved promising results in cancer survival prediction. However, most existing multimodal models are not pre-traine…
Towards Tailored Recovery of Lexical Diversity in Literary Machine Translation
Machine translations are found to be lexically poorer than human translations. The loss of lexical diversity through MT poses an issue in the automatic translation of literature, where it matters not only what is written, but also how it i…
Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models
Automatic methods for generating and gathering linguistic data have proven effective for fine-tuning Language Models (LMs) in languages less resourced than English. Still, while there has been emphasis on data quantity, less attention has …
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Large language models (LLMs) with Chain-of-thought (CoT) have recently emerged as a powerful technique for eliciting reasoning to improve various downstream tasks. As most research mainly focuses on English, with few explorations in a mult…
A Survey on Automatic Generation of Figurative Language: From Rule-based Systems to Large Language Models
Figurative language generation (FLG) is the task of reformulating a given text to include a desired figure of speech, such as hyperbole, simile, and several others, while still being faithful to the original context. This is a fundamen…
Neural Text Rewriting: Style Transfer, Figurative Language, and Beyond
Neural networks have yielded great breakthroughs in NLP in recent years, but the vast majority of research has focused on literal language, while modelling text attributes, or style, has received less attention. In this thesis, we focus on …
A text style transfer system for reducing the physician–patient expertise gap: An analysis with automatic and human evaluations
Responsibility Perspective Transfer for Italian Femicide News
Different ways of linguistically expressing the same real-world event can lead to different perceptions of what happened. Previous work has shown that different descriptions of gender-based violence (GBV) influence the reader's perception …
Multilingual Multi-Figurative Language Detection
Figures of speech help people express abstract concepts and evoke stronger emotions than literal expressions, thereby making texts more creative and engaging. Due to its pervasive and fundamental character, figurative language understandin…
Pre-Trained Language-Meaning Models for Multilingual Parsing and Generation
Pre-trained language models (PLMs) have achieved great success in NLP and have recently been used for tasks in computational semantics. However, these tasks do not fully benefit from PLMs since meaning representations are not explicitly in…
Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which…
Multidimensional Evaluation for Text Style Transfer Using ChatGPT
We investigate the potential of ChatGPT as a multidimensional evaluator for the task of Text Style Transfer, alongside, and in comparison to, existing automatic metrics as well as human judgements. We focus on a zero-shot setting, i…
Multi-Figurative Language Generation
Figurative language generation is the task of reformulating a given text in the desired figure of speech while still being faithful to the original context. We take the first step towards multi-figurative language modelling by providing a …
Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer
Although text style transfer has witnessed rapid development in recent years, there is as yet no established standard for evaluation, which is performed using several automatic metrics, lacking the possibility of always resorting to human …
Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer
We exploit the pre-trained seq2seq model mBART for multilingual text style transfer. Using machine translated data as well as gold aligned English sentences yields state-of-the-art results in the three target languages we consider. Besides…
Generic resources are what you need: Style transfer tasks without task-specific parallel training data
Style transfer aims to rewrite a source text in a different target style while preserving its content. We propose a novel approach to this task that leverages generic resources, and without using any task-specific parallel (source-target) …
Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer
Scarcity of parallel data causes formality style transfer models to have scarce success in preserving content. We show that fine-tuning pre-trained language (GPT-2) and sequence-to-sequence (BART) models boosts content preservation, and th…
On the interaction of automatic evaluation and task framing in headline style transfer
An ongoing debate in the NLG community concerns the best way to evaluate systems, with human evaluation often being considered the most reliable method, compared to corpus-based metrics. However, tasks involving subtle textual differences,…