Shuzhou Yuan
Beyond Over-Refusal: Scenario-Based Diagnostics and Post-Hoc Mitigation for Exaggerated Refusals in LLMs
Large language models (LLMs) frequently produce false refusals, declining benign requests that contain terms resembling unsafe queries. We address this challenge by introducing two comprehensive benchmarks: the Exaggerated Safety Benchmark…
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
Detoxification, the task of rewriting harmful language into non-toxic text, has become increasingly important amid the growing prevalence of toxic content online. However, high-quality parallel datasets for detoxification, especially for h…
Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence
Wake vortices, strong coherent air turbulence created by aircraft, pose a significant risk to aviation safety and therefore require accurate and reliable detection methods. In this paper, we present an advanced, explainable machine lea…
Can Hallucinations Help? Boosting LLMs for Drug Discovery
Hallucinations in large language models (LLMs), plausible but factually inaccurate text, are often viewed as undesirable. However, recent work suggests that such outputs may hold creative potential. In this paper, we investigate whether ha…
Graph-Guided Textual Explanation Generation Framework
Natural language explanations (NLEs) are commonly used to provide plausible free-text explanations of a model's reasoning about its predictions. However, recent work has questioned their faithfulness, as they may not accurately reflect the…
GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism
Pretrained Language Models (PLMs) benefit from external knowledge stored in graph structures for various downstream tasks. However, bridging the modality gap between graph structures and text remains a significant challenge. Traditional me…
Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models
Probing LLMs' multilingual knowledge of linguistic structure, a task often framed as sequence labeling, is hampered by the difficulty of maintaining output templates under current text-to-text prompting strategies. To address this, we introduce a de…
Why Lift so Heavy? Slimming Large Language Models by Cutting Off the Layers
Large Language Models (LLMs) possess outstanding capabilities in addressing various natural language processing (NLP) tasks. However, the sheer size of these models poses challenges in terms of storage, training and inference due to the in…
GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network
Large Language Models (LLMs) exhibit strong In-Context Learning (ICL) capabilities when prompts with demonstrations are used. However, fine-tuning remains crucial to further enhance their adaptability. Prompt-based fine-tuning proves…
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Prompt-based methods have been successfully applied to multilingual pretrained language models for zero-shot cross-lingual understanding. However, most previous studies primarily focused on sentence-level classification tasks, and only a f…
Evaluating Generative Models for Graph-to-Text Generation
Large language models (LLMs) have been widely employed for graph-to-text generation tasks. However, the process of finetuning LLMs requires significant training resources and annotation work. In this paper, we explore the capability of gen…
Biases in Scholarly Recommender Systems: Impact, Prevalence, and Mitigation
With the remarkable increase in the number of scientific entities such as publications, researchers, and scientific topics, and the associated information overload in science, academic recommender systems have become increasingly important…
Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing
Research to tackle hate speech plaguing online media has made strides in providing solutions, analyzing bias, and curating data. A challenging problem is the ambiguity between hate speech and offensive language, causing low performance both ove…