Explanipedia

Temporal Robustness in Hate Speech Detection: Updating German Classifiers with Advanced AI Infrastructures Open

Michael P. Hoffmann, Jan Fillies, Jophin John, Antonis Maronikolakis, Ajay Navilarekal , et al. · 2025

Over the past two decades, hate speech on social media has surged, causing significant harm and threatening democracies. Initially, research focused on English hate speech, but recent years have seen the development of non-English datasets…

A Federated Approach to Few-Shot Hate Speech Detection for Marginalized Communities Open

Haotian Ye, Axel Wisiorek, Antonis Maronikolakis, Özge Alaçam, Hinrich Schütze · 2025

A Federated Approach to Few-Shot Hate Speech Detection for Marginalized Communities Open

Haotian Ye, Axel Wisiorek, Antonis Maronikolakis, Özge Alaçam, Hinrich Schütze · 2024

Hate speech online remains an understudied issue for marginalized communities, particularly in the Global South, which includes developing societies with increasing internet penetration. In this paper, we aim to provide marginalized commun…

What should I wear to a party in a Greek taverna? Evaluation for Conversational Agents in the Fashion Domain Open

Antonis Maronikolakis, Ana Peleteiro Ramallo, Weiwei Cheng, Thomas Kober · 2024

Large language models (LLMs) are poised to revolutionize the domain of online fashion retail, enhancing customer experience and discovery of fashion online. LLM-powered conversational agents introduce a new way of discovery by directly int…

Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models Open

Victor Steinborn, Antonis Maronikolakis, Hinrich Schütze · 2023

In efforts to keep up with the rapid progress and use of large language models, gender bias research is becoming more prevalent in NLP. Non-English bias research, however, is still in its infancy with most work focusing on English. In our …

Sociocultural knowledge is needed for selection of shots in hate speech detection tasks Open

Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze · 2023

We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models. We demonstrate how our lexicon can be used to interpret model pr…

Ethical scaling for content moderation: Extreme speech and the (in)significance of artificial intelligence Open

Sahana Udupa, Antonis Maronikolakis, Axel Wisiorek · 2023

In this article, we present new empirical evidence to demonstrate the severe limitations of existing machine learning content moderation methods to keep pace with, let alone stay ahead of, hateful language online. Building on the collabora…

This joke is [MASK]: Recognizing Humor and Offense with Prompting Open

Junze Li, Mengjie Zhao, Yubo Xie, Antonis Maronikolakis, Pearl Pu , et al. · 2022

Humor is a magnetic component in everyday human interactions and communications. Computationally modeling humor enables NLP systems to entertain and engage with users. We investigate the effectiveness of prompting, a new transfer learning …

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes Open

Antonis Maronikolakis, Philip Baader, Hinrich Schütze · 2022

To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis. When it comes to analysis of bias, previous work has focused predominantly on race. In our work, we further investigate bias in hate…

Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP) Open

Antonis Maronikolakis, Philip Baader, Hinrich Schütze, Margaret Mitchell, Simone Wu , et al. · 2022

Warning: This work contains strong and offensive language, sometimes uncensored.To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis.When it comes to analysis of bias, previous work has …

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes Open

Antonis Maronikolakis, Philip Baader, Hinrich Schütze · 2022

To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis. When it comes to analysis of bias, previous work has focused predominantly on race. In our work, we further investigate bias in hate…

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments Open

Antonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa , et al. · 2022

Building on current work on multilingual hate speech (e.g., Ousidhoum et al. (2019)) and hate speech reduction (e.g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20,297 social media passages from Brazi…

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments Open

Antonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa , et al. · 2022

Building on current work on multilingual hate speech (e.g., Ousidhoum et al. (2019)) and hate speech reduction (e.g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20,297 social media passages from Brazi…

Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing Open

Shuzhou Yuan, Antonis Maronikolakis, Hinrich Schütze · 2022

Research to tackle hate speech plaguing online media has made strides in providing solutions, analyzing bias and curating data. A challenging problem is ambiguity between hate speech and offensive language, causing low performance both ove…

Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages Open

Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze · 2021

The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements. Typically, subword tokenization algorithms such as byte pair encoding and WordPiece are us…

Identifying Automatically Generated Headlines using Transformers Open

Antonis Maronikolakis, Hinrich Schütze, Mark Stevenson · 2021

False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible. In the not so di…

Artificial Intelligence, Extreme Speech and the Challenges of Online Content Moderation Open

Sahana Udupa, Elonnai Hickok, Antonis Maronikolakis, Hinrich Schuetze, Laura Csuka , et al. · 2021

Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda Open

Antonis Maronikolakis, Hinrich Schütze, Mark Stevenson, T. B. Brown, Benjamin F. Mann , et al. · 2021

Welcome to the fourth edition of the Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda.This is the second time we are running the workshop virtually, due to the COVID-19 pandemic.The pandemic has had a profou…

Wine is not v i n. On the Compatibility of Tokenizations across Languages Open

Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze · 2021

The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements. Typically, subword tokenization algorithms such as byte pair encoding and WordPiece are us…

BERT Cannot Align Characters Open

Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze · 2021

In previous work, it has been shown that BERT can adequately align cross-lingual sentences on the word level. Here we investigate whether BERT can also operate as a char-level aligner. The languages examined are English, Fake-English, Germ…

Transformers Are Better Than Humans at Identifying Generated Text. Open

Antonis Maronikolakis, Mark Stevenson, Hinrich Schütze · 2020

Fake information spread via the internet and social media influences public opinion and user activity. Generative models enable fake content to be generated faster and more cheaply than had previously been possible. This paper examines the…

Analyzing Political Parody in Social Media Open

Antonis Maronikolakis, Danae Sánchez Villegas, Daniel Preoțiuc-Pietro, Νικόλαος Αλέτρας · 2020

Parody is a figurative device used to imitate an entity for comedic or critical purposes and represents a widespread phenomenon in social media through many popular parody accounts. In this paper, we present the first computational study o…

Antonis Maronikolakis YOU? Author Swipe