Nils Rethmeier
Understanding and Analyzing Model Robustness and Knowledge-Transfer in Multilingual Neural Machine Translation using TX-Ray
Neural networks have demonstrated significant advancements in Neural Machine Translation (NMT) compared to conventional phrase-based approaches. However, Multilingual Neural Machine Translation (MNMT) in extremely low-resource settings rem…
VendorLink: An NLP approach for Identifying & Linking Vendor Migrants & Potential Aliases on Darknet Markets
The anonymity on the Darknet allows vendors to stay undetected by using multiple vendor aliases or frequently migrating between markets. Consequently, illegal markets and their connections are challenging to uncover on the Darknet. To iden…
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned, and Perspectives
Modern natural language processing (NLP) methods employ self-supervised pretraining objectives such as masked language modeling to boost the performance of various downstream tasks. These pretraining methods are frequently extended with re…
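The primer's subject, self-supervised and contrastive pretraining, can be illustrated with a minimal in-batch InfoNCE-style loss. This is a hedged sketch for orientation only, not code from the paper; the function and variable names are chosen here for illustration.

    import torch
    import torch.nn.functional as F

    def info_nce_loss(anchor, positive, temperature=0.1):
        # anchor, positive: (batch, dim) embeddings of two views of the same text;
        # the other items in the batch serve as in-batch negatives.
        anchor = F.normalize(anchor, dim=-1)
        positive = F.normalize(positive, dim=-1)
        logits = anchor @ positive.t() / temperature      # (batch, batch) similarities
        targets = torch.arange(anchor.size(0), device=anchor.device)
        return F.cross_entropy(logits, targets)

    # Toy usage with random tensors standing in for encoder outputs.
    a, p = torch.randn(8, 128), torch.randn(8, 128)
    loss = info_nce_loss(a, p)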
Long-Tail Zero and Few-Shot Learning via Contrastive Pretraining on and for Small Data
Preserving long-tail, minority information during model compression has been linked to algorithmic fairness considerations. However, this assumes that large models capture long-tail information and smaller ones do not, which raises two que…
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings
Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics. P…
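The abstract points to contrastive objectives whose positives and negatives come from citation-embedding neighborhoods. As a hedged sketch of that general setup (illustrative only, not the authors' released implementation; neighbor sampling is assumed to happen beforehand), a standard triplet margin loss over document embeddings would look like:

    import torch
    import torch.nn.functional as F

    def triplet_loss(query, positive, negative, margin=1.0):
        # query/positive/negative: (batch, dim) document embeddings, where positives
        # would be drawn from nearby papers and negatives from farther papers in a
        # citation-embedding space (assumed to be sampled elsewhere).
        d_pos = F.pairwise_distance(query, positive)
        d_neg = F.pairwise_distance(query, negative)
        return F.relu(d_pos - d_neg + margin).mean()

    q, p, n = torch.randn(4, 256), torch.randn(4, 256), torch.randn(4, 256)
    loss = triplet_loss(q, p, n)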
Data-Efficient Pretraining via Contrastive Self-Supervision
For natural language processing 'text-to-text' tasks, the prevailing approaches heavily rely on pretraining large self-supervised models on increasingly larger 'task-external' data. Transfer learning from high-resource pretraining works we…
EffiCare: Better Prognostic Models via Resource-Efficient Health Embeddings
Recent medical prognostic models adapted from high data-resource fields like language processing have quickly grown in complexity and size. However, since medical data typically constitute low data-resource settings, performances on tasks …
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP
While state-of-the-art NLP explainability (XAI) methods focus on explaining per-sample decisions in supervised end or probing tasks, this is insufficient to explain and quantify model knowledge transfer during (un-)supervised training. Thu…
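The underlying idea, as far as the title and abstract state it, is to characterize what individual neurons respond to and to compare those characterizations across training stages. A hedged sketch of one way to do this (all names here are illustrative assumptions, not the paper's code): summarize each neuron by the distribution of input tokens for which it is the maximally activated unit, then measure how that distribution shifts, e.g. with a Hellinger distance.

    import torch
    from collections import Counter

    def neuron_token_preference(activations, tokens, neuron):
        # activations: (num_tokens, num_neurons) hidden activations over a corpus;
        # tokens: the corresponding input tokens. Returns, for one neuron, the
        # distribution over tokens on which it was the maximally activated unit.
        winners = activations.argmax(dim=1)
        counts = Counter(t for t, w in zip(tokens, winners.tolist()) if w == neuron)
        total = sum(counts.values()) or 1
        return {t: c / total for t, c in counts.items()}

    def hellinger(p, q):
        # Hellinger distance between two token-preference distributions (dicts).
        keys = set(p) | set(q)
        return (0.5 * sum((p.get(k, 0.0) ** 0.5 - q.get(k, 0.0) ** 0.5) ** 2
                          for k in keys)) ** 0.5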
MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding
Word embeddings have undoubtedly revolutionized NLP. However, pre-trained embeddings do not always work for a specific task (or set of tasks), particularly in limited resource setups. We introduce a simple yet effective, self-supervised po…
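As a rough illustration of the autoencoding idea named in the title (a hedged sketch only, not the MoRTy code; hyperparameters and names are assumptions), a small linear autoencoder can be fit on a pretrained embedding matrix, with its reconstruction serving as an alternative embedding set; the method presumably produces several such variants and selects the one that best suits a given task.

    import torch
    import torch.nn as nn

    def autoencode_embeddings(emb, hidden_dim=100, epochs=50, lr=1e-3):
        # Fit a linear autoencoder on a pretrained embedding matrix (vocab, dim)
        # and return its reconstruction as a re-specialized embedding variant.
        model = nn.Sequential(nn.Linear(emb.size(1), hidden_dim),
                              nn.Linear(hidden_dim, emb.size(1)))
        opt = torch.optim.Adam(model.parameters(), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            loss = nn.functional.mse_loss(model(emb), emb)
            loss.backward()
            opt.step()
        return model(emb).detach()

    pretrained = torch.randn(1000, 300)   # stand-in for e.g. FastText vectors
    variant = autoencode_embeddings(pretrained)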
Learning Comment Controversy Prediction in Web Discussions Using Incidentally Supervised Multi-Task CNNs
Comments on web news contain controversies that manifest as inter-group agreement-conflicts. Tracking such rapidly evolving controversy could ease conflict resolution or journalist-user interaction. However, this presupposes controversy on…
Detecting Named Entities and Relations in German Clinical Reports
Clinical notes and discharge summaries are commonly used in the clinical routine and contain patient-related information such as well-being, findings and treatments. Information is often described in text form and presented in a semi-struc…
Common Round: Application of Language Technologies to Large-Scale Web Debates
Hans Uszkoreit, Aleksandra Gabryszak, Leonhard Hennig, Jörg Steffen, Renlong Ai, Stephan Busemann, Jon Dehdari, Josef van Genabith, Georg Heigold, Nils Rethmeier, Raphael Rubino, Sven Schmeier, Philippe Thomas, He Wang, Feiyu Xu. Proceedin…