Explanipedia

Improving Wikipedia verifiability with AI Open

Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick A. Lewis, Gautier Izacard , et al. · 2023

Computer science Philosophy

Verifiability is a core content policy of Wikipedia: claims need to be backed by citations. Maintaining and improving the quality of Wikipedia references is an important challenge and there is a pressing need for better tools to assist hum…

Improving Wikipedia Verifiability with AI Open

Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick Lewis, Gautier Izacard , et al. · 2022

Computer science Philosophy Economics

Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by citations. There are millions of articles available online and thousands of new articles are released each month. For this re…

Improving Wikipedia Verifiability with AI Open

Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick Lewis, Gautier Izacard , et al. · 2022

Computer science Economics Philosophy

Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by citations. There are millions of articles available online and thousands of new articles are released each month. For this re…

Distributionally Robust Finetuning BERT for Covariate Drift in Spoken Language Understanding Open

Samuel Broscheit, Quynh Do, Judith Gaspers · 2022

Computer science Chemistry

In this study, we investigate robustness against covariate drift in spoken language understanding (SLU). Covariate drift can occur in SLUwhen there is a drift between training and testing regarding what users request or how they request it…

The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus Open

Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit , et al. · 2021

Computer science Geology

In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inc…

Unsupervised Multi-View Post-OCR Error Correction With Language Models Open

Harsh K. Gupta, Luciano Del Corro, Samuel Broscheit, Johannes Hoffart, Eliot Brenner · 2021

Computer science Physics Mathematics

We investigate post-OCR correction in a setting where we have access to different OCR views of the same document. The goal of this study is to understand if a pretrained language model (LM) can be used in an unsupervised way to reconcile t…

You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings Open

Daniel Ruffinelli, Samuel Broscheit, Rainer Gemulla · 2020

Computer science Materials science

Knowledge graph embedding (KGE) models learn algebraic representations of the entities and relations in a knowledge graph. A vast number of KGE techniques for multi-relational link prediction have been proposed in the recent literature, of…

Can We Predict New Facts with Open Knowledge Graph Embeddings? A Benchmark for Open Link Prediction Open

Samuel Broscheit, Kiril Gashteovski, Yanjie Wang, Rainer Gemulla · 2020

Computer science Geography Economics

Open Information Extraction systems extract (“subject text”, “relation text”, “object text”) triples from raw text. Some triples are textual versions of facts, i.e., non-canonicalized mentions of entities and relations. In this paper, we i…

LibKGE - A knowledge graph embedding library for reproducible research Open

Samuel Broscheit, Daniel Ruffinelli, Adrian Kochsiek, Patrick Betz, Rainer Gemulla · 2020

Computer science

LibKGE (https://github.com/uma-pi1/kge) is an open-source PyTorch-based library for training, hyperparameter optimization, and evaluation of knowledge graph embedding models for link prediction. The key goals of LibKGE are to enable reprod…

PRoFET: Predicting the Risk of Firms from Event Transcripts Open

Christoph Kilian Theil, Samuel Broscheit, Heiner Stuckenschmidt · 2019

Computer science Economics

Financial risk, defined as the chance to deviate from return expectations, is most commonly measured with volatility. Due to its value for investment decision making, volatility prediction is probably among the most important tasks in fina…

OPIEC: An Open Information Extraction Corpus Open

Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla · 2019

Computer science

Open information extraction (OIE) systems extract relations and their arguments from natural language text in an unsupervised manner. The resulting extractions are a valuable resource for downstream tasks such as knowledge base constructio…

A Relational Tucker Decomposition for Multi-Relational Link Prediction Open

Yanjie Wang, Samuel Broscheit, Rainer Gemulla · 2019

Computer science Mathematics Biology

We propose the Relational Tucker3 (RT) decomposition for multi-relational link prediction in knowledge graphs. We show that many existing knowledge graph embedding models are special cases of the RT decomposition with certain predefined sp…

Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking Open

Samuel Broscheit · 2019

Computer science Economics Philosophy

A typical architecture for end-to-end entity linking systems consists of three steps: mention detection, candidate generation and entity disambiguation. In this study we investigate the following questions: (a) Can all those steps be learn…

On Evaluating Embedding Models for Knowledge Base Completion Open

Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit, Christian Meilicke · 2019

Computer science Mathematics Philosophy

Knowledge graph embedding models have recently received significant attention in the literature. These models learn latent semantic representations for the entities and relations in a given knowledge base; the representations can be used t…

Do Embedding Models Perform Well for Knowledge Base Completion Open

Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit, Christian Meilicke · 2018

Computer science Mathematics Engineering

In this work, we put into question the effectiveness of the evaluation methods currently used to measure the performance of latent factor models for the task of knowledge base completion. We argue that by focusing on a small subset of poss…

On Evaluating Embedding Models for Knowledge Base Completion Open

Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit, Christian Meilicke · 2018

Computer science Mathematics Engineering

Knowledge bases contribute to many web search and mining tasks, yet they are often incomplete. To add missing facts to a given knowledge base, various embedding models have been proposed in the recent literature. Perhaps surprisingly, rela…

Learning Distributional Token Representations from Visual Features Open

Samuel Broscheit · 2018

Computer science Economics Political science

In this study, we compare token representations constructed from visual features (i.e., pixels) with standard lookup-based embeddings. Our goal is to gain insight about the challenges of encoding a text representation from low-level featur…

A Neural Autoencoder Approach for Document Ranking and Query Refinement in Pharmacogenomic Information Retrieval Open

Jonas Pfeiffer, Samuel Broscheit, Rainer Gemulla, Mathias Göschl · 2018

Computer science Philosophy Mathematics

In this study, we investigate learning-to-rank and query refinement approaches for information retrieval in the pharmacogenomic domain. The goal is to improve the information retrieval process of biomedical curators, who manually build kno…

Summa At Tac Knowledge Base Population Task 2016 Open

Pēteris Paikens, Guntis Bārzdiņš, Afonso Mendes, Daniel Ferreira, Samuel Broscheit , et al. · 2016

Computer science Mathematics Medicine

Our submission to the NIST TAC-KBP-20161 is an initial attempt to apply our ongoing research on text analysis within SUMMA project to TAC shared tasks. The goal of SUMMA is to develop a scalable and extensible media monitoring platform wit…

Samuel Broscheit YOU? Author Swipe