Explanipedia

The virtual doctor prescribing the future: Diagnostics with interactive clinical decision support Open

Jan Benedikt Ruhland, Johannes Wichmann, Dmitry Degtyar, Roman Martin, Leon Fehse , et al. · 2025

The growing shortage of healthcare professionals, particularly in rural areas, poses a significant challenge to timely and effective medical care. To address this issue, we present the Virtual Doctor, a walk-in cabin designed to collect an…

JET: Fast Estimation of Hierarchical Time Series Clustering Open

Phillip Wenig, Mathias Höfgen, Thorsten Papenbrock · 2024

Clustering is an effective, unsupervised classification approach for time series analysis applications that suffer a natural lack of training data. One such application is the development of jet engines, which involves numerous test runs a…

Discovering Functional Dependencies through Hitting Set Enumeration Open

Tobias Bleifuß, Thorsten Papenbrock, Thomas Bläsius, Martin Schirneck, Felix Naumann · 2024

Functional dependencies (FDs) are among the most important integrity constraints in databases. They serve to normalize datasets and thus resolve redundancies, they contribute to query optimization, and they are frequently used to guide dat…

Actix-Telepathy Open

Phillip Wenig, Thorsten Papenbrock · 2023

The actor programming model supports the development of concurrent applications by encapsulating state and behavior into independent actors. Each actor is a computational entity with strictly private state and behavior. Actors communicate …

TimeEval Open

Phillip Wenig, Sebastian Schmidl, Thorsten Papenbrock · 2022

Detecting anomalous subsequences in time series is an important task in time series analytics because it serves the identification of special events, such as production faults, delivery bottlenecks, system defects, or heart flicker. Conseq…

NFDI Data Integration Open

Bernhard Seeger, Andreas Henrich, Thorsten Papenbrock, Dirk Riehle · 2022

<p>Within the scientific landscape, every discipline has big data management problems due to the heterogeneity and/or to the large volume of scientific data. NFDI provides new FAIR approaches to overcome these problems within domains…

NFDI Data Integration Open

Bernhard Seeger, Andreas Henrich, Thorsten Papenbrock, Dirk Riehle · 2022

<p>Within the scientific landscape, every discipline has big data management problems due to the heterogeneity and/or to the large volume of scientific data. NFDI provides new FAIR approaches to overcome these problems within domains…

Correction to: Data dependencies for query optimization: a survey Open

Jan Kossmann, Thorsten Papenbrock, Felix Naumann · 2021

Efficient distributed discovery of bidirectional order dependencies Open

Sebastian Schmidl, Thorsten Papenbrock · 2021

Data dependencies for query optimization: a survey Open

Jan Kossmann, Thorsten Papenbrock, Felix Naumann · 2021

Effective query optimization is a core feature of any database management system. While most query optimization techniques make use of simple metadata, such as cardinalities and other basic statistics, other optimization techniques are bas…

Distributed detection of sequential anomalies in univariate time series Open

Johannes Schneider, Phillip Wenig, Thorsten Papenbrock · 2021

The automated detection of sequential anomalies in time series is an essential task for many applications, such as the monitoring of technical systems, fraud detection in high-frequency trading, or the early detection of disease symptoms. …

Optimized Theta-Join Processing Open

Julian Weise, Sebastian Schmidl, Thorsten Papenbrock · 2021

The Theta-Join is a powerful operation to connect tuples of different relational tables based on arbitrary conditions. The operation is a fundamental requirement for many data-driven use cases, such as data cleaning, consistency checking, …

Hitting set enumeration with partial information for unique column combination discovery Open

Johann Birnick, Thomas Bläsius, Tobias Friedrich, Felix Naumann, Thorsten Papenbrock , et al. · 2020

Unique column combinations (UCCs) are a fundamental concept in relational databases. They identify entities in the data and support various data management activities. Still, UCCs are usually not explicitly defined and need to be discovere…

4.-8. März 2019 Open

Thorsten Papenbrock · 2019

Daten sind nicht nur in der Informatik, sondern auch in vielen anderen wissenschaftlichen Disziplinen ein unverzichtbares Wirtschaftsgut. Sie dienen dem Austausch, der Verknüpfung und der Speicherung von Wissen und sind daher unverzichtbar…

DynFD: Functional Dependency Discovery in Dynamic Datasets Open

Philipp Schirmer, Thorsten Papenbrock, Sebastian Kruse, Felix Naumann, Dennis Hempfing , et al. · 2019

4.-8. März 2019 Open

Sebastian Schmidl, Frederic Schneider, Thorsten Papenbrock · 2019

System architectures for data-centric applications are commonly comprised of two tiers: An application tier and a data tier. The fact that these tiers do not typically share a common format for data is referred to as object-relational impe…

Data profiling - efficient discovery of dependencies Open

Thorsten Papenbrock · 2017

Data profiling is the computer science discipline of analyzing a given dataset for its metadata. The types of metadata range from basic statistics, such as tuple counts, column aggregations, and value distributions, to much more complex st…

Data-driven Schema Normalization Open

Thorsten Papenbrock, Felix Naumann · 2017

Holistic Data Profiling: Simultaneous Discovery of Various Metadata Open

Jens Ehrlich, Mandy Roick, Lukas Schulze, Jakob Zwiener, Thorsten Papenbrock , et al. · 2016

Data proling is the discipline of examining an unknown dataset for its structure and statistical information. It is a preprocessing step in a wide range of applications, such as data integration, data cleansing, or query optimization. For …

Thorsten Papenbrock YOU? Author Swipe