Thorsten Papenbrock
The virtual doctor prescribing the future: Diagnostics with interactive clinical decision support
The growing shortage of healthcare professionals, particularly in rural areas, poses a significant challenge to timely and effective medical care. To address this issue, we present the Virtual Doctor, a walk-in cabin designed to collect an…
JET: Fast Estimation of Hierarchical Time Series Clustering
Clustering is an effective, unsupervised classification approach for time series analysis applications that suffer from a natural lack of training data. One such application is the development of jet engines, which involves numerous test runs a…
Discovering Functional Dependencies through Hitting Set Enumeration
Functional dependencies (FDs) are among the most important integrity constraints in databases. They serve to normalize datasets and thus resolve redundancies, they contribute to query optimization, and they are frequently used to guide dat…
Actix-Telepathy
The actor programming model supports the development of concurrent applications by encapsulating state and behavior into independent actors. Each actor is a computational entity with strictly private state and behavior. Actors communicate …
TimeEval
Detecting anomalous subsequences in time series is an important task in time series analytics because it serves the identification of special events, such as production faults, delivery bottlenecks, system defects, or heart flicker. Conseq…
NFDI Data Integration
Within the scientific landscape, every discipline has big data management problems due to the heterogeneity and/or the large volume of scientific data. NFDI provides new FAIR approaches to overcome these problems within domains…
Correction to: Data dependencies for query optimization: a survey
Efficient distributed discovery of bidirectional order dependencies
Data dependencies for query optimization: a survey
Effective query optimization is a core feature of any database management system. While most query optimization techniques make use of simple metadata, such as cardinalities and other basic statistics, other optimization techniques are bas…
Distributed detection of sequential anomalies in univariate time series
The automated detection of sequential anomalies in time series is an essential task for many applications, such as the monitoring of technical systems, fraud detection in high-frequency trading, or the early detection of disease symptoms. …
Optimized Theta-Join Processing
The Theta-Join is a powerful operation to connect tuples of different relational tables based on arbitrary conditions. The operation is a fundamental requirement for many data-driven use cases, such as data cleaning, consistency checking, …
Hitting set enumeration with partial information for unique column combination discovery
Unique column combinations (UCCs) are a fundamental concept in relational databases. They identify entities in the data and support various data management activities. Still, UCCs are usually not explicitly defined and need to be discovere…
4.-8. März 2019
Data are an indispensable economic asset not only in computer science but also in many other scientific disciplines. They serve the exchange, linking, and storage of knowledge and are therefore indispensable…
DynFD: Functional Dependency Discovery in Dynamic Datasets
4.-8. März 2019
System architectures for data-centric applications commonly comprise two tiers: an application tier and a data tier. The fact that these tiers do not typically share a common format for data is referred to as object-relational impe…
Data profiling - efficient discovery of dependencies
Data profiling is the computer science discipline of analyzing a given dataset for its metadata. The types of metadata range from basic statistics, such as tuple counts, column aggregations, and value distributions, to much more complex st…
Data-driven Schema Normalization
Holistic Data Profiling: Simultaneous Discovery of Various Metadata
Data profiling is the discipline of examining an unknown dataset for its structure and statistical information. It is a preprocessing step in a wide range of applications, such as data integration, data cleansing, or query optimization. For …