Claus Weihs
YOU?
Author Swipe
View article: Editorial for ADAC issue 2 of volume 18 (2024)
Editorial for ADAC issue 2 of volume 18 (2024) Open
View article: How real is the quantitative turn? Investigating statistics as the new normal in linguistics
How real is the quantitative turn? Investigating statistics as the new normal in linguistics Open
Statistical approaches in linguistics seem to have gained in importance in recent times, especially in the field of Corpus Linguistics. In particular, the last ten years have seen an upsurge of linguists being dedicated to statistical meth…
View article: Editorial for ADAC issue 3 of volume 16 (2022)
Editorial for ADAC issue 3 of volume 16 (2022) Open
View article: Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling and prediction, and ranking of predictors in ensembles
Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling and prediction, and ranking of predictors in ensembles Open
In this paper, we extend our PrInDT method (Weihs & Buschfeld 2021a) towards undersampling with different percentages of the smaller and the larger classes (psmall and plarge), stratification of predictors, varying the prediction threshold…
View article: Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling\n and prediction, and ranking of predictors in ensembles
Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling\n and prediction, and ranking of predictors in ensembles Open
In this paper, we extend our PrInDT method (Weihs & Buschfeld 2021a) towards\nundersampling with different percentages of the smaller and the larger classes\n(psmall and plarge), stratification of predictors, varying the prediction\nthresh…
View article: Editorial for ADAC issue 2 of volume 15 (2021)
Editorial for ADAC issue 2 of volume 15 (2021) Open
View article: NesPrInDT: Nested undersampling in PrInDT
NesPrInDT: Nested undersampling in PrInDT Open
In this paper, we extend our PrInDT method (Weihs, Buschfeld 2021) towards additional undersampling of one of the predictors. This helps us to handle multiple unbalanced data sets, i.e. data sets that are not only unbalanced with respect t…
View article: Combining Prediction and Interpretation in Decision Trees (PrInDT) -- a Linguistic Example
Combining Prediction and Interpretation in Decision Trees (PrInDT) -- a Linguistic Example Open
In this paper, we show that conditional inference trees and ensembles are suitable methods for modeling linguistic variation. As against earlier linguistic applications, however, we claim that their suitability is strongly increased if we …
View article: Multi-Criteria Optimization in the Production of Lithium-Ion Batteries
Multi-Criteria Optimization in the Production of Lithium-Ion Batteries Open
Lithium-ion-batteries (LIBs) play a key role in determining the environmental impacts of future mobility technologies. In particular, the production of LIBs has a strong environmental impact as it is characterized by high scrap rates. In a…
View article: Class Prediction by Prediction Intervals for Neural Nets
Class Prediction by Prediction Intervals for Neural Nets Open
Generally, the unknown coefficients of neural nets are estimated by nonlinear least squares. Therefore, prediction intervals for the true value of the target feature exist. The paper proposes to use such intervals for class prediction and …
View article: Data Journalism – Impact of Statistical Methods
Data Journalism – Impact of Statistical Methods Open
Data journalism strongly depends on adequate data preparation and analysis. In this paper, we discuss the impact of statistical methods on data journalistic analysis. To this end, we re-analyze two data journalistic publications of SPIEGEL…
View article: Predicting industrial‐scale cell culture seed trains–A Bayesian framework for model fitting and parameter estimation, dealing with uncertainty in measurements and model parameters, applied to a nonlinear kinetic cell culture model, using an MCMC method
Predicting industrial‐scale cell culture seed trains–A Bayesian framework for model fitting and parameter estimation, dealing with uncertainty in measurements and model parameters, applied to a nonlinear kinetic cell culture model, using an MCMC method Open
For production of biopharmaceuticals in suspension cell culture, seed trains are required to increase cell number from cell thawing up to production scale. Because cultivation conditions during the seed train have a significant impact on c…
View article: Infill Criterion for Multimodal Model-Based Optimisation
Infill Criterion for Multimodal Model-Based Optimisation Open
Physical systems are modelled and investigated within simulation software in an increasing range of applications. In reality an investigation of the system is often performed by empirical test scenarios which are related to typical situati…
View article: Arbeitszeiten von Professorinnen und Professoren in Deutschland 2016
Arbeitszeiten von Professorinnen und Professoren in Deutschland 2016 Open
View article: Data Science: the impact of statistics
Data Science: the impact of statistics Open
In this paper, we substantiate our premise that statistics is one of the most important disciplines to provide tools and methods to find structure in and to give deeper insight into data, and the most important discipline to analyze and qu…
View article: Classification Method Performance in High Dimensions
Classification Method Performance in High Dimensions Open
We discuss standard classification methods for high-dimensional data and a small number of observations. By means of designed simulations illustrating the practical relevance of theoretical results we show that in the 2-class case the foll…
View article: DMAIC in Lithium-Ion-Battery Production
DMAIC in Lithium-Ion-Battery Production Open
DMAIC refers to a data-driven cycle of analysis steps used for improving, optimizing and stabilizing business processes with the steps Define, Measure, Analyze, Improve, and Control (DMAIC). In this paper, we will demonstrate the Define, M…
View article: Support Vector Machines for Survival Analysis with R
Support Vector Machines for Survival Analysis with R Open
This article introduces the R package survivalsvm, implementing support vector machines for survival analysis. Three approaches are available in the package: The regression approach takes censoring into account when formulating the inequal…
View article: Cutting Optimal Sections from Production Foils
Cutting Optimal Sections from Production Foils Open
Rechargeable Lithium-Ion battery cell production is one of the most important processes in the field of electro mobility. The batteries’ electrodes are produced in the form of long coated foils which are then cut into pieces of a predefine…
View article: Industrial Data Science: Developing a Qualification Concept for Machine Learning in Industrial Production
Industrial Data Science: Developing a Qualification Concept for Machine Learning in Industrial Production Open
The advent of Industry 4.0 and the availability of large data storage systems lead to an increasing demand for specially educated data-oriented professionals in industrial production. The education of these specialists should combine eleme…
View article: Online Linear Discriminant Analysis for Data Streams with Concept Drift
Online Linear Discriminant Analysis for Data Streams with Concept Drift Open
Various methods based on classical classification methods such as linear discriminant analysis (LDA) have been developed for working on data streams in situations with concept drift. Nevertheless, the updated classifiers of such methods ma…
View article: Combination of Several Control Charts Based on Dynamic Ensemble Methods
Combination of Several Control Charts Based on Dynamic Ensemble Methods Open
Combining methods from Statistical ProcessControl (SPC) in order to benefit from more than one method's efficiency has been recently challenged.One of the reasons is that real life problems change overtime and a small improvement can lead …
View article: Predicting measurements at unobserved locations in an electrical transmission system
Predicting measurements at unobserved locations in an electrical transmission system Open
View article: A computational study of auditory models in music recognition tasks for normal-hearing and hearing-impaired listeners
A computational study of auditory models in music recognition tasks for normal-hearing and hearing-impaired listeners Open
View article: Efficient Global Optimization: Motivation, Variations, and Applications
Efficient Global Optimization: Motivation, Variations, and Applications Open
A popular optimization method of a black box objective function is Efficient Global Optimization (EGO), also known as Sequential Model Based Optimization, SMBO, with kriging and expected improvement. EGO is a sequential design of experimen…
View article: Multi-objective selection of algorithm portfolios
Multi-objective selection of algorithm portfolios Open
We propose a method for selecting a portfolio of algorithms optimizing multiple criteria. We select a portfolio of limited size and at the same time good quality from a possibly large pool of algorithms. Our method also helps to decide whi…
View article: Model based optimization of a statistical simulation model for single diamond grinding
Model based optimization of a statistical simulation model for single diamond grinding Open
View article: Fast model selection by limiting SVM training times
Fast model selection by limiting SVM training times Open
Kernelized Support Vector Machines (SVMs) are among the best performing supervised learning methods. But for optimal predictive performance, time-consuming parameter tuning is crucial, which impedes application. To tackle this problem, the…
View article: Time efficient optimization of instance based problems with application to tone onset detection
Time efficient optimization of instance based problems with application to tone onset detection Open
A time efficient optimization technique for instance based problems is proposed, where for each parameter setting the target function has to be evaluated on a large set of problem instances. Computational time is reduced by beginning with …
View article: A multivariate approach for onset detection using supervised classification
A multivariate approach for onset detection using supervised classification Open
In this paper we introduce a new onset detection approach which incorporates a supervised classification model for estimating the tone onset probability in signal frames. In contrast to the most classical strategies where only one detectio…