Pauli Miettinen
Brain Ventricle Morphology Markers in Predicting Shunt Surgery Outcome in Idiopathic Normal-Pressure Hydrocephalus
Background Idiopathic normal pressure hydrocephalus (iNPH) is characterized by a clinical triad of symptoms: abnormal gait, memory problems, and urinary incontinence. Neuroimaging plays a crucial role in diagnosing iNPH. However, current r…
Fast Redescription Mining Using Locality-Sensitive Hashing
Redescription mining is a data analysis technique that has found applications in diverse fields. The most used redescription mining approaches involve two phases: finding matching pairs among data attributes and extending the pairs. This p…
Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations
Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping cluste…
Differentially private tree-based redescription mining
Differential privacy provides a strong form of privacy and allows preserving most of the original characteristics of the dataset. Utilizing these benefits requires one to design specific differentially private data analysis algorithms. In …
Finding Rule-Interpretable Non-Negative Data Representation
Non-negative Matrix Factorization (NMF) is an intensively used technique for obtaining parts-based, lower dimensional and non-negative representation. Researchers in biology, medicine, pharmacy and other fields often prefer NMF over other …
Biclustering and Boolean Matrix Factorization in Data Streams
We study the clustering of bipartite graphs and Boolean matrix factorization in data streams. We consider a streaming setting in which the vertices from the left side of the graph arrive one by one together with all of their incident edges…
Recent Developments in Boolean Matrix Factorization
The goal of Boolean Matrix Factorization (BMF) is to approximate a given binary matrix as the product of two low-rank binary factor matrices, where the product of the factor matrices is computed under the Boolean algebra. While the problem…
Frequent subgraph mining for biologically meaningful structural motifs
Identification of biologically relevant motifs in proteins is a long-standing problem in bioinformatics, especially when considering distantly related proteins where sequence analysis alone becomes increasingly difficult. Here we present a…
Lainvalmistelu tiedonhallinnan haasteena – tekoäly ratkaisuna? [Law drafting as an information management challenge: is AI the solution?]
Based on a model of the practical law-drafting process, this article examines how AI tools could be used to improve the quality of law drafting. The study used as its case example the transport and …
HyGen: generating random graphs with hyperbolic communities
Hybrid ASP-based Approach to Pattern Mining
Detecting small sets of relevant patterns from a given data set is a central challenge in data mining. The relevance of a pattern is based on user-provided criteria; typically, all patterns that satisfy certain criteria are considered rele…
Algorithms for approximate subtropical matrix factorization
Matrix factorization methods are important tools in data mining and analysis. They can be used for many tasks, ranging from dimensionality reduction to visualization. In this paper we concentrate on the use of matrix factorizations for fin…
Latitude: A Model for Mixed Linear-Tropical Matrix Factorization
Nonnegative matrix factorization (NMF) is one of the most frequently used matrix factorization models in data analysis. A significant reason for the popularity of NMF is its interpretability and the 'parts of whole' interpretation of its co…
Mining Redescriptions with Siren
In many areas of science, scientists need to find distinct common characterizations of the same objects and, vice versa, to identify sets of objects that admit multiple shared descriptions. For example, in biology, an important task is to …
From sets of good redescriptions to good sets of redescriptions
Logistic-Tropical Decompositions and Nested Subgraphs
Reductions for Frequency-Based Data Mining Problems
Studying the computational complexity of problems is one of the - if not the - fundamental questions in computer science. Yet, surprisingly little is known about the computational complexity of many central problems in data mining. In this…
Redescription Mining: An Overview
Hyperbolae are No Hyperbole: Modelling Communities That are Not Cliques
Cliques are frequently used to model communities: a community is a set of nodes where each pair is equally likely to be connected. But studying real-world communities reveals that they have more structure than that. In particular, the node…
Analysing Political Opinions Using Redescription Mining
What You Will Gain By Rounding: Theory and Algorithms for Rounding Rank
When factorizing binary matrices, we often have to make a choice between using expensive combinatorial methods that retain the discrete nature of the data and using continuous methods that can be more efficient but destroy the discrete str…
Capricorn: An Algorithm for Subtropical Matrix Factorization
Max-times algebra, sometimes known as subtropical algebra, is a semi-ring over the nonnegative real numbers where the addition operation is the max function and the multiplication is the standard one. Factorizing a nonnegative matrix over t…
Interactive Constrained Boolean Matrix Factorization
Getting to Know the Unknown Unknowns: Destructive-Noise Resistant Boolean Matrix Factorization
Finding patterns in binary data is a classical problem in data mining, dating back to at least frequent itemset mining. More recently, approaches such as tiling and Boolean matrix factorization (BMF) have been proposed to find sets of pat…
Clustering Boolean tensors