Explanipedia

Protocol for development of a checklist and guideline for transparent reporting of cluster analyses (TRoCA)

Daniil Lisik, Syed Ahmar Shah, Rani Basna, Duy-Tai Dinh, Ryan P. Browne, et al. · 2025

Introduction: Cluster analysis, a machine learning-based and data-driven technique for identifying groups in data, has demonstrated its potential in a wide range of contexts. However, critical appraisal and reproducibility are often limited…

Model-based bi-clustering using multivariate Poisson-lognormal with general block-diagonal covariance matrix and its applications

Caitlin Kral, Evan Chance, Ryan P. Browne, Sanjeena Subedi · 2025

While several Gaussian mixture models-based biclustering approaches currently exist in the literature for continuous data, approaches to handle discrete data have not been well researched. A multivariate Poisson-lognormal (MPLN) model-base…

Statistical inference for sketching algorithms

Ryan P. Browne, Jeffrey L. Andrews · 2024

Computer science Mathematics

Sketching algorithms use random projections to generate a smaller sketched data set, often for the purposes of modelling. Complete and partial sketch regression estimates can be constructed using information from only the sketched data set…

Statistical inference for sketching algorithms

Ryan P. Browne, Jeffrey L. Andrews · 2023

Computer science Mathematics

Sketching algorithms use random projections to generate a smaller sketched data set, often for the purposes of modelling. Complete and partial sketch regression estimates can be constructed using information from only the sketched data set…

Estimation of Gaussian Bi-Clusters with General Block-Diagonal Covariance Matrix and Applications

Anastasiia Livochka, Ryan P. Browne, Sanjeena Subedi · 2023

Computer science Mathematics Physics

Bi-clustering is a technique that allows for the simultaneous clustering of observations and features in a dataset. This technique is often used in bioinformatics, text mining, and time series analysis. An important advantage of biclusteri…

Assessing the variability of posterior probabilities in Gaussian model-based clustering

Yuchi Zhang, Ryan P. Browne, Jeffrey L. Andrews · 2021

Computer science Mathematics Physics

We propose a variant of the bootstrap to assess the variability of posterior probabilities arising from Gaussian model-based clustering. The bootstrap variant uses predictions based on out-of-bootstrap-sample observations and then construc…

One Line To Rule Them All: Generating LO-Shot Soft-Label Prototypes

Ilia Sucholutsky, Nam‐Hwui Kim, Ryan P. Browne, Matthias Schonlau · 2021

Computer science Mathematics

Increasingly large datasets are rapidly driving up the computational costs of machine learning. Prototype generation methods aim to create a small set of synthetic observations that accurately represent a training dataset but greatly reduc…

Model-Based Clustering, Classification, and Discriminant Analysis Using the Generalized Hyperbolic Distribution: MixGHD R package

Cristina Tortora, Ryan P. Browne, Aisha ElSherbiny, Brian C. Franczak, Paul D. McNicholas · 2021

Computer science Mathematics

The MixGHD package for R performs model-based clustering, classification, and discriminant analysis using the generalized hyperbolic distribution (GHD). This approach is suitable for data that can be considered a realization of a (multivar…

A parsimonious family of multivariate Poisson-lognormal distributions for clustering multivariate count data

Sanjeena Subedi, Ryan P. Browne · 2020

Mathematics Computer science

Multivariate count data are commonly encountered through high-throughput sequencing technologies in bioinformatics, text mining, or in sports analytics. Although the Poisson distribution seems a natural fit to these count data, its multiva…

Model-based clustering and classification using mixtures of multivariate skewed power exponential distributions

Utkarsh J. Dang, Michael P. B. Gallaugher, Ryan P. Browne, Paul D. McNicholas · 2019

Mathematics Chemistry

Families of mixtures of multivariate power exponential (MPE) distributions have been previously introduced and shown to be competitive for cluster analysis in comparison to other elliptical mixtures including mixtures of Gaussian distribut…

Flexible clustering of high‐dimensional data via mixtures of joint generalized hyperbolic distributions

Yang Tang, Ryan P. Browne, Paul D. McNicholas · 2018

Computer science Mathematics Engineering

A mixture of joint generalized hyperbolic distributions (MJGHD) is introduced for asymmetric clustering for high‐dimensional data. The MJGHD approach takes into account the cluster‐specific subspaces, thereby limiting the number of paramet…

Asymmetric Clustering for High-Dimensional Data via Mixtures of Joint Generalized Hyperbolic Models

Yang Tang, Ryan P. Browne, Paul D. McNicholas · 2017

Mathematics Computer science Engineering

A mixture of joint generalized hyperbolic models is introduced for asymmetric clustering for high-dimensional data (MJGHM-HDClust). The MJGHM-HDClust approach takes into account the cluster specific subspace and therefore limits the number…
