bioRxiv (Cold Spring Harbor Laboratory)
scMUSCL: Multi-Source Transfer Learning for Clustering scRNA-seq Data
April 2024 • Arash Khoeini, Funda Sar, Yen‐Yi Lin, Colin C. Collins, Martin Ester
Abstract Motivation scRNA-seq analysis relies heavily on single-cell clustering to perform many downstream functions. Several machine learning methods have been proposed to improve the clustering of single cells, yet most of these methods are fully unsupervised and ignore the wealth of publicly available annotated datasets from single-cell experiments. Cells are high-dimensional entities, and unsupervised clustering might find clusters without biological meaning. Exploiting relevant annotated scRNA-seq dataset as …