Identifying cell states in single-cell RNA-seq data at statistically maximal resolution Article Swipe
YOU?
·
· 2024
· Open Access
·
· DOI: https://doi.org/10.1371/journal.pcbi.1012224
Single-cell RNA sequencing (scRNA-seq) has become a popular experimental method to study variation of gene expression within a population of cells. However, obtaining an accurate picture of the diversity of distinct gene expression states that are present in a given dataset is highly challenging because of the sparsity of the scRNA-seq data and its inhomogeneous measurement noise properties. Although a vast number of different methods is applied in the literature for clustering cells into subsets with ‘similar’ expression profiles, these methods generally lack rigorously specified objectives, involve multiple complex layers of normalization, filtering, feature selection, dimensionality-reduction, employ ad hoc measures of distance or similarity between cells, often ignore the known measurement noise properties of scRNA-seq measurements, and include a large number of tunable parameters. Consequently, it is virtually impossible to assign concrete biophysical meaning to the clusterings that result from these methods. Here we address the following problem: Given raw unique molecule identifier (UMI) counts of an scRNA-seq dataset, partition the cells into subsets such that the gene expression states of the cells in each subset are statistically indistinguishable, and each subset corresponds to a distinct gene expression state. That is, we aim to partition cells so as to maximally reduce the complexity of the dataset without removing any of its meaningful structure. We show that, given the known measurement noise structure of scRNA-seq data, this problem is mathematically well-defined and derive its unique solution from first principles. We have implemented this solution in a tool called Cellstates which operates directly on the raw data and automatically determines the optimal partition and cluster number, with zero tunable parameters. We show that, on synthetic datasets, Cellstates almost perfectly recovers optimal partitions. On real data, Cellstates robustly identifies subtle substructure within groups of cells that are traditionally annotated as a common cell type. Moreover, we show that the diversity of gene expression states that Cellstates identifies systematically depends on the tissue of origin and not on technical features of the experiments such as the total number of cells and total UMI count per cell. In addition to the Cellstates tool we also provide a small toolbox of software to place the identified cellstates into a hierarchical tree of higher-order clusters, to identify the most important differentially expressed genes at each branch of this hierarchy, and to visualize these results.
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1371/journal.pcbi.1012224
- https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printable
- OA Status
- gold
- Cited By
- 7
- References
- 49
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4400579193
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4400579193Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1371/journal.pcbi.1012224Digital Object Identifier
- Title
-
Identifying cell states in single-cell RNA-seq data at statistically maximal resolutionWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2024Year of publication
- Publication date
-
2024-07-12Full publication date if available
- Authors
-
Pascal Grobecker, Thomas Sakoparnig, Erik van NimwegenList of authors in order
- Landing page
-
https://doi.org/10.1371/journal.pcbi.1012224Publisher landing page
- PDF URL
-
https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printableDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
goldOpen access status per OpenAlex
- OA URL
-
https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printableDirect OA link when available
- Concepts
-
RNA-Seq, Cell, Single-cell analysis, Computational biology, RNA, Transcriptome, Biology, Genetics, Gene expression, GeneTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
7Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 6, 2024: 1Per-year citation counts (last 5 years)
- References (count)
-
49Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4400579193 |
|---|---|
| doi | https://doi.org/10.1371/journal.pcbi.1012224 |
| ids.doi | https://doi.org/10.1371/journal.pcbi.1012224 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/38995959 |
| ids.openalex | https://openalex.org/W4400579193 |
| fwci | 3.36173058 |
| mesh[0].qualifier_ui | Q000379 |
| mesh[0].descriptor_ui | D059010 |
| mesh[0].is_major_topic | True |
| mesh[0].qualifier_name | methods |
| mesh[0].descriptor_name | Single-Cell Analysis |
| mesh[1].qualifier_ui | Q000379 |
| mesh[1].descriptor_ui | D000081246 |
| mesh[1].is_major_topic | True |
| mesh[1].qualifier_name | methods |
| mesh[1].descriptor_name | RNA-Seq |
| mesh[2].qualifier_ui | Q000379 |
| mesh[2].descriptor_ui | D019295 |
| mesh[2].is_major_topic | True |
| mesh[2].qualifier_name | methods |
| mesh[2].descriptor_name | Computational Biology |
| mesh[3].qualifier_ui | |
| mesh[3].descriptor_ui | D016000 |
| mesh[3].is_major_topic | False |
| mesh[3].qualifier_name | |
| mesh[3].descriptor_name | Cluster Analysis |
| mesh[4].qualifier_ui | Q000379 |
| mesh[4].descriptor_ui | D017423 |
| mesh[4].is_major_topic | True |
| mesh[4].qualifier_name | methods |
| mesh[4].descriptor_name | Sequence Analysis, RNA |
| mesh[5].qualifier_ui | Q000379 |
| mesh[5].descriptor_ui | D020869 |
| mesh[5].is_major_topic | False |
| mesh[5].qualifier_name | methods |
| mesh[5].descriptor_name | Gene Expression Profiling |
| mesh[6].qualifier_ui | |
| mesh[6].descriptor_ui | D006801 |
| mesh[6].is_major_topic | False |
| mesh[6].qualifier_name | |
| mesh[6].descriptor_name | Humans |
| mesh[7].qualifier_ui | |
| mesh[7].descriptor_ui | D000465 |
| mesh[7].is_major_topic | False |
| mesh[7].qualifier_name | |
| mesh[7].descriptor_name | Algorithms |
| mesh[8].qualifier_ui | |
| mesh[8].descriptor_ui | D000818 |
| mesh[8].is_major_topic | False |
| mesh[8].qualifier_name | |
| mesh[8].descriptor_name | Animals |
| mesh[9].qualifier_ui | |
| mesh[9].descriptor_ui | D000092386 |
| mesh[9].is_major_topic | False |
| mesh[9].qualifier_name | |
| mesh[9].descriptor_name | Single-Cell Gene Expression Analysis |
| mesh[10].qualifier_ui | Q000379 |
| mesh[10].descriptor_ui | D059010 |
| mesh[10].is_major_topic | True |
| mesh[10].qualifier_name | methods |
| mesh[10].descriptor_name | Single-Cell Analysis |
| mesh[11].qualifier_ui | Q000379 |
| mesh[11].descriptor_ui | D000081246 |
| mesh[11].is_major_topic | True |
| mesh[11].qualifier_name | methods |
| mesh[11].descriptor_name | RNA-Seq |
| mesh[12].qualifier_ui | Q000379 |
| mesh[12].descriptor_ui | D019295 |
| mesh[12].is_major_topic | True |
| mesh[12].qualifier_name | methods |
| mesh[12].descriptor_name | Computational Biology |
| mesh[13].qualifier_ui | |
| mesh[13].descriptor_ui | D016000 |
| mesh[13].is_major_topic | False |
| mesh[13].qualifier_name | |
| mesh[13].descriptor_name | Cluster Analysis |
| mesh[14].qualifier_ui | Q000379 |
| mesh[14].descriptor_ui | D017423 |
| mesh[14].is_major_topic | True |
| mesh[14].qualifier_name | methods |
| mesh[14].descriptor_name | Sequence Analysis, RNA |
| mesh[15].qualifier_ui | Q000379 |
| mesh[15].descriptor_ui | D020869 |
| mesh[15].is_major_topic | False |
| mesh[15].qualifier_name | methods |
| mesh[15].descriptor_name | Gene Expression Profiling |
| mesh[16].qualifier_ui | |
| mesh[16].descriptor_ui | D006801 |
| mesh[16].is_major_topic | False |
| mesh[16].qualifier_name | |
| mesh[16].descriptor_name | Humans |
| mesh[17].qualifier_ui | |
| mesh[17].descriptor_ui | D000465 |
| mesh[17].is_major_topic | False |
| mesh[17].qualifier_name | |
| mesh[17].descriptor_name | Algorithms |
| mesh[18].qualifier_ui | |
| mesh[18].descriptor_ui | D000818 |
| mesh[18].is_major_topic | False |
| mesh[18].qualifier_name | |
| mesh[18].descriptor_name | Animals |
| mesh[19].qualifier_ui | |
| mesh[19].descriptor_ui | D000092386 |
| mesh[19].is_major_topic | False |
| mesh[19].qualifier_name | |
| mesh[19].descriptor_name | Single-Cell Gene Expression Analysis |
| mesh[20].qualifier_ui | Q000379 |
| mesh[20].descriptor_ui | D059010 |
| mesh[20].is_major_topic | True |
| mesh[20].qualifier_name | methods |
| mesh[20].descriptor_name | Single-Cell Analysis |
| mesh[21].qualifier_ui | Q000379 |
| mesh[21].descriptor_ui | D000081246 |
| mesh[21].is_major_topic | True |
| mesh[21].qualifier_name | methods |
| mesh[21].descriptor_name | RNA-Seq |
| mesh[22].qualifier_ui | Q000379 |
| mesh[22].descriptor_ui | D019295 |
| mesh[22].is_major_topic | True |
| mesh[22].qualifier_name | methods |
| mesh[22].descriptor_name | Computational Biology |
| mesh[23].qualifier_ui | |
| mesh[23].descriptor_ui | D016000 |
| mesh[23].is_major_topic | False |
| mesh[23].qualifier_name | |
| mesh[23].descriptor_name | Cluster Analysis |
| mesh[24].qualifier_ui | Q000379 |
| mesh[24].descriptor_ui | D017423 |
| mesh[24].is_major_topic | True |
| mesh[24].qualifier_name | methods |
| mesh[24].descriptor_name | Sequence Analysis, RNA |
| mesh[25].qualifier_ui | Q000379 |
| mesh[25].descriptor_ui | D020869 |
| mesh[25].is_major_topic | False |
| mesh[25].qualifier_name | methods |
| mesh[25].descriptor_name | Gene Expression Profiling |
| mesh[26].qualifier_ui | |
| mesh[26].descriptor_ui | D006801 |
| mesh[26].is_major_topic | False |
| mesh[26].qualifier_name | |
| mesh[26].descriptor_name | Humans |
| mesh[27].qualifier_ui | |
| mesh[27].descriptor_ui | D000465 |
| mesh[27].is_major_topic | False |
| mesh[27].qualifier_name | |
| mesh[27].descriptor_name | Algorithms |
| mesh[28].qualifier_ui | |
| mesh[28].descriptor_ui | D000818 |
| mesh[28].is_major_topic | False |
| mesh[28].qualifier_name | |
| mesh[28].descriptor_name | Animals |
| mesh[29].qualifier_ui | |
| mesh[29].descriptor_ui | D000092386 |
| mesh[29].is_major_topic | False |
| mesh[29].qualifier_name | |
| mesh[29].descriptor_name | Single-Cell Gene Expression Analysis |
| type | article |
| title | Identifying cell states in single-cell RNA-seq data at statistically maximal resolution |
| awards[0].id | https://openalex.org/G1529875869 |
| awards[0].funder_id | https://openalex.org/F4320320924 |
| awards[0].display_name | |
| awards[0].funder_award_id | 310030_184937 |
| awards[0].funder_display_name | Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung |
| biblio.issue | 7 |
| biblio.volume | 20 |
| biblio.last_page | e1012224 |
| biblio.first_page | e1012224 |
| topics[0].id | https://openalex.org/T11289 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 1.0 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Single-cell and spatial transcriptomics |
| topics[1].id | https://openalex.org/T12859 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9781000018119812 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1304 |
| topics[1].subfield.display_name | Biophysics |
| topics[1].display_name | Cell Image Analysis Techniques |
| topics[2].id | https://openalex.org/T11255 |
| topics[2].field.id | https://openalex.org/fields/22 |
| topics[2].field.display_name | Engineering |
| topics[2].score | 0.9569000005722046 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2204 |
| topics[2].subfield.display_name | Biomedical Engineering |
| topics[2].display_name | Microfluidic and Bio-sensing Technologies |
| funders[0].id | https://openalex.org/F4320320924 |
| funders[0].ror | https://ror.org/00yjd3n13 |
| funders[0].display_name | Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung |
| is_xpac | False |
| apc_list.value | 2655 |
| apc_list.currency | USD |
| apc_list.value_usd | 2655 |
| apc_paid.value | 2659 |
| apc_paid.currency | EUR |
| apc_paid.value_usd | 2867 |
| concepts[0].id | https://openalex.org/C107397762 |
| concepts[0].level | 5 |
| concepts[0].score | 0.7468928098678589 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q2542347 |
| concepts[0].display_name | RNA-Seq |
| concepts[1].id | https://openalex.org/C1491633281 |
| concepts[1].level | 2 |
| concepts[1].score | 0.5308647751808167 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q7868 |
| concepts[1].display_name | Cell |
| concepts[2].id | https://openalex.org/C2776950831 |
| concepts[2].level | 3 |
| concepts[2].score | 0.49644237756729126 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q15635432 |
| concepts[2].display_name | Single-cell analysis |
| concepts[3].id | https://openalex.org/C70721500 |
| concepts[3].level | 1 |
| concepts[3].score | 0.4881787896156311 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q177005 |
| concepts[3].display_name | Computational biology |
| concepts[4].id | https://openalex.org/C67705224 |
| concepts[4].level | 3 |
| concepts[4].score | 0.47907477617263794 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q11053 |
| concepts[4].display_name | RNA |
| concepts[5].id | https://openalex.org/C162317418 |
| concepts[5].level | 4 |
| concepts[5].score | 0.40694254636764526 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q252857 |
| concepts[5].display_name | Transcriptome |
| concepts[6].id | https://openalex.org/C86803240 |
| concepts[6].level | 0 |
| concepts[6].score | 0.3640978932380676 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[6].display_name | Biology |
| concepts[7].id | https://openalex.org/C54355233 |
| concepts[7].level | 1 |
| concepts[7].score | 0.22783508896827698 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[7].display_name | Genetics |
| concepts[8].id | https://openalex.org/C150194340 |
| concepts[8].level | 3 |
| concepts[8].score | 0.18213510513305664 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q26972 |
| concepts[8].display_name | Gene expression |
| concepts[9].id | https://openalex.org/C104317684 |
| concepts[9].level | 2 |
| concepts[9].score | 0.1043233871459961 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[9].display_name | Gene |
| keywords[0].id | https://openalex.org/keywords/rna-seq |
| keywords[0].score | 0.7468928098678589 |
| keywords[0].display_name | RNA-Seq |
| keywords[1].id | https://openalex.org/keywords/cell |
| keywords[1].score | 0.5308647751808167 |
| keywords[1].display_name | Cell |
| keywords[2].id | https://openalex.org/keywords/single-cell-analysis |
| keywords[2].score | 0.49644237756729126 |
| keywords[2].display_name | Single-cell analysis |
| keywords[3].id | https://openalex.org/keywords/computational-biology |
| keywords[3].score | 0.4881787896156311 |
| keywords[3].display_name | Computational biology |
| keywords[4].id | https://openalex.org/keywords/rna |
| keywords[4].score | 0.47907477617263794 |
| keywords[4].display_name | RNA |
| keywords[5].id | https://openalex.org/keywords/transcriptome |
| keywords[5].score | 0.40694254636764526 |
| keywords[5].display_name | Transcriptome |
| keywords[6].id | https://openalex.org/keywords/biology |
| keywords[6].score | 0.3640978932380676 |
| keywords[6].display_name | Biology |
| keywords[7].id | https://openalex.org/keywords/genetics |
| keywords[7].score | 0.22783508896827698 |
| keywords[7].display_name | Genetics |
| keywords[8].id | https://openalex.org/keywords/gene-expression |
| keywords[8].score | 0.18213510513305664 |
| keywords[8].display_name | Gene expression |
| keywords[9].id | https://openalex.org/keywords/gene |
| keywords[9].score | 0.1043233871459961 |
| keywords[9].display_name | Gene |
| language | en |
| locations[0].id | doi:10.1371/journal.pcbi.1012224 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S86033158 |
| locations[0].source.issn | 1553-734X, 1553-7358 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 1553-734X |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | PLoS Computational Biology |
| locations[0].source.host_organization | https://openalex.org/P4310315706 |
| locations[0].source.host_organization_name | Public Library of Science |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310315706 |
| locations[0].source.host_organization_lineage_names | Public Library of Science |
| locations[0].license | cc-by |
| locations[0].pdf_url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printable |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | PLOS Computational Biology |
| locations[0].landing_page_url | https://doi.org/10.1371/journal.pcbi.1012224 |
| locations[1].id | pmid:38995959 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | PLoS computational biology |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/38995959 |
| locations[2].id | pmh:oai:RePEc:plo:pcbi00:1012224 |
| locations[2].is_oa | False |
| locations[2].source.id | https://openalex.org/S4306401271 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | RePEc: Research Papers in Economics |
| locations[2].source.host_organization | https://openalex.org/I77793887 |
| locations[2].source.host_organization_name | Federal Reserve Bank of St. Louis |
| locations[2].source.host_organization_lineage | https://openalex.org/I77793887 |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | article |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | |
| locations[2].landing_page_url | https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1012224 |
| locations[3].id | pmh:oai:doaj.org/article:33e9bedf860f4e5da7bc9bb053277099 |
| locations[3].is_oa | False |
| locations[3].source.id | https://openalex.org/S4306401280 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | DOAJ (DOAJ: Directory of Open Access Journals) |
| locations[3].source.host_organization | |
| locations[3].source.host_organization_name | |
| locations[3].license | |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | article |
| locations[3].license_id | |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | PLoS Computational Biology, Vol 20, Iss 7, p e1012224 (2024) |
| locations[3].landing_page_url | https://doaj.org/article/33e9bedf860f4e5da7bc9bb053277099 |
| locations[4].id | pmh:oai:pubmedcentral.nih.gov:11364423 |
| locations[4].is_oa | True |
| locations[4].source.id | https://openalex.org/S2764455111 |
| locations[4].source.issn | |
| locations[4].source.type | repository |
| locations[4].source.is_oa | False |
| locations[4].source.issn_l | |
| locations[4].source.is_core | False |
| locations[4].source.is_in_doaj | False |
| locations[4].source.display_name | PubMed Central |
| locations[4].source.host_organization | https://openalex.org/I1299303238 |
| locations[4].source.host_organization_name | National Institutes of Health |
| locations[4].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[4].license | other-oa |
| locations[4].pdf_url | |
| locations[4].version | submittedVersion |
| locations[4].raw_type | Text |
| locations[4].license_id | https://openalex.org/licenses/other-oa |
| locations[4].is_accepted | False |
| locations[4].is_published | False |
| locations[4].raw_source_name | PLoS Comput Biol |
| locations[4].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/11364423 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5018868397 |
| authorships[0].author.orcid | |
| authorships[0].author.display_name | Pascal Grobecker |
| authorships[0].countries | CH |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I12708293, https://openalex.org/I1850255 |
| authorships[0].affiliations[0].raw_affiliation_string | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| authorships[0].institutions[0].id | https://openalex.org/I12708293 |
| authorships[0].institutions[0].ror | https://ror.org/002n09z45 |
| authorships[0].institutions[0].type | nonprofit |
| authorships[0].institutions[0].lineage | https://openalex.org/I12708293 |
| authorships[0].institutions[0].country_code | CH |
| authorships[0].institutions[0].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[0].institutions[1].id | https://openalex.org/I1850255 |
| authorships[0].institutions[1].ror | https://ror.org/02s6k3f65 |
| authorships[0].institutions[1].type | education |
| authorships[0].institutions[1].lineage | https://openalex.org/I1850255 |
| authorships[0].institutions[1].country_code | CH |
| authorships[0].institutions[1].display_name | University of Basel |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Pascal Grobecker |
| authorships[0].is_corresponding | True |
| authorships[0].raw_affiliation_strings | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| authorships[1].author.id | https://openalex.org/A5059519234 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Thomas Sakoparnig |
| authorships[1].countries | CH |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I12708293, https://openalex.org/I1850255 |
| authorships[1].affiliations[0].raw_affiliation_string | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| authorships[1].institutions[0].id | https://openalex.org/I12708293 |
| authorships[1].institutions[0].ror | https://ror.org/002n09z45 |
| authorships[1].institutions[0].type | nonprofit |
| authorships[1].institutions[0].lineage | https://openalex.org/I12708293 |
| authorships[1].institutions[0].country_code | CH |
| authorships[1].institutions[0].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[1].institutions[1].id | https://openalex.org/I1850255 |
| authorships[1].institutions[1].ror | https://ror.org/02s6k3f65 |
| authorships[1].institutions[1].type | education |
| authorships[1].institutions[1].lineage | https://openalex.org/I1850255 |
| authorships[1].institutions[1].country_code | CH |
| authorships[1].institutions[1].display_name | University of Basel |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Thomas Sakoparnig |
| authorships[1].is_corresponding | True |
| authorships[1].raw_affiliation_strings | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| authorships[2].author.id | https://openalex.org/A5017960972 |
| authorships[2].author.orcid | https://orcid.org/0000-0001-6338-1312 |
| authorships[2].author.display_name | Erik van Nimwegen |
| authorships[2].countries | CH |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I12708293, https://openalex.org/I1850255 |
| authorships[2].affiliations[0].raw_affiliation_string | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| authorships[2].institutions[0].id | https://openalex.org/I12708293 |
| authorships[2].institutions[0].ror | https://ror.org/002n09z45 |
| authorships[2].institutions[0].type | nonprofit |
| authorships[2].institutions[0].lineage | https://openalex.org/I12708293 |
| authorships[2].institutions[0].country_code | CH |
| authorships[2].institutions[0].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[2].institutions[1].id | https://openalex.org/I1850255 |
| authorships[2].institutions[1].ror | https://ror.org/02s6k3f65 |
| authorships[2].institutions[1].type | education |
| authorships[2].institutions[1].lineage | https://openalex.org/I1850255 |
| authorships[2].institutions[1].country_code | CH |
| authorships[2].institutions[1].display_name | University of Basel |
| authorships[2].author_position | last |
| authorships[2].raw_author_name | Erik van Nimwegen |
| authorships[2].is_corresponding | True |
| authorships[2].raw_affiliation_strings | Biozentrum, University of Basel and Swiss Institute of Bioinformatics, Basel, Switzerland |
| has_content.pdf | True |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printable |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Identifying cell states in single-cell RNA-seq data at statistically maximal resolution |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-23T05:10:03.516525 |
| primary_topic.id | https://openalex.org/T11289 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 1.0 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Single-cell and spatial transcriptomics |
| related_works | https://openalex.org/W4394359997, https://openalex.org/W4362511652, https://openalex.org/W4362511917, https://openalex.org/W4362511922, https://openalex.org/W4362485688, https://openalex.org/W4362530184, https://openalex.org/W4362530193, https://openalex.org/W2904597490, https://openalex.org/W4362485661, https://openalex.org/W4394365953 |
| cited_by_count | 7 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 6 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 1 |
| locations_count | 5 |
| best_oa_location.id | doi:10.1371/journal.pcbi.1012224 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S86033158 |
| best_oa_location.source.issn | 1553-734X, 1553-7358 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 1553-734X |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | PLoS Computational Biology |
| best_oa_location.source.host_organization | https://openalex.org/P4310315706 |
| best_oa_location.source.host_organization_name | Public Library of Science |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310315706 |
| best_oa_location.source.host_organization_lineage_names | Public Library of Science |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printable |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | PLOS Computational Biology |
| best_oa_location.landing_page_url | https://doi.org/10.1371/journal.pcbi.1012224 |
| primary_location.id | doi:10.1371/journal.pcbi.1012224 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S86033158 |
| primary_location.source.issn | 1553-734X, 1553-7358 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 1553-734X |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | PLoS Computational Biology |
| primary_location.source.host_organization | https://openalex.org/P4310315706 |
| primary_location.source.host_organization_name | Public Library of Science |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310315706 |
| primary_location.source.host_organization_lineage_names | Public Library of Science |
| primary_location.license | cc-by |
| primary_location.pdf_url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1012224&type=printable |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | PLOS Computational Biology |
| primary_location.landing_page_url | https://doi.org/10.1371/journal.pcbi.1012224 |
| publication_date | 2024-07-12 |
| publication_year | 2024 |
| referenced_works | https://openalex.org/W2612519453, https://openalex.org/W2788348358, https://openalex.org/W7052812575, https://openalex.org/W2801443765, https://openalex.org/W2799796757, https://openalex.org/W2612589628, https://openalex.org/W2904605567, https://openalex.org/W2953614989, https://openalex.org/W2956038331, https://openalex.org/W2805619986, https://openalex.org/W2901677030, https://openalex.org/W3157094484, https://openalex.org/W2739329039, https://openalex.org/W3177020879, https://openalex.org/W2998065416, https://openalex.org/W2088639752, https://openalex.org/W2523620612, https://openalex.org/W2601723752, https://openalex.org/W2946232537, https://openalex.org/W2806983506, https://openalex.org/W2528543174, https://openalex.org/W2526262591, https://openalex.org/W2795687816, https://openalex.org/W2811387400, https://openalex.org/W1979283544, https://openalex.org/W2514159238, https://openalex.org/W2606771561, https://openalex.org/W1813068103, https://openalex.org/W2598326928, https://openalex.org/W4291201938, https://openalex.org/W2884197618, https://openalex.org/W2951506174, https://openalex.org/W2889326414, https://openalex.org/W6756233103, https://openalex.org/W6800415945, https://openalex.org/W2951619011, https://openalex.org/W2968027223, https://openalex.org/W2949693884, https://openalex.org/W2909032196, https://openalex.org/W2772052731, https://openalex.org/W2907783748, https://openalex.org/W2979981308, https://openalex.org/W2894687190, https://openalex.org/W3008563687, https://openalex.org/W2989747508, https://openalex.org/W2187089797, https://openalex.org/W4385945347, https://openalex.org/W4213108508, https://openalex.org/W4233637541 |
| referenced_works_count | 49 |
| abstract_inverted_index.a | 6, 17, 38, 59, 118, 184, 244, 297, 351, 362 |
| abstract_inverted_index.In | 342 |
| abstract_inverted_index.On | 280 |
| abstract_inverted_index.We | 213, 238, 268 |
| abstract_inverted_index.ad | 97 |
| abstract_inverted_index.an | 23, 156 |
| abstract_inverted_index.as | 197, 296, 330 |
| abstract_inverted_index.at | 376 |
| abstract_inverted_index.in | 37, 67, 173, 243 |
| abstract_inverted_index.is | 41, 65, 126, 227 |
| abstract_inverted_index.it | 125 |
| abstract_inverted_index.of | 13, 19, 26, 29, 45, 48, 62, 90, 100, 113, 121, 155, 170, 203, 209, 222, 290, 307, 319, 326, 334, 354, 365, 379 |
| abstract_inverted_index.on | 251, 271, 316, 323 |
| abstract_inverted_index.or | 102 |
| abstract_inverted_index.so | 196 |
| abstract_inverted_index.to | 10, 129, 134, 183, 193, 198, 344, 356, 368, 383 |
| abstract_inverted_index.we | 143, 191, 302, 348 |
| abstract_inverted_index.RNA | 1 |
| abstract_inverted_index.UMI | 338 |
| abstract_inverted_index.aim | 192 |
| abstract_inverted_index.and | 52, 116, 179, 230, 255, 261, 321, 336, 382 |
| abstract_inverted_index.any | 208 |
| abstract_inverted_index.are | 35, 176, 293 |
| abstract_inverted_index.for | 70 |
| abstract_inverted_index.has | 4 |
| abstract_inverted_index.hoc | 98 |
| abstract_inverted_index.is, | 190 |
| abstract_inverted_index.its | 53, 210, 232 |
| abstract_inverted_index.not | 322 |
| abstract_inverted_index.per | 340 |
| abstract_inverted_index.raw | 149, 253 |
| abstract_inverted_index.the | 27, 46, 49, 68, 108, 135, 145, 160, 166, 171, 201, 204, 217, 252, 258, 305, 317, 327, 331, 345, 358, 370 |
| abstract_inverted_index.Here | 142 |
| abstract_inverted_index.That | 189 |
| abstract_inverted_index.also | 349 |
| abstract_inverted_index.cell | 299 |
| abstract_inverted_index.data | 51, 254 |
| abstract_inverted_index.each | 174, 180, 377 |
| abstract_inverted_index.from | 139, 235 |
| abstract_inverted_index.gene | 14, 31, 167, 186, 308 |
| abstract_inverted_index.have | 239 |
| abstract_inverted_index.into | 73, 162, 361 |
| abstract_inverted_index.lack | 82 |
| abstract_inverted_index.most | 371 |
| abstract_inverted_index.real | 281 |
| abstract_inverted_index.show | 214, 269, 303 |
| abstract_inverted_index.such | 164, 329 |
| abstract_inverted_index.that | 34, 137, 165, 292, 304, 311 |
| abstract_inverted_index.this | 225, 241, 380 |
| abstract_inverted_index.tool | 245, 347 |
| abstract_inverted_index.tree | 364 |
| abstract_inverted_index.vast | 60 |
| abstract_inverted_index.with | 75, 264 |
| abstract_inverted_index.zero | 265 |
| abstract_inverted_index.(UMI) | 153 |
| abstract_inverted_index.Given | 148 |
| abstract_inverted_index.cell. | 341 |
| abstract_inverted_index.cells | 72, 161, 172, 195, 291, 335 |
| abstract_inverted_index.count | 339 |
| abstract_inverted_index.data, | 224, 282 |
| abstract_inverted_index.first | 236 |
| abstract_inverted_index.genes | 375 |
| abstract_inverted_index.given | 39, 216 |
| abstract_inverted_index.known | 109, 218 |
| abstract_inverted_index.large | 119 |
| abstract_inverted_index.noise | 56, 111, 220 |
| abstract_inverted_index.often | 106 |
| abstract_inverted_index.place | 357 |
| abstract_inverted_index.small | 352 |
| abstract_inverted_index.study | 11 |
| abstract_inverted_index.that, | 215, 270 |
| abstract_inverted_index.these | 79, 140, 385 |
| abstract_inverted_index.total | 332, 337 |
| abstract_inverted_index.type. | 300 |
| abstract_inverted_index.which | 248 |
| abstract_inverted_index.almost | 275 |
| abstract_inverted_index.assign | 130 |
| abstract_inverted_index.become | 5 |
| abstract_inverted_index.branch | 378 |
| abstract_inverted_index.called | 246 |
| abstract_inverted_index.cells, | 105 |
| abstract_inverted_index.cells. | 20 |
| abstract_inverted_index.common | 298 |
| abstract_inverted_index.counts | 154 |
| abstract_inverted_index.derive | 231 |
| abstract_inverted_index.employ | 96 |
| abstract_inverted_index.groups | 289 |
| abstract_inverted_index.highly | 42 |
| abstract_inverted_index.ignore | 107 |
| abstract_inverted_index.layers | 89 |
| abstract_inverted_index.method | 9 |
| abstract_inverted_index.number | 61, 120, 333 |
| abstract_inverted_index.origin | 320 |
| abstract_inverted_index.reduce | 200 |
| abstract_inverted_index.result | 138 |
| abstract_inverted_index.state. | 188 |
| abstract_inverted_index.states | 33, 169, 310 |
| abstract_inverted_index.subset | 175, 181 |
| abstract_inverted_index.subtle | 286 |
| abstract_inverted_index.tissue | 318 |
| abstract_inverted_index.unique | 150, 233 |
| abstract_inverted_index.within | 16, 288 |
| abstract_inverted_index.address | 144 |
| abstract_inverted_index.applied | 66 |
| abstract_inverted_index.because | 44 |
| abstract_inverted_index.between | 104 |
| abstract_inverted_index.cluster | 262 |
| abstract_inverted_index.complex | 88 |
| abstract_inverted_index.dataset | 40, 205 |
| abstract_inverted_index.depends | 315 |
| abstract_inverted_index.feature | 93 |
| abstract_inverted_index.include | 117 |
| abstract_inverted_index.involve | 86 |
| abstract_inverted_index.meaning | 133 |
| abstract_inverted_index.methods | 64, 80 |
| abstract_inverted_index.number, | 263 |
| abstract_inverted_index.optimal | 259, 278 |
| abstract_inverted_index.picture | 25 |
| abstract_inverted_index.popular | 7 |
| abstract_inverted_index.present | 36 |
| abstract_inverted_index.problem | 226 |
| abstract_inverted_index.provide | 350 |
| abstract_inverted_index.subsets | 74, 163 |
| abstract_inverted_index.toolbox | 353 |
| abstract_inverted_index.tunable | 122, 266 |
| abstract_inverted_index.without | 206 |
| abstract_inverted_index.Although | 58 |
| abstract_inverted_index.However, | 21 |
| abstract_inverted_index.accurate | 24 |
| abstract_inverted_index.addition | 343 |
| abstract_inverted_index.concrete | 131 |
| abstract_inverted_index.dataset, | 158 |
| abstract_inverted_index.directly | 250 |
| abstract_inverted_index.distance | 101 |
| abstract_inverted_index.distinct | 30, 185 |
| abstract_inverted_index.features | 325 |
| abstract_inverted_index.identify | 369 |
| abstract_inverted_index.measures | 99 |
| abstract_inverted_index.methods. | 141 |
| abstract_inverted_index.molecule | 151 |
| abstract_inverted_index.multiple | 87 |
| abstract_inverted_index.operates | 249 |
| abstract_inverted_index.problem: | 147 |
| abstract_inverted_index.recovers | 277 |
| abstract_inverted_index.removing | 207 |
| abstract_inverted_index.results. | 386 |
| abstract_inverted_index.robustly | 284 |
| abstract_inverted_index.software | 355 |
| abstract_inverted_index.solution | 234, 242 |
| abstract_inverted_index.sparsity | 47 |
| abstract_inverted_index.Moreover, | 301 |
| abstract_inverted_index.annotated | 295 |
| abstract_inverted_index.clusters, | 367 |
| abstract_inverted_index.datasets, | 273 |
| abstract_inverted_index.different | 63 |
| abstract_inverted_index.diversity | 28, 306 |
| abstract_inverted_index.expressed | 374 |
| abstract_inverted_index.following | 146 |
| abstract_inverted_index.generally | 81 |
| abstract_inverted_index.important | 372 |
| abstract_inverted_index.maximally | 199 |
| abstract_inverted_index.obtaining | 22 |
| abstract_inverted_index.partition | 159, 194, 260 |
| abstract_inverted_index.perfectly | 276 |
| abstract_inverted_index.profiles, | 78 |
| abstract_inverted_index.scRNA-seq | 50, 114, 157, 223 |
| abstract_inverted_index.specified | 84 |
| abstract_inverted_index.structure | 221 |
| abstract_inverted_index.synthetic | 272 |
| abstract_inverted_index.technical | 324 |
| abstract_inverted_index.variation | 12 |
| abstract_inverted_index.virtually | 127 |
| abstract_inverted_index.visualize | 384 |
| abstract_inverted_index.Cellstates | 247, 274, 283, 312, 346 |
| abstract_inverted_index.cellstates | 360 |
| abstract_inverted_index.clustering | 71 |
| abstract_inverted_index.complexity | 202 |
| abstract_inverted_index.determines | 257 |
| abstract_inverted_index.expression | 15, 32, 77, 168, 187, 309 |
| abstract_inverted_index.filtering, | 92 |
| abstract_inverted_index.hierarchy, | 381 |
| abstract_inverted_index.identified | 359 |
| abstract_inverted_index.identifier | 152 |
| abstract_inverted_index.identifies | 285, 313 |
| abstract_inverted_index.impossible | 128 |
| abstract_inverted_index.literature | 69 |
| abstract_inverted_index.meaningful | 211 |
| abstract_inverted_index.population | 18 |
| abstract_inverted_index.properties | 112 |
| abstract_inverted_index.rigorously | 83 |
| abstract_inverted_index.selection, | 94 |
| abstract_inverted_index.sequencing | 2 |
| abstract_inverted_index.similarity | 103 |
| abstract_inverted_index.structure. | 212 |
| abstract_inverted_index.(scRNA-seq) | 3 |
| abstract_inverted_index.Single-cell | 0 |
| abstract_inverted_index.biophysical | 132 |
| abstract_inverted_index.challenging | 43 |
| abstract_inverted_index.clusterings | 136 |
| abstract_inverted_index.corresponds | 182 |
| abstract_inverted_index.experiments | 328 |
| abstract_inverted_index.implemented | 240 |
| abstract_inverted_index.measurement | 55, 110, 219 |
| abstract_inverted_index.objectives, | 85 |
| abstract_inverted_index.parameters. | 123, 267 |
| abstract_inverted_index.partitions. | 279 |
| abstract_inverted_index.principles. | 237 |
| abstract_inverted_index.properties. | 57 |
| abstract_inverted_index.experimental | 8 |
| abstract_inverted_index.hierarchical | 363 |
| abstract_inverted_index.higher-order | 366 |
| abstract_inverted_index.substructure | 287 |
| abstract_inverted_index.well-defined | 229 |
| abstract_inverted_index.Consequently, | 124 |
| abstract_inverted_index.automatically | 256 |
| abstract_inverted_index.inhomogeneous | 54 |
| abstract_inverted_index.measurements, | 115 |
| abstract_inverted_index.statistically | 177 |
| abstract_inverted_index.traditionally | 294 |
| abstract_inverted_index.‘similar’ | 76 |
| abstract_inverted_index.differentially | 373 |
| abstract_inverted_index.mathematically | 228 |
| abstract_inverted_index.normalization, | 91 |
| abstract_inverted_index.systematically | 314 |
| abstract_inverted_index.indistinguishable, | 178 |
| abstract_inverted_index.dimensionality-reduction, | 95 |
| cited_by_percentile_year.max | 99 |
| cited_by_percentile_year.min | 90 |
| corresponding_author_ids | https://openalex.org/A5059519234, https://openalex.org/A5017960972, https://openalex.org/A5018868397 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 3 |
| corresponding_institution_ids | https://openalex.org/I12708293, https://openalex.org/I1850255 |
| citation_normalized_percentile.value | 0.8802942 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | True |