Topiary: Pruning the manual labor from ancestral sequence reconstruction Article Swipe
YOU?
·
· 2022
· Open Access
·
· DOI: https://doi.org/10.1002/pro.4551
Ancestral sequence reconstruction (ASR) is a powerful tool to study the evolution of proteins and thus gain deep insight into the relationships among protein sequence, structure, and function. A major barrier to its broad use is the complexity of the task: it requires multiple software packages, complex file manipulations, and expert phylogenetic knowledge. Here we introduce topiary , a software pipeline that aims to overcome this barrier. To use topiary, users prepare a spreadsheet with a handful of sequences. Topiary then: (1) Infers the taxonomic scope for the ASR study and finds relevant sequences by BLAST; (2) Does taxonomically informed sequence quality control and redundancy reduction; (3) Constructs a multiple sequence alignment; (4) Generates a maximum‐likelihood gene tree; (5) Reconciles the gene tree to the species tree; (6) Reconstructs ancestral amino acid sequences; and (7) Determines branch supports. The pipeline returns annotated evolutionary trees, spreadsheets with sequences, and graphical summaries of ancestor quality. This is achieved by integrating modern phylogenetics software (Muscle5, RAxML‐NG, GeneRax, and PastML) with online databases (NCBI and the Open Tree of Life). In this paper, we introduce non‐expert readers to the steps required for ASR, describe the specific design choices made in topiary , provide a detailed protocol for users, and then validate the pipeline using datasets from a broad collection of protein families. Topiary is freely available for download: https://github.com/harmslab/topiary .
Related Topics
- Type
- article
- Language
- en
- Landing Page
- https://doi.org/10.1002/pro.4551
- OA Status
- green
- Cited By
- 13
- References
- 52
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W4312121546
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W4312121546Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1002/pro.4551Digital Object Identifier
- Title
-
Topiary: Pruning the manual labor from ancestral sequence reconstructionWork title
- Type
-
articleOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2022Year of publication
- Publication date
-
2022-12-24Full publication date if available
- Authors
-
Kona N. Orlandi, Sophia R. Phillips, Zachary R. Sailer, Joseph Harman, Michael J. HarmsList of authors in order
- Landing page
-
https://doi.org/10.1002/pro.4551Publisher landing page
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://www.ncbi.nlm.nih.gov/pmc/articles/9847077Direct OA link when available
- Concepts
-
Computer science, Pruning, Phylogenetic tree, Multiple sequence alignment, Software, Pipeline (software), Tree (set theory), Sequence (biology), Sequence alignment, Data mining, Artificial intelligence, Machine learning, Biology, Gene, Peptide sequence, Programming language, Genetics, Mathematical analysis, Agronomy, MathematicsTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
13Total citation count in OpenAlex
- Citations by year (recent)
-
2025: 2, 2024: 4, 2023: 5, 2022: 2Per-year citation counts (last 5 years)
- References (count)
-
52Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W4312121546 |
|---|---|
| doi | https://doi.org/10.1002/pro.4551 |
| ids.doi | https://doi.org/10.1002/pro.4551 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/36565302 |
| ids.openalex | https://openalex.org/W4312121546 |
| fwci | 1.59989905 |
| mesh[0].qualifier_ui | |
| mesh[0].descriptor_ui | D010802 |
| mesh[0].is_major_topic | False |
| mesh[0].qualifier_name | |
| mesh[0].descriptor_name | Phylogeny |
| mesh[1].qualifier_ui | |
| mesh[1].descriptor_ui | D012984 |
| mesh[1].is_major_topic | True |
| mesh[1].qualifier_name | |
| mesh[1].descriptor_name | Software |
| mesh[2].qualifier_ui | |
| mesh[2].descriptor_ui | D000595 |
| mesh[2].is_major_topic | False |
| mesh[2].qualifier_name | |
| mesh[2].descriptor_name | Amino Acid Sequence |
| mesh[3].qualifier_ui | Q000235 |
| mesh[3].descriptor_ui | D011506 |
| mesh[3].is_major_topic | True |
| mesh[3].qualifier_name | genetics |
| mesh[3].descriptor_name | Proteins |
| mesh[4].qualifier_ui | Q000737 |
| mesh[4].descriptor_ui | D011506 |
| mesh[4].is_major_topic | True |
| mesh[4].qualifier_name | chemistry |
| mesh[4].descriptor_name | Proteins |
| mesh[5].qualifier_ui | |
| mesh[5].descriptor_ui | D016415 |
| mesh[5].is_major_topic | False |
| mesh[5].qualifier_name | |
| mesh[5].descriptor_name | Sequence Alignment |
| mesh[6].qualifier_ui | |
| mesh[6].descriptor_ui | D019143 |
| mesh[6].is_major_topic | False |
| mesh[6].qualifier_name | |
| mesh[6].descriptor_name | Evolution, Molecular |
| mesh[7].qualifier_ui | |
| mesh[7].descriptor_ui | D010802 |
| mesh[7].is_major_topic | False |
| mesh[7].qualifier_name | |
| mesh[7].descriptor_name | Phylogeny |
| mesh[8].qualifier_ui | |
| mesh[8].descriptor_ui | D012984 |
| mesh[8].is_major_topic | True |
| mesh[8].qualifier_name | |
| mesh[8].descriptor_name | Software |
| mesh[9].qualifier_ui | |
| mesh[9].descriptor_ui | D000595 |
| mesh[9].is_major_topic | False |
| mesh[9].qualifier_name | |
| mesh[9].descriptor_name | Amino Acid Sequence |
| mesh[10].qualifier_ui | Q000235 |
| mesh[10].descriptor_ui | D011506 |
| mesh[10].is_major_topic | True |
| mesh[10].qualifier_name | genetics |
| mesh[10].descriptor_name | Proteins |
| mesh[11].qualifier_ui | Q000737 |
| mesh[11].descriptor_ui | D011506 |
| mesh[11].is_major_topic | True |
| mesh[11].qualifier_name | chemistry |
| mesh[11].descriptor_name | Proteins |
| mesh[12].qualifier_ui | |
| mesh[12].descriptor_ui | D016415 |
| mesh[12].is_major_topic | False |
| mesh[12].qualifier_name | |
| mesh[12].descriptor_name | Sequence Alignment |
| mesh[13].qualifier_ui | |
| mesh[13].descriptor_ui | D019143 |
| mesh[13].is_major_topic | False |
| mesh[13].qualifier_name | |
| mesh[13].descriptor_name | Evolution, Molecular |
| mesh[14].qualifier_ui | |
| mesh[14].descriptor_ui | D010802 |
| mesh[14].is_major_topic | False |
| mesh[14].qualifier_name | |
| mesh[14].descriptor_name | Phylogeny |
| mesh[15].qualifier_ui | |
| mesh[15].descriptor_ui | D012984 |
| mesh[15].is_major_topic | True |
| mesh[15].qualifier_name | |
| mesh[15].descriptor_name | Software |
| mesh[16].qualifier_ui | |
| mesh[16].descriptor_ui | D000595 |
| mesh[16].is_major_topic | False |
| mesh[16].qualifier_name | |
| mesh[16].descriptor_name | Amino Acid Sequence |
| mesh[17].qualifier_ui | Q000235 |
| mesh[17].descriptor_ui | D011506 |
| mesh[17].is_major_topic | True |
| mesh[17].qualifier_name | genetics |
| mesh[17].descriptor_name | Proteins |
| mesh[18].qualifier_ui | Q000737 |
| mesh[18].descriptor_ui | D011506 |
| mesh[18].is_major_topic | True |
| mesh[18].qualifier_name | chemistry |
| mesh[18].descriptor_name | Proteins |
| mesh[19].qualifier_ui | |
| mesh[19].descriptor_ui | D016415 |
| mesh[19].is_major_topic | False |
| mesh[19].qualifier_name | |
| mesh[19].descriptor_name | Sequence Alignment |
| mesh[20].qualifier_ui | |
| mesh[20].descriptor_ui | D019143 |
| mesh[20].is_major_topic | False |
| mesh[20].qualifier_name | |
| mesh[20].descriptor_name | Evolution, Molecular |
| type | article |
| title | Topiary: Pruning the manual labor from ancestral sequence reconstruction |
| awards[0].id | https://openalex.org/G7482845915 |
| awards[0].funder_id | https://openalex.org/F4320337354 |
| awards[0].display_name | |
| awards[0].funder_award_id | 1R01GM146114‐01 |
| awards[0].funder_display_name | National Institute of General Medical Sciences |
| awards[1].id | https://openalex.org/G4465247516 |
| awards[1].funder_id | https://openalex.org/F4320337400 |
| awards[1].display_name | |
| awards[1].funder_award_id | NSF CAREER Award DEB‐1844963 |
| awards[1].funder_display_name | Division of Environmental Biology |
| awards[2].id | https://openalex.org/G2944139568 |
| awards[2].funder_id | https://openalex.org/F4320337354 |
| awards[2].display_name | |
| awards[2].funder_award_id | 7T32GM007759 |
| awards[2].funder_display_name | National Institute of General Medical Sciences |
| awards[3].id | https://openalex.org/G4752504285 |
| awards[3].funder_id | https://openalex.org/F4320337354 |
| awards[3].display_name | |
| awards[3].funder_award_id | T32GM007413 |
| awards[3].funder_display_name | National Institute of General Medical Sciences |
| biblio.issue | 2 |
| biblio.volume | 32 |
| biblio.last_page | e4551 |
| biblio.first_page | e4551 |
| topics[0].id | https://openalex.org/T10015 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Genomics and Phylogenetic Studies |
| topics[1].id | https://openalex.org/T10012 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9919000267982483 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1311 |
| topics[1].subfield.display_name | Genetics |
| topics[1].display_name | Genetic diversity and population structure |
| topics[2].id | https://openalex.org/T11764 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9724000096321106 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1311 |
| topics[2].subfield.display_name | Genetics |
| topics[2].display_name | Evolution and Genetic Dynamics |
| funders[0].id | https://openalex.org/F4320337354 |
| funders[0].ror | https://ror.org/04q48ey07 |
| funders[0].display_name | National Institute of General Medical Sciences |
| funders[1].id | https://openalex.org/F4320337400 |
| funders[1].ror | https://ror.org/03g87he71 |
| funders[1].display_name | Division of Environmental Biology |
| is_xpac | False |
| apc_list.value | 4070 |
| apc_list.currency | USD |
| apc_list.value_usd | 4070 |
| apc_paid | |
| concepts[0].id | https://openalex.org/C41008148 |
| concepts[0].level | 0 |
| concepts[0].score | 0.6375146508216858 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[0].display_name | Computer science |
| concepts[1].id | https://openalex.org/C108010975 |
| concepts[1].level | 2 |
| concepts[1].score | 0.6270495057106018 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q500094 |
| concepts[1].display_name | Pruning |
| concepts[2].id | https://openalex.org/C193252679 |
| concepts[2].level | 3 |
| concepts[2].score | 0.614266574382782 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q242125 |
| concepts[2].display_name | Phylogenetic tree |
| concepts[3].id | https://openalex.org/C88031987 |
| concepts[3].level | 5 |
| concepts[3].score | 0.5618060827255249 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1377767 |
| concepts[3].display_name | Multiple sequence alignment |
| concepts[4].id | https://openalex.org/C2777904410 |
| concepts[4].level | 2 |
| concepts[4].score | 0.5365716218948364 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q7397 |
| concepts[4].display_name | Software |
| concepts[5].id | https://openalex.org/C43521106 |
| concepts[5].level | 2 |
| concepts[5].score | 0.5356836318969727 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q2165493 |
| concepts[5].display_name | Pipeline (software) |
| concepts[6].id | https://openalex.org/C113174947 |
| concepts[6].level | 2 |
| concepts[6].score | 0.532524585723877 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q2859736 |
| concepts[6].display_name | Tree (set theory) |
| concepts[7].id | https://openalex.org/C2778112365 |
| concepts[7].level | 2 |
| concepts[7].score | 0.44904860854148865 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q3511065 |
| concepts[7].display_name | Sequence (biology) |
| concepts[8].id | https://openalex.org/C45484198 |
| concepts[8].level | 4 |
| concepts[8].score | 0.4386853873729706 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q827246 |
| concepts[8].display_name | Sequence alignment |
| concepts[9].id | https://openalex.org/C124101348 |
| concepts[9].level | 1 |
| concepts[9].score | 0.37486132979393005 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[9].display_name | Data mining |
| concepts[10].id | https://openalex.org/C154945302 |
| concepts[10].level | 1 |
| concepts[10].score | 0.32984858751296997 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[10].display_name | Artificial intelligence |
| concepts[11].id | https://openalex.org/C119857082 |
| concepts[11].level | 1 |
| concepts[11].score | 0.32724449038505554 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q2539 |
| concepts[11].display_name | Machine learning |
| concepts[12].id | https://openalex.org/C86803240 |
| concepts[12].level | 0 |
| concepts[12].score | 0.3024106025695801 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[12].display_name | Biology |
| concepts[13].id | https://openalex.org/C104317684 |
| concepts[13].level | 2 |
| concepts[13].score | 0.14787355065345764 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[13].display_name | Gene |
| concepts[14].id | https://openalex.org/C167625842 |
| concepts[14].level | 3 |
| concepts[14].score | 0.1267755627632141 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q899763 |
| concepts[14].display_name | Peptide sequence |
| concepts[15].id | https://openalex.org/C199360897 |
| concepts[15].level | 1 |
| concepts[15].score | 0.12601524591445923 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[15].display_name | Programming language |
| concepts[16].id | https://openalex.org/C54355233 |
| concepts[16].level | 1 |
| concepts[16].score | 0.11742052435874939 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[16].display_name | Genetics |
| concepts[17].id | https://openalex.org/C134306372 |
| concepts[17].level | 1 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q7754 |
| concepts[17].display_name | Mathematical analysis |
| concepts[18].id | https://openalex.org/C6557445 |
| concepts[18].level | 1 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q173113 |
| concepts[18].display_name | Agronomy |
| concepts[19].id | https://openalex.org/C33923547 |
| concepts[19].level | 0 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[19].display_name | Mathematics |
| keywords[0].id | https://openalex.org/keywords/computer-science |
| keywords[0].score | 0.6375146508216858 |
| keywords[0].display_name | Computer science |
| keywords[1].id | https://openalex.org/keywords/pruning |
| keywords[1].score | 0.6270495057106018 |
| keywords[1].display_name | Pruning |
| keywords[2].id | https://openalex.org/keywords/phylogenetic-tree |
| keywords[2].score | 0.614266574382782 |
| keywords[2].display_name | Phylogenetic tree |
| keywords[3].id | https://openalex.org/keywords/multiple-sequence-alignment |
| keywords[3].score | 0.5618060827255249 |
| keywords[3].display_name | Multiple sequence alignment |
| keywords[4].id | https://openalex.org/keywords/software |
| keywords[4].score | 0.5365716218948364 |
| keywords[4].display_name | Software |
| keywords[5].id | https://openalex.org/keywords/pipeline |
| keywords[5].score | 0.5356836318969727 |
| keywords[5].display_name | Pipeline (software) |
| keywords[6].id | https://openalex.org/keywords/tree |
| keywords[6].score | 0.532524585723877 |
| keywords[6].display_name | Tree (set theory) |
| keywords[7].id | https://openalex.org/keywords/sequence |
| keywords[7].score | 0.44904860854148865 |
| keywords[7].display_name | Sequence (biology) |
| keywords[8].id | https://openalex.org/keywords/sequence-alignment |
| keywords[8].score | 0.4386853873729706 |
| keywords[8].display_name | Sequence alignment |
| keywords[9].id | https://openalex.org/keywords/data-mining |
| keywords[9].score | 0.37486132979393005 |
| keywords[9].display_name | Data mining |
| keywords[10].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[10].score | 0.32984858751296997 |
| keywords[10].display_name | Artificial intelligence |
| keywords[11].id | https://openalex.org/keywords/machine-learning |
| keywords[11].score | 0.32724449038505554 |
| keywords[11].display_name | Machine learning |
| keywords[12].id | https://openalex.org/keywords/biology |
| keywords[12].score | 0.3024106025695801 |
| keywords[12].display_name | Biology |
| keywords[13].id | https://openalex.org/keywords/gene |
| keywords[13].score | 0.14787355065345764 |
| keywords[13].display_name | Gene |
| keywords[14].id | https://openalex.org/keywords/peptide-sequence |
| keywords[14].score | 0.1267755627632141 |
| keywords[14].display_name | Peptide sequence |
| keywords[15].id | https://openalex.org/keywords/programming-language |
| keywords[15].score | 0.12601524591445923 |
| keywords[15].display_name | Programming language |
| keywords[16].id | https://openalex.org/keywords/genetics |
| keywords[16].score | 0.11742052435874939 |
| keywords[16].display_name | Genetics |
| language | en |
| locations[0].id | doi:10.1002/pro.4551 |
| locations[0].is_oa | False |
| locations[0].source.id | https://openalex.org/S156919612 |
| locations[0].source.issn | 0961-8368, 1469-896X |
| locations[0].source.type | journal |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | 0961-8368 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | Protein Science |
| locations[0].source.host_organization | https://openalex.org/P4310320595 |
| locations[0].source.host_organization_name | Wiley |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310320595 |
| locations[0].source.host_organization_lineage_names | Wiley |
| locations[0].license | |
| locations[0].pdf_url | |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | Protein Science |
| locations[0].landing_page_url | https://doi.org/10.1002/pro.4551 |
| locations[1].id | pmid:36565302 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | Protein science : a publication of the Protein Society |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/36565302 |
| locations[2].id | pmh:oai:pubmedcentral.nih.gov:9847077 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S2764455111 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | PubMed Central |
| locations[2].source.host_organization | https://openalex.org/I1299303238 |
| locations[2].source.host_organization_name | National Institutes of Health |
| locations[2].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[2].license | |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | Text |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | Protein Sci |
| locations[2].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/9847077 |
| indexed_in | crossref, pubmed |
| authorships[0].author.id | https://openalex.org/A5043733464 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-6426-4764 |
| authorships[0].author.display_name | Kona N. Orlandi |
| authorships[0].countries | US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I181233156 |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[0].affiliations[1].institution_ids | https://openalex.org/I181233156 |
| authorships[0].affiliations[1].raw_affiliation_string | Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[0].affiliations[2].raw_affiliation_string | Kona N. Orlandi and Sophia R. Phillips contributed equally to this work. |
| authorships[0].institutions[0].id | https://openalex.org/I181233156 |
| authorships[0].institutions[0].ror | https://ror.org/0293rh119 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I181233156 |
| authorships[0].institutions[0].country_code | US |
| authorships[0].institutions[0].display_name | University of Oregon |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Kona N. Orlandi |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Department of Biology, University of Oregon, Eugene, Oregon, USA, Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA, Kona N. Orlandi and Sophia R. Phillips contributed equally to this work. |
| authorships[1].author.id | https://openalex.org/A5025383308 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-6778-5194 |
| authorships[1].author.display_name | Sophia R. Phillips |
| authorships[1].countries | US |
| authorships[1].affiliations[0].raw_affiliation_string | Kona N. Orlandi and Sophia R. Phillips contributed equally to this work. |
| authorships[1].affiliations[1].institution_ids | https://openalex.org/I181233156 |
| authorships[1].affiliations[1].raw_affiliation_string | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA |
| authorships[1].affiliations[2].institution_ids | https://openalex.org/I181233156 |
| authorships[1].affiliations[2].raw_affiliation_string | Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[1].institutions[0].id | https://openalex.org/I181233156 |
| authorships[1].institutions[0].ror | https://ror.org/0293rh119 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I181233156 |
| authorships[1].institutions[0].country_code | US |
| authorships[1].institutions[0].display_name | University of Oregon |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Sophia R. Phillips |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA, Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA, Kona N. Orlandi and Sophia R. Phillips contributed equally to this work. |
| authorships[2].author.id | https://openalex.org/A5056170020 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-3260-619X |
| authorships[2].author.display_name | Zachary R. Sailer |
| authorships[2].countries | US |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I181233156 |
| authorships[2].affiliations[0].raw_affiliation_string | Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[2].affiliations[1].institution_ids | https://openalex.org/I181233156 |
| authorships[2].affiliations[1].raw_affiliation_string | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA |
| authorships[2].institutions[0].id | https://openalex.org/I181233156 |
| authorships[2].institutions[0].ror | https://ror.org/0293rh119 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I181233156 |
| authorships[2].institutions[0].country_code | US |
| authorships[2].institutions[0].display_name | University of Oregon |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Zachary R. Sailer |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA, Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[3].author.id | https://openalex.org/A5000329191 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-8283-0301 |
| authorships[3].author.display_name | Joseph Harman |
| authorships[3].countries | US |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I181233156 |
| authorships[3].affiliations[0].raw_affiliation_string | Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[3].affiliations[1].institution_ids | https://openalex.org/I181233156 |
| authorships[3].affiliations[1].raw_affiliation_string | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA |
| authorships[3].institutions[0].id | https://openalex.org/I181233156 |
| authorships[3].institutions[0].ror | https://ror.org/0293rh119 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I181233156 |
| authorships[3].institutions[0].country_code | US |
| authorships[3].institutions[0].display_name | University of Oregon |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Joseph L. Harman |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA, Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[4].author.id | https://openalex.org/A5048475119 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-0241-4122 |
| authorships[4].author.display_name | Michael J. Harms |
| authorships[4].countries | US |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I181233156 |
| authorships[4].affiliations[0].raw_affiliation_string | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA |
| authorships[4].affiliations[1].institution_ids | https://openalex.org/I181233156 |
| authorships[4].affiliations[1].raw_affiliation_string | Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| authorships[4].institutions[0].id | https://openalex.org/I181233156 |
| authorships[4].institutions[0].ror | https://ror.org/0293rh119 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I181233156 |
| authorships[4].institutions[0].country_code | US |
| authorships[4].institutions[0].display_name | University of Oregon |
| authorships[4].author_position | last |
| authorships[4].raw_author_name | Michael J. Harms |
| authorships[4].is_corresponding | True |
| authorships[4].raw_affiliation_strings | Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA, Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.ncbi.nlm.nih.gov/pmc/articles/9847077 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Topiary: Pruning the manual labor from ancestral sequence reconstruction |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10015 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Genomics and Phylogenetic Studies |
| related_works | https://openalex.org/W2132972885, https://openalex.org/W4248019281, https://openalex.org/W2064153754, https://openalex.org/W2051969447, https://openalex.org/W2111937814, https://openalex.org/W2162923930, https://openalex.org/W1482324242, https://openalex.org/W2739032002, https://openalex.org/W2133116680, https://openalex.org/W2091678889 |
| cited_by_count | 13 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 2 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 4 |
| counts_by_year[2].year | 2023 |
| counts_by_year[2].cited_by_count | 5 |
| counts_by_year[3].year | 2022 |
| counts_by_year[3].cited_by_count | 2 |
| locations_count | 3 |
| best_oa_location.id | pmh:oai:pubmedcentral.nih.gov:9847077 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S2764455111 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | PubMed Central |
| best_oa_location.source.host_organization | https://openalex.org/I1299303238 |
| best_oa_location.source.host_organization_name | National Institutes of Health |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I1299303238 |
| best_oa_location.license | |
| best_oa_location.pdf_url | |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | Text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | Protein Sci |
| best_oa_location.landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/9847077 |
| primary_location.id | doi:10.1002/pro.4551 |
| primary_location.is_oa | False |
| primary_location.source.id | https://openalex.org/S156919612 |
| primary_location.source.issn | 0961-8368, 1469-896X |
| primary_location.source.type | journal |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | 0961-8368 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | Protein Science |
| primary_location.source.host_organization | https://openalex.org/P4310320595 |
| primary_location.source.host_organization_name | Wiley |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310320595 |
| primary_location.source.host_organization_lineage_names | Wiley |
| primary_location.license | |
| primary_location.pdf_url | |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | Protein Science |
| primary_location.landing_page_url | https://doi.org/10.1002/pro.4551 |
| publication_date | 2022-12-24 |
| publication_year | 2022 |
| referenced_works | https://openalex.org/W2144654387, https://openalex.org/W2081421232, https://openalex.org/W2158714788, https://openalex.org/W2232479414, https://openalex.org/W2002290225, https://openalex.org/W2908220830, https://openalex.org/W2114850508, https://openalex.org/W2479347895, https://openalex.org/W2980958588, https://openalex.org/W4309191898, https://openalex.org/W2543629455, https://openalex.org/W2030966943, https://openalex.org/W2148294135, https://openalex.org/W2170747616, https://openalex.org/W3088109935, https://openalex.org/W2110105435, https://openalex.org/W2070579299, https://openalex.org/W3014702047, https://openalex.org/W2027351435, https://openalex.org/W2277953252, https://openalex.org/W2949567053, https://openalex.org/W3189492453, https://openalex.org/W2415398545, https://openalex.org/W2949867654, https://openalex.org/W2170964688, https://openalex.org/W3217175664, https://openalex.org/W2152005166, https://openalex.org/W3162621204, https://openalex.org/W3032933344, https://openalex.org/W1846926012, https://openalex.org/W2995940204, https://openalex.org/W2889019390, https://openalex.org/W2151736966, https://openalex.org/W2953087319, https://openalex.org/W3164145451, https://openalex.org/W2139736670, https://openalex.org/W4251751280, https://openalex.org/W2164541345, https://openalex.org/W4220897858, https://openalex.org/W2796046773, https://openalex.org/W2120772351, https://openalex.org/W2765503503, https://openalex.org/W2110335151, https://openalex.org/W2111762123, https://openalex.org/W2524916784, https://openalex.org/W2113923028, https://openalex.org/W2964106101, https://openalex.org/W1788771833, https://openalex.org/W4283803957, https://openalex.org/W4312121546, https://openalex.org/W2152207030, https://openalex.org/W3194983551 |
| referenced_works_count | 52 |
| abstract_inverted_index., | 58, 198 |
| abstract_inverted_index.. | 226 |
| abstract_inverted_index.A | 29 |
| abstract_inverted_index.a | 6, 59, 73, 76, 109, 115, 200, 213 |
| abstract_inverted_index.In | 177 |
| abstract_inverted_index.To | 68 |
| abstract_inverted_index.by | 95, 157 |
| abstract_inverted_index.in | 196 |
| abstract_inverted_index.is | 5, 36, 155, 220 |
| abstract_inverted_index.it | 42 |
| abstract_inverted_index.of | 13, 39, 78, 151, 175, 216 |
| abstract_inverted_index.to | 9, 32, 64, 124, 184 |
| abstract_inverted_index.we | 55, 180 |
| abstract_inverted_index.(1) | 82 |
| abstract_inverted_index.(2) | 97 |
| abstract_inverted_index.(3) | 107 |
| abstract_inverted_index.(4) | 113 |
| abstract_inverted_index.(5) | 119 |
| abstract_inverted_index.(6) | 128 |
| abstract_inverted_index.(7) | 135 |
| abstract_inverted_index.ASR | 89 |
| abstract_inverted_index.The | 139 |
| abstract_inverted_index.and | 15, 27, 50, 91, 104, 134, 148, 165, 171, 205 |
| abstract_inverted_index.for | 87, 188, 203, 223 |
| abstract_inverted_index.its | 33 |
| abstract_inverted_index.the | 11, 21, 37, 40, 84, 88, 121, 125, 172, 185, 191, 208 |
| abstract_inverted_index.use | 35, 69 |
| abstract_inverted_index.ASR, | 189 |
| abstract_inverted_index.Does | 98 |
| abstract_inverted_index.Here | 54 |
| abstract_inverted_index.Open | 173 |
| abstract_inverted_index.This | 154 |
| abstract_inverted_index.Tree | 174 |
| abstract_inverted_index.acid | 132 |
| abstract_inverted_index.aims | 63 |
| abstract_inverted_index.deep | 18 |
| abstract_inverted_index.file | 48 |
| abstract_inverted_index.from | 212 |
| abstract_inverted_index.gain | 17 |
| abstract_inverted_index.gene | 117, 122 |
| abstract_inverted_index.into | 20 |
| abstract_inverted_index.made | 195 |
| abstract_inverted_index.that | 62 |
| abstract_inverted_index.then | 206 |
| abstract_inverted_index.this | 66, 178 |
| abstract_inverted_index.thus | 16 |
| abstract_inverted_index.tool | 8 |
| abstract_inverted_index.tree | 123 |
| abstract_inverted_index.with | 75, 146, 167 |
| abstract_inverted_index.(ASR) | 4 |
| abstract_inverted_index.(NCBI | 170 |
| abstract_inverted_index.amino | 131 |
| abstract_inverted_index.among | 23 |
| abstract_inverted_index.broad | 34, 214 |
| abstract_inverted_index.finds | 92 |
| abstract_inverted_index.major | 30 |
| abstract_inverted_index.scope | 86 |
| abstract_inverted_index.steps | 186 |
| abstract_inverted_index.study | 10, 90 |
| abstract_inverted_index.task: | 41 |
| abstract_inverted_index.then: | 81 |
| abstract_inverted_index.tree; | 118, 127 |
| abstract_inverted_index.users | 71 |
| abstract_inverted_index.using | 210 |
| abstract_inverted_index.BLAST; | 96 |
| abstract_inverted_index.Infers | 83 |
| abstract_inverted_index.Life). | 176 |
| abstract_inverted_index.branch | 137 |
| abstract_inverted_index.design | 193 |
| abstract_inverted_index.expert | 51 |
| abstract_inverted_index.freely | 221 |
| abstract_inverted_index.modern | 159 |
| abstract_inverted_index.online | 168 |
| abstract_inverted_index.paper, | 179 |
| abstract_inverted_index.trees, | 144 |
| abstract_inverted_index.users, | 204 |
| abstract_inverted_index.PastML) | 166 |
| abstract_inverted_index.Topiary | 80, 219 |
| abstract_inverted_index.barrier | 31 |
| abstract_inverted_index.choices | 194 |
| abstract_inverted_index.complex | 47 |
| abstract_inverted_index.control | 103 |
| abstract_inverted_index.handful | 77 |
| abstract_inverted_index.insight | 19 |
| abstract_inverted_index.prepare | 72 |
| abstract_inverted_index.protein | 24, 217 |
| abstract_inverted_index.provide | 199 |
| abstract_inverted_index.quality | 102 |
| abstract_inverted_index.readers | 183 |
| abstract_inverted_index.returns | 141 |
| abstract_inverted_index.species | 126 |
| abstract_inverted_index.topiary | 57, 197 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.GeneRax, | 164 |
| abstract_inverted_index.achieved | 156 |
| abstract_inverted_index.ancestor | 152 |
| abstract_inverted_index.barrier. | 67 |
| abstract_inverted_index.datasets | 211 |
| abstract_inverted_index.describe | 190 |
| abstract_inverted_index.detailed | 201 |
| abstract_inverted_index.informed | 100 |
| abstract_inverted_index.multiple | 44, 110 |
| abstract_inverted_index.overcome | 65 |
| abstract_inverted_index.pipeline | 61, 140, 209 |
| abstract_inverted_index.powerful | 7 |
| abstract_inverted_index.proteins | 14 |
| abstract_inverted_index.protocol | 202 |
| abstract_inverted_index.quality. | 153 |
| abstract_inverted_index.relevant | 93 |
| abstract_inverted_index.required | 187 |
| abstract_inverted_index.requires | 43 |
| abstract_inverted_index.sequence | 2, 101, 111 |
| abstract_inverted_index.software | 45, 60, 161 |
| abstract_inverted_index.specific | 192 |
| abstract_inverted_index.topiary, | 70 |
| abstract_inverted_index.validate | 207 |
| abstract_inverted_index.(Muscle5, | 162 |
| abstract_inverted_index.Ancestral | 1 |
| abstract_inverted_index.Generates | 114 |
| abstract_inverted_index.ancestral | 130 |
| abstract_inverted_index.annotated | 142 |
| abstract_inverted_index.available | 222 |
| abstract_inverted_index.databases | 169 |
| abstract_inverted_index.download: | 224 |
| abstract_inverted_index.evolution | 12 |
| abstract_inverted_index.families. | 218 |
| abstract_inverted_index.function. | 28 |
| abstract_inverted_index.graphical | 149 |
| abstract_inverted_index.introduce | 56, 181 |
| abstract_inverted_index.packages, | 46 |
| abstract_inverted_index.sequence, | 25 |
| abstract_inverted_index.sequences | 94 |
| abstract_inverted_index.summaries | 150 |
| abstract_inverted_index.supports. | 138 |
| abstract_inverted_index.taxonomic | 85 |
| abstract_inverted_index.Constructs | 108 |
| abstract_inverted_index.Determines | 136 |
| abstract_inverted_index.Reconciles | 120 |
| abstract_inverted_index.alignment; | 112 |
| abstract_inverted_index.collection | 215 |
| abstract_inverted_index.complexity | 38 |
| abstract_inverted_index.knowledge. | 53 |
| abstract_inverted_index.reduction; | 106 |
| abstract_inverted_index.redundancy | 105 |
| abstract_inverted_index.sequences, | 147 |
| abstract_inverted_index.sequences. | 79 |
| abstract_inverted_index.sequences; | 133 |
| abstract_inverted_index.structure, | 26 |
| abstract_inverted_index.RAxML‐NG, | 163 |
| abstract_inverted_index.integrating | 158 |
| abstract_inverted_index.spreadsheet | 74 |
| abstract_inverted_index.Reconstructs | 129 |
| abstract_inverted_index.evolutionary | 143 |
| abstract_inverted_index.non‐expert | 182 |
| abstract_inverted_index.phylogenetic | 52 |
| abstract_inverted_index.spreadsheets | 145 |
| abstract_inverted_index.phylogenetics | 160 |
| abstract_inverted_index.relationships | 22 |
| abstract_inverted_index.taxonomically | 99 |
| abstract_inverted_index.manipulations, | 49 |
| abstract_inverted_index.reconstruction | 3 |
| abstract_inverted_index.maximum‐likelihood | 116 |
| abstract_inverted_index.https://github.com/harmslab/topiary | 225 |
| cited_by_percentile_year.max | 98 |
| cited_by_percentile_year.min | 94 |
| corresponding_author_ids | https://openalex.org/A5048475119 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 5 |
| corresponding_institution_ids | https://openalex.org/I181233156 |
| sustainable_development_goals[0].id | https://metadata.un.org/sdg/8 |
| sustainable_development_goals[0].score | 0.6499999761581421 |
| sustainable_development_goals[0].display_name | Decent work and economic growth |
| citation_normalized_percentile.value | 0.791233 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |