Generalised Bayesian distance-based phylogenetics for the genomics era
2025 · Open Access · DOI: https://doi.org/10.48550/arxiv.2502.04067
As whole genomes become widely available, maximum likelihood and Bayesian phylogenetic methods are demonstrating their limits in meeting the escalating computational demands. Conversely, distance-based phylogenetic methods are efficient, but are rarely favoured due to their inferior performance. Here, we extend distance-based phylogenetics using an entropy-based likelihood of the evolution among pairs of taxa, allowing for fast Bayesian inference in genome-scale datasets. We provide evidence of a close link between the inference criteria used in distance methods and Felsenstein's likelihood, such that the methods are expected to have comparable performance in practice. Using the entropic likelihood, we perform Bayesian inference on three phylogenetic benchmark datasets and find that estimates closely correspond with previous inferences. We also apply this rapid inference approach to a 60-million-site alignment from 363 avian taxa, covering most avian families. The method has outstanding performance and reveals substantial uncertainty in the avian diversification events immediately after the K-Pg transition event. The entropic likelihood allows for efficient Bayesian phylogenetic inference, accommodating the analysis demands of the genomic era.
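For readers unfamiliar with distance-based phylogenetics, the kind of pairwise summary these methods start from can be sketched as follows. This is only an illustration under stated assumptions: it uses the classic Jukes-Cantor (JC69) corrected distance, not the entropic likelihood proposed in the paper, and the function names (`p_distance`, `jc69_distance`, `distance_matrix`) and the toy sequences are hypothetical.

```python
import math
from itertools import combinations

VALID = set("ACGT")


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites, skipping gaps and ambiguous bases."""
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in VALID and y in VALID:
            compared += 1
            differing += x != y
    if compared == 0:
        raise ValueError("no comparable sites between the two sequences")
    return differing / compared


def jc69_distance(a: str, b: str) -> float:
    """Jukes-Cantor corrected distance (expected substitutions per site)."""
    p = p_distance(a, b)
    if p >= 0.75:
        # The correction is undefined once sequences look saturated.
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(seqs: dict) -> dict:
    """Symmetric pairwise distance matrix keyed by (taxon, taxon)."""
    names = list(seqs)
    d = {(n, n): 0.0 for n in names}
    for u, v in combinations(names, 2):
        d[(u, v)] = d[(v, u)] = jc69_distance(seqs[u], seqs[v])
    return d


if __name__ == "__main__":
    # Toy aligned sequences, invented for illustration only.
    toy = {"t1": "ACGTACGT", "t2": "ACGTACGA", "t3": "ACGAACGA"}
    for pair, dist in sorted(distance_matrix(toy).items()):
        print(pair, round(dist, 4))
```

Computing the matrix takes one pass over each pair of sequences, which is why distance methods scale well to long alignments; how the paper turns pairwise information into a likelihood is described in the preprint itself.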
Related Topics
- Type: preprint
- Language: en
- Landing Page: http://arxiv.org/abs/2502.04067, https://arxiv.org/pdf/2502.04067
- OA Status: green
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4407245217
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4407245217 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.48550/arxiv.2502.04067 (Digital Object Identifier)
- Title: Generalised Bayesian distance-based phylogenetics for the genomics era (work title)
- Type: preprint (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2025 (year of publication)
- Publication date: 2025-02-06 (full publication date if available)
- Authors: Matthew J. Penn, Neil Scheidwasser, Mark P. Khurana, Christl A. Donnelly, David A. Duchêne, Samir Bhatt (list of authors in order)
- Landing page: https://arxiv.org/abs/2502.04067 (publisher landing page)
- PDF URL: https://arxiv.org/pdf/2502.04067 (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: green (open access status per OpenAlex)
- OA URL: https://arxiv.org/pdf/2502.04067 (direct OA link when available)
- Concepts: Phylogenetics, Genomics, Bayesian probability, Evolutionary biology, Computational biology, Biology, Data science, Computer science, Artificial intelligence, Genome, Genetics, Gene (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 0 (total citation count in OpenAlex)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| Field | Value |
|---|---|
| id | https://openalex.org/W4407245217 |
| doi | https://doi.org/10.48550/arxiv.2502.04067 |
| ids.doi | https://doi.org/10.48550/arxiv.2502.04067 |
| ids.openalex | https://openalex.org/W4407245217 |
| fwci | |
| type | preprint |
| title | Generalised Bayesian distance-based phylogenetics for the genomics era |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10015 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9977999925613403 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Genomics and Phylogenetic Studies |
| topics[1].id | https://openalex.org/T12946 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.968500018119812 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1312 |
| topics[1].subfield.display_name | Molecular Biology |
| topics[1].display_name | Fractal and DNA sequence analysis |
| topics[2].id | https://openalex.org/T11710 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9394000172615051 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1312 |
| topics[2].subfield.display_name | Molecular Biology |
| topics[2].display_name | Biomedical Text Mining and Ontologies |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C90132467 |
| concepts[0].level | 3 |
| concepts[0].score | 0.6484280824661255 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q171184 |
| concepts[0].display_name | Phylogenetics |
| concepts[1].id | https://openalex.org/C189206191 |
| concepts[1].level | 4 |
| concepts[1].score | 0.6127811074256897 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q222046 |
| concepts[1].display_name | Genomics |
| concepts[2].id | https://openalex.org/C107673813 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5748589038848877 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q812534 |
| concepts[2].display_name | Bayesian probability |
| concepts[3].id | https://openalex.org/C78458016 |
| concepts[3].level | 1 |
| concepts[3].score | 0.4989955425262451 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q840400 |
| concepts[3].display_name | Evolutionary biology |
| concepts[4].id | https://openalex.org/C70721500 |
| concepts[4].level | 1 |
| concepts[4].score | 0.43202710151672363 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q177005 |
| concepts[4].display_name | Computational biology |
| concepts[5].id | https://openalex.org/C86803240 |
| concepts[5].level | 0 |
| concepts[5].score | 0.3472912907600403 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[5].display_name | Biology |
| concepts[6].id | https://openalex.org/C2522767166 |
| concepts[6].level | 1 |
| concepts[6].score | 0.3352029323577881 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q2374463 |
| concepts[6].display_name | Data science |
| concepts[7].id | https://openalex.org/C41008148 |
| concepts[7].level | 0 |
| concepts[7].score | 0.3208007216453552 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[7].display_name | Computer science |
| concepts[8].id | https://openalex.org/C154945302 |
| concepts[8].level | 1 |
| concepts[8].score | 0.20343321561813354 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[8].display_name | Artificial intelligence |
| concepts[9].id | https://openalex.org/C141231307 |
| concepts[9].level | 3 |
| concepts[9].score | 0.17960897088050842 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q7020 |
| concepts[9].display_name | Genome |
| concepts[10].id | https://openalex.org/C54355233 |
| concepts[10].level | 1 |
| concepts[10].score | 0.16978922486305237 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[10].display_name | Genetics |
| concepts[11].id | https://openalex.org/C104317684 |
| concepts[11].level | 2 |
| concepts[11].score | 0.11734098196029663 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[11].display_name | Gene |
| keywords[0].id | https://openalex.org/keywords/phylogenetics |
| keywords[0].score | 0.6484280824661255 |
| keywords[0].display_name | Phylogenetics |
| keywords[1].id | https://openalex.org/keywords/genomics |
| keywords[1].score | 0.6127811074256897 |
| keywords[1].display_name | Genomics |
| keywords[2].id | https://openalex.org/keywords/bayesian-probability |
| keywords[2].score | 0.5748589038848877 |
| keywords[2].display_name | Bayesian probability |
| keywords[3].id | https://openalex.org/keywords/evolutionary-biology |
| keywords[3].score | 0.4989955425262451 |
| keywords[3].display_name | Evolutionary biology |
| keywords[4].id | https://openalex.org/keywords/computational-biology |
| keywords[4].score | 0.43202710151672363 |
| keywords[4].display_name | Computational biology |
| keywords[5].id | https://openalex.org/keywords/biology |
| keywords[5].score | 0.3472912907600403 |
| keywords[5].display_name | Biology |
| keywords[6].id | https://openalex.org/keywords/data-science |
| keywords[6].score | 0.3352029323577881 |
| keywords[6].display_name | Data science |
| keywords[7].id | https://openalex.org/keywords/computer-science |
| keywords[7].score | 0.3208007216453552 |
| keywords[7].display_name | Computer science |
| keywords[8].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[8].score | 0.20343321561813354 |
| keywords[8].display_name | Artificial intelligence |
| keywords[9].id | https://openalex.org/keywords/genome |
| keywords[9].score | 0.17960897088050842 |
| keywords[9].display_name | Genome |
| keywords[10].id | https://openalex.org/keywords/genetics |
| keywords[10].score | 0.16978922486305237 |
| keywords[10].display_name | Genetics |
| keywords[11].id | https://openalex.org/keywords/gene |
| keywords[11].score | 0.11734098196029663 |
| keywords[11].display_name | Gene |
| language | en |
| locations[0].id | pmh:oai:arXiv.org:2502.04067 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306400194 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | arXiv (Cornell University) |
| locations[0].source.host_organization | https://openalex.org/I205783295 |
| locations[0].source.host_organization_name | Cornell University |
| locations[0].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[0].license | |
| locations[0].pdf_url | https://arxiv.org/pdf/2502.04067 |
| locations[0].version | submittedVersion |
| locations[0].raw_type | text |
| locations[0].license_id | |
| locations[0].is_accepted | False |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | http://arxiv.org/abs/2502.04067 |
| locations[1].id | doi:10.48550/arxiv.2502.04067 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306400194 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | True |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | arXiv (Cornell University) |
| locations[1].source.host_organization | https://openalex.org/I205783295 |
| locations[1].source.host_organization_name | Cornell University |
| locations[1].source.host_organization_lineage | https://openalex.org/I205783295 |
| locations[1].license | cc-by |
| locations[1].pdf_url | |
| locations[1].version | |
| locations[1].raw_type | article |
| locations[1].license_id | https://openalex.org/licenses/cc-by |
| locations[1].is_accepted | False |
| locations[1].is_published | |
| locations[1].raw_source_name | |
| locations[1].landing_page_url | https://doi.org/10.48550/arxiv.2502.04067 |
| indexed_in | arxiv, datacite |
| authorships[0].author.id | https://openalex.org/A5028742823 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-8682-5393 |
| authorships[0].author.display_name | Matthew J. Penn |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Penn, Matthew J. |
| authorships[0].is_corresponding | False |
| authorships[1].author.id | https://openalex.org/A5055084920 |
| authorships[1].author.orcid | https://orcid.org/0000-0001-9922-0289 |
| authorships[1].author.display_name | Neil Scheidwasser |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Scheidwasser, Neil |
| authorships[1].is_corresponding | False |
| authorships[2].author.id | https://openalex.org/A5031671404 |
| authorships[2].author.orcid | https://orcid.org/0000-0002-1123-7674 |
| authorships[2].author.display_name | Mark P. Khurana |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Khurana, Mark P. |
| authorships[2].is_corresponding | False |
| authorships[3].author.id | https://openalex.org/A5012533867 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-0195-2463 |
| authorships[3].author.display_name | Christl A. Donnelly |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Donnelly, Christl A. |
| authorships[3].is_corresponding | False |
| authorships[4].author.id | https://openalex.org/A5022291539 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-5479-1974 |
| authorships[4].author.display_name | David A. Duchêne |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Duchêne, David A. |
| authorships[4].is_corresponding | False |
| authorships[5].author.id | https://openalex.org/A5091290326 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-0891-4611 |
| authorships[5].author.display_name | Samir Bhatt |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Bhatt, Samir |
| authorships[5].is_corresponding | False |
| has_content.pdf | False |
| has_content.grobid_xml | False |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://arxiv.org/pdf/2502.04067 |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Generalised Bayesian distance-based phylogenetics for the genomics era |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T06:51:31.235846 |
| primary_topic.id | https://openalex.org/T10015 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9977999925613403 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Genomics and Phylogenetic Studies |
| related_works | https://openalex.org/W2347127313, https://openalex.org/W158575883, https://openalex.org/W1563588491, https://openalex.org/W1967721632, https://openalex.org/W2155276701, https://openalex.org/W2128252278, https://openalex.org/W1923231248, https://openalex.org/W2383763250, https://openalex.org/W248275141, https://openalex.org/W2518331723 |
| cited_by_count | 0 |
| locations_count | 2 |
| best_oa_location.id | pmh:oai:arXiv.org:2502.04067 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306400194 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | arXiv (Cornell University) |
| best_oa_location.source.host_organization | https://openalex.org/I205783295 |
| best_oa_location.source.host_organization_name | Cornell University |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| best_oa_location.license | |
| best_oa_location.pdf_url | https://arxiv.org/pdf/2502.04067 |
| best_oa_location.version | submittedVersion |
| best_oa_location.raw_type | text |
| best_oa_location.license_id | |
| best_oa_location.is_accepted | False |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | http://arxiv.org/abs/2502.04067 |
| primary_location.id | pmh:oai:arXiv.org:2502.04067 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306400194 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | arXiv (Cornell University) |
| primary_location.source.host_organization | https://openalex.org/I205783295 |
| primary_location.source.host_organization_name | Cornell University |
| primary_location.source.host_organization_lineage | https://openalex.org/I205783295 |
| primary_location.license | |
| primary_location.pdf_url | https://arxiv.org/pdf/2502.04067 |
| primary_location.version | submittedVersion |
| primary_location.raw_type | text |
| primary_location.license_id | |
| primary_location.is_accepted | False |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | http://arxiv.org/abs/2502.04067 |
| publication_date | 2025-02-06 |
| publication_year | 2025 |
| referenced_works_count | 0 |
| abstract_inverted_index.a | 65, 121 |
| abstract_inverted_index.As | 0 |
| abstract_inverted_index.We | 61, 113 |
| abstract_inverted_index.an | 43 |
| abstract_inverted_index.in | 16, 58, 73, 89, 141 |
| abstract_inverted_index.of | 46, 51, 64, 165 |
| abstract_inverted_index.on | 99 |
| abstract_inverted_index.to | 33, 85, 120 |
| abstract_inverted_index.we | 38, 95 |
| abstract_inverted_index.363 | 125 |
| abstract_inverted_index.The | 132, 152 |
| abstract_inverted_index.and | 8, 76, 104, 137 |
| abstract_inverted_index.are | 12, 26, 29, 83 |
| abstract_inverted_index.but | 28 |
| abstract_inverted_index.due | 32 |
| abstract_inverted_index.for | 54, 156 |
| abstract_inverted_index.has | 134 |
| abstract_inverted_index.the | 18, 47, 69, 81, 92, 142, 148, 162, 166 |
| abstract_inverted_index.K-Pg | 149 |
| abstract_inverted_index.also | 114 |
| abstract_inverted_index.era. | 168 |
| abstract_inverted_index.fast | 55 |
| abstract_inverted_index.find | 105 |
| abstract_inverted_index.from | 124 |
| abstract_inverted_index.have | 86 |
| abstract_inverted_index.link | 67 |
| abstract_inverted_index.most | 129 |
| abstract_inverted_index.such | 79 |
| abstract_inverted_index.that | 80, 106 |
| abstract_inverted_index.this | 116 |
| abstract_inverted_index.used | 72 |
| abstract_inverted_index.with | 110 |
| abstract_inverted_index.Here, | 37 |
| abstract_inverted_index.Using | 91 |
| abstract_inverted_index.after | 147 |
| abstract_inverted_index.among | 49 |
| abstract_inverted_index.apply | 115 |
| abstract_inverted_index.avian | 126, 130, 143 |
| abstract_inverted_index.close | 66 |
| abstract_inverted_index.pairs | 50 |
| abstract_inverted_index.rapid | 117 |
| abstract_inverted_index.taxa, | 52, 127 |
| abstract_inverted_index.their | 14, 34 |
| abstract_inverted_index.three | 100 |
| abstract_inverted_index.using | 42 |
| abstract_inverted_index.whole | 1 |
| abstract_inverted_index.allows | 155 |
| abstract_inverted_index.become | 3 |
| abstract_inverted_index.event. | 151 |
| abstract_inverted_index.events | 145 |
| abstract_inverted_index.extend | 39 |
| abstract_inverted_index.limits | 15 |
| abstract_inverted_index.method | 133 |
| abstract_inverted_index.rarely | 30 |
| abstract_inverted_index.widely | 4 |
| abstract_inverted_index.between | 68 |
| abstract_inverted_index.closely | 108 |
| abstract_inverted_index.demands | 164 |
| abstract_inverted_index.genomes | 2 |
| abstract_inverted_index.genomic | 167 |
| abstract_inverted_index.maximum | 6 |
| abstract_inverted_index.meeting | 17 |
| abstract_inverted_index.methods | 11, 25, 75, 82 |
| abstract_inverted_index.perform | 96 |
| abstract_inverted_index.provide | 62 |
| abstract_inverted_index.reveals | 138 |
| abstract_inverted_index.Bayesian | 9, 56, 97, 158 |
| abstract_inverted_index.allowing | 53 |
| abstract_inverted_index.analysis | 163 |
| abstract_inverted_index.approach | 119 |
| abstract_inverted_index.covering | 128 |
| abstract_inverted_index.criteria | 71 |
| abstract_inverted_index.datasets | 103 |
| abstract_inverted_index.demands. | 21 |
| abstract_inverted_index.distance | 74 |
| abstract_inverted_index.entropic | 93, 153 |
| abstract_inverted_index.evidence | 63 |
| abstract_inverted_index.expected | 84 |
| abstract_inverted_index.favoured | 31 |
| abstract_inverted_index.inferior | 35 |
| abstract_inverted_index.previous | 111 |
| abstract_inverted_index.alignment | 123 |
| abstract_inverted_index.benchmark | 102 |
| abstract_inverted_index.datasets. | 60 |
| abstract_inverted_index.efficient | 157 |
| abstract_inverted_index.estimates | 107 |
| abstract_inverted_index.evolution | 48 |
| abstract_inverted_index.families. | 131 |
| abstract_inverted_index.inference | 57, 70, 98, 118 |
| abstract_inverted_index.practice. | 90 |
| abstract_inverted_index.available, | 5 |
| abstract_inverted_index.comparable | 87 |
| abstract_inverted_index.correspond | 109 |
| abstract_inverted_index.efficient, | 27 |
| abstract_inverted_index.escalating | 19 |
| abstract_inverted_index.inference, | 160 |
| abstract_inverted_index.likelihood | 7, 45, 154 |
| abstract_inverted_index.transition | 150 |
| abstract_inverted_index.Conversely, | 22 |
| abstract_inverted_index.immediately | 146 |
| abstract_inverted_index.inferences. | 112 |
| abstract_inverted_index.likelihood, | 78, 94 |
| abstract_inverted_index.outstanding | 135 |
| abstract_inverted_index.performance | 88, 136 |
| abstract_inverted_index.substantial | 139 |
| abstract_inverted_index.uncertainty | 140 |
| abstract_inverted_index.genome-scale | 59 |
| abstract_inverted_index.performance. | 36 |
| abstract_inverted_index.phylogenetic | 10, 24, 101, 159 |
| abstract_inverted_index.Felsenstein's | 77 |
| abstract_inverted_index.accommodating | 161 |
| abstract_inverted_index.computational | 20 |
| abstract_inverted_index.demonstrating | 13 |
| abstract_inverted_index.entropy-based | 44 |
| abstract_inverted_index.phylogenetics | 41 |
| abstract_inverted_index.distance-based | 23, 40 |
| abstract_inverted_index.60-million-site | 122 |
| abstract_inverted_index.diversification | 144 |
| cited_by_percentile_year | |
| countries_distinct_count | 0 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile | |
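
The `abstract_inverted_index.*` rows above store the abstract as a map from each token to the word positions where it occurs. A minimal sketch of turning such rows back into running text is given below; the helper names (`parse_rows`, `rebuild_abstract`) are hypothetical, and the parser assumes rows shaped exactly like the ones in this table (`| abstract_inverted_index.<token> | <positions> |`).

```python
PREFIX = "abstract_inverted_index."


def parse_rows(rows):
    """Collect {token: [positions]} from payload table rows."""
    index = {}
    for row in rows:
        cells = [c.strip() for c in row.strip().strip("|").split("|")]
        # Skip anything that is not a two-cell inverted-index row with a value.
        if len(cells) != 2 or not cells[0].startswith(PREFIX) or not cells[1]:
            continue
        token = cells[0][len(PREFIX):]
        index[token] = [int(p) for p in cells[1].split(",")]
    return index


def rebuild_abstract(index):
    """Place every token at each of its positions and join in order."""
    by_position = {}
    for token, positions in index.items():
        for pos in positions:
            by_position[pos] = token
    return " ".join(by_position[i] for i in sorted(by_position))


if __name__ == "__main__":
    # A few rows copied from the table above.
    sample = [
        "| abstract_inverted_index.As | 0 |",
        "| abstract_inverted_index.whole | 1 |",
        "| abstract_inverted_index.genomes | 2 |",
        "| abstract_inverted_index.become | 3 |",
        "| abstract_inverted_index.widely | 4 |",
        "| abstract_inverted_index.available, | 5 |",
    ]
    print(rebuild_abstract(parse_rows(sample)))
```

Tokens keep their original punctuation and capitalisation, so joining on single spaces recovers the abstract text as indexed.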