Interpreting alignment-free sequence comparison: what makes a score a good score?
2022 · Open Access · DOI: https://doi.org/10.1093/nargab/lqac062
Alignment-free methods are alternatives to alignment-based methods when searching sequence data sets. The output from an alignment-free sequence comparison is a similarity score, the interpretation of which is not straightforward. We propose objective functions to interpret and calibrate outputs from alignment-free searches, noting that different objective functions are necessary for different biological contexts. This leads to advantages: visualising and comparing score distributions, including those from true positives, may be a relatively simple method to gain insight into the performance of different metrics. Using an empirical approach with both DNA and protein sequences, we characterise different similarity score distributions generated under different parameters. In particular, we demonstrate how sequence length can affect the scores. We show that scores of true positive sequence pairs may correlate significantly with their mean length; and even if the correlation is weak, the relative difference in length of the sequence pair may significantly reduce the effectiveness of alignment-free metrics. Importantly, we show how objective functions can be used with test data to accurately estimate the probability of true positives. This can significantly increase the utility of alignment-free approaches. Finally, we have developed a general-purpose software tool called KAST for use in high-throughput workflows on Linux clusters.
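As a rough illustration of what an alignment-free similarity score can look like, the sketch below computes a Euclidean distance between k-mer frequency profiles and then estimates, from labelled test pairs, the fraction of pairs under a score threshold that are true positives. The abstract does not say which metrics or objective functions the authors use, so the metric, the function names (`kmer_frequencies`, `euclidean_distance`, `true_positive_fraction`) and the threshold rule are all assumptions made for this example; none of it is taken from KAST.

```python
from collections import Counter
from math import sqrt


def kmer_frequencies(seq, k):
    """Return relative frequencies of overlapping k-mers in seq."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    # Sequences shorter than k contain no k-mers at all.
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


def euclidean_distance(seq_a, seq_b, k=3):
    """Distance between two k-mer frequency profiles (0.0 means identical)."""
    fa = kmer_frequencies(seq_a, k)
    fb = kmer_frequencies(seq_b, k)
    kmers = set(fa) | set(fb)
    return sqrt(sum((fa.get(m, 0.0) - fb.get(m, 0.0)) ** 2 for m in kmers))


def true_positive_fraction(scored_pairs, threshold):
    """Hypothetical calibration helper, not the paper's objective function.

    scored_pairs is an iterable of (distance, is_true_positive) tuples from
    labelled test data. Returns the fraction of pairs with distance at or
    below the threshold that are true positives, or None if no pair qualifies.
    """
    hits = [is_tp for distance, is_tp in scored_pairs if distance <= threshold]
    return sum(hits) / len(hits) if hits else None
```

Because the profiles are normalised to frequencies, this particular distance is less sensitive to raw length than a count-based one, but short sequences still yield sparse, noisy profiles, which is one way a length difference between the two sequences can distort a score.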
- Type: article
- Language: en
- Landing Page: https://doi.org/10.1093/nargab/lqac062
- PDF URL: https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf
- OA Status: gold
- Cited By: 6
- References: 72
- Related Works: 10
- OpenAlex ID: https://openalex.org/W4294591310
Raw OpenAlex JSON
- OpenAlex ID: https://openalex.org/W4294591310 (canonical identifier for this work in OpenAlex)
- DOI: https://doi.org/10.1093/nargab/lqac062 (Digital Object Identifier)
- Title: Interpreting alignment-free sequence comparison: what makes a score a good score? (work title)
- Type: article (OpenAlex work type)
- Language: en (primary language)
- Publication year: 2022 (year of publication)
- Publication date: 2022-07-09 (full publication date if available)
- Authors: Martin Swain, Martin Vickers (list of authors in order)
- Landing page: https://doi.org/10.1093/nargab/lqac062 (publisher landing page)
- PDF URL: https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf (direct link to full text PDF)
- Open access: Yes (whether a free full text is available)
- OA status: gold (open access status per OpenAlex)
- OA URL: https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf (direct OA link when available)
- Concepts: Sequence (biology), Similarity (geometry), False positive paradox, Multiple sequence alignment, Computer science, Correlation, Score, Sequence alignment, Pattern recognition (psychology), Data mining, Artificial intelligence, Algorithm, Statistics, Mathematics, Machine learning, Biology, Image (mathematics), Genetics, Biochemistry, Gene, Peptide sequence, Geometry (top concepts, i.e. fields/topics, attached by OpenAlex)
- Cited by: 6 (total citation count in OpenAlex)
- Citations by year (recent): 2025: 3, 2024: 1, 2023: 2 (per-year citation counts, last 5 years)
- References (count): 72 (number of works referenced by this work)
- Related works (count): 10 (other works algorithmically related by OpenAlex)
Full payload
| field | value |
|---|---|
| id | https://openalex.org/W4294591310 |
| doi | https://doi.org/10.1093/nargab/lqac062 |
| ids.doi | https://doi.org/10.1093/nargab/lqac062 |
| ids.pmid | https://pubmed.ncbi.nlm.nih.gov/36071721 |
| ids.openalex | https://openalex.org/W4294591310 |
| fwci | 0.73841495 |
| type | article |
| title | Interpreting alignment-free sequence comparison: what makes a score a good score? |
| awards[0].id | https://openalex.org/G8204187466 |
| awards[0].funder_id | https://openalex.org/F4320334629 |
| awards[0].display_name | |
| awards[0].funder_award_id | BB/CAP1730/1 |
| awards[0].funder_display_name | Biotechnology and Biological Sciences Research Council |
| awards[1].id | https://openalex.org/G328603825 |
| awards[1].funder_id | https://openalex.org/F4320334629 |
| awards[1].display_name | |
| awards[1].funder_award_id | BBS/E/W/10962A01D |
| awards[1].funder_display_name | Biotechnology and Biological Sciences Research Council |
| biblio.issue | 3 |
| biblio.volume | 4 |
| biblio.last_page | lqac062 |
| biblio.first_page | lqac062 |
| topics[0].id | https://openalex.org/T10015 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9998999834060669 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Genomics and Phylogenetic Studies |
| topics[1].id | https://openalex.org/T10521 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9955000281333923 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1312 |
| topics[1].subfield.display_name | Molecular Biology |
| topics[1].display_name | RNA and protein synthesis mechanisms |
| topics[2].id | https://openalex.org/T11791 |
| topics[2].field.id | https://openalex.org/fields/23 |
| topics[2].field.display_name | Environmental Science |
| topics[2].score | 0.9937999844551086 |
| topics[2].domain.id | https://openalex.org/domains/3 |
| topics[2].domain.display_name | Physical Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/2303 |
| topics[2].subfield.display_name | Ecology |
| topics[2].display_name | Microbial Community Ecology and Physiology |
| funders[0].id | https://openalex.org/F4320334629 |
| funders[0].ror | https://ror.org/00cwqg982 |
| funders[0].display_name | Biotechnology and Biological Sciences Research Council |
| is_xpac | False |
| apc_list.value | 2473 |
| apc_list.currency | USD |
| apc_list.value_usd | 2473 |
| apc_paid.value | 2473 |
| apc_paid.currency | USD |
| apc_paid.value_usd | 2473 |
| concepts[0].id | https://openalex.org/C2778112365 |
| concepts[0].level | 2 |
| concepts[0].score | 0.6452201008796692 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q3511065 |
| concepts[0].display_name | Sequence (biology) |
| concepts[1].id | https://openalex.org/C103278499 |
| concepts[1].level | 3 |
| concepts[1].score | 0.6143949031829834 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q254465 |
| concepts[1].display_name | Similarity (geometry) |
| concepts[2].id | https://openalex.org/C64869954 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5963835716247559 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q1859747 |
| concepts[2].display_name | False positive paradox |
| concepts[3].id | https://openalex.org/C88031987 |
| concepts[3].level | 5 |
| concepts[3].score | 0.5820029973983765 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1377767 |
| concepts[3].display_name | Multiple sequence alignment |
| concepts[4].id | https://openalex.org/C41008148 |
| concepts[4].level | 0 |
| concepts[4].score | 0.5385282039642334 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[4].display_name | Computer science |
| concepts[5].id | https://openalex.org/C117220453 |
| concepts[5].level | 2 |
| concepts[5].score | 0.4844491481781006 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q5172842 |
| concepts[5].display_name | Correlation |
| concepts[6].id | https://openalex.org/C65660741 |
| concepts[6].level | 2 |
| concepts[6].score | 0.46183300018310547 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q3952743 |
| concepts[6].display_name | Score |
| concepts[7].id | https://openalex.org/C45484198 |
| concepts[7].level | 4 |
| concepts[7].score | 0.4244697690010071 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q827246 |
| concepts[7].display_name | Sequence alignment |
| concepts[8].id | https://openalex.org/C153180895 |
| concepts[8].level | 2 |
| concepts[8].score | 0.4231995940208435 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q7148389 |
| concepts[8].display_name | Pattern recognition (psychology) |
| concepts[9].id | https://openalex.org/C124101348 |
| concepts[9].level | 1 |
| concepts[9].score | 0.41377806663513184 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[9].display_name | Data mining |
| concepts[10].id | https://openalex.org/C154945302 |
| concepts[10].level | 1 |
| concepts[10].score | 0.3943127989768982 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[10].display_name | Artificial intelligence |
| concepts[11].id | https://openalex.org/C11413529 |
| concepts[11].level | 1 |
| concepts[11].score | 0.3737014830112457 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q8366 |
| concepts[11].display_name | Algorithm |
| concepts[12].id | https://openalex.org/C105795698 |
| concepts[12].level | 1 |
| concepts[12].score | 0.36524927616119385 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q12483 |
| concepts[12].display_name | Statistics |
| concepts[13].id | https://openalex.org/C33923547 |
| concepts[13].level | 0 |
| concepts[13].score | 0.3371911644935608 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[13].display_name | Mathematics |
| concepts[14].id | https://openalex.org/C119857082 |
| concepts[14].level | 1 |
| concepts[14].score | 0.2632082402706146 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q2539 |
| concepts[14].display_name | Machine learning |
| concepts[15].id | https://openalex.org/C86803240 |
| concepts[15].level | 0 |
| concepts[15].score | 0.12588447332382202 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[15].display_name | Biology |
| concepts[16].id | https://openalex.org/C115961682 |
| concepts[16].level | 2 |
| concepts[16].score | 0.09148150682449341 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q860623 |
| concepts[16].display_name | Image (mathematics) |
| concepts[17].id | https://openalex.org/C54355233 |
| concepts[17].level | 1 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[17].display_name | Genetics |
| concepts[18].id | https://openalex.org/C55493867 |
| concepts[18].level | 1 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q7094 |
| concepts[18].display_name | Biochemistry |
| concepts[19].id | https://openalex.org/C104317684 |
| concepts[19].level | 2 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q7187 |
| concepts[19].display_name | Gene |
| concepts[20].id | https://openalex.org/C167625842 |
| concepts[20].level | 3 |
| concepts[20].score | 0.0 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q899763 |
| concepts[20].display_name | Peptide sequence |
| concepts[21].id | https://openalex.org/C2524010 |
| concepts[21].level | 1 |
| concepts[21].score | 0.0 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q8087 |
| concepts[21].display_name | Geometry |
| keywords[0].id | https://openalex.org/keywords/sequence |
| keywords[0].score | 0.6452201008796692 |
| keywords[0].display_name | Sequence (biology) |
| keywords[1].id | https://openalex.org/keywords/similarity |
| keywords[1].score | 0.6143949031829834 |
| keywords[1].display_name | Similarity (geometry) |
| keywords[2].id | https://openalex.org/keywords/false-positive-paradox |
| keywords[2].score | 0.5963835716247559 |
| keywords[2].display_name | False positive paradox |
| keywords[3].id | https://openalex.org/keywords/multiple-sequence-alignment |
| keywords[3].score | 0.5820029973983765 |
| keywords[3].display_name | Multiple sequence alignment |
| keywords[4].id | https://openalex.org/keywords/computer-science |
| keywords[4].score | 0.5385282039642334 |
| keywords[4].display_name | Computer science |
| keywords[5].id | https://openalex.org/keywords/correlation |
| keywords[5].score | 0.4844491481781006 |
| keywords[5].display_name | Correlation |
| keywords[6].id | https://openalex.org/keywords/score |
| keywords[6].score | 0.46183300018310547 |
| keywords[6].display_name | Score |
| keywords[7].id | https://openalex.org/keywords/sequence-alignment |
| keywords[7].score | 0.4244697690010071 |
| keywords[7].display_name | Sequence alignment |
| keywords[8].id | https://openalex.org/keywords/pattern-recognition |
| keywords[8].score | 0.4231995940208435 |
| keywords[8].display_name | Pattern recognition (psychology) |
| keywords[9].id | https://openalex.org/keywords/data-mining |
| keywords[9].score | 0.41377806663513184 |
| keywords[9].display_name | Data mining |
| keywords[10].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[10].score | 0.3943127989768982 |
| keywords[10].display_name | Artificial intelligence |
| keywords[11].id | https://openalex.org/keywords/algorithm |
| keywords[11].score | 0.3737014830112457 |
| keywords[11].display_name | Algorithm |
| keywords[12].id | https://openalex.org/keywords/statistics |
| keywords[12].score | 0.36524927616119385 |
| keywords[12].display_name | Statistics |
| keywords[13].id | https://openalex.org/keywords/mathematics |
| keywords[13].score | 0.3371911644935608 |
| keywords[13].display_name | Mathematics |
| keywords[14].id | https://openalex.org/keywords/machine-learning |
| keywords[14].score | 0.2632082402706146 |
| keywords[14].display_name | Machine learning |
| keywords[15].id | https://openalex.org/keywords/biology |
| keywords[15].score | 0.12588447332382202 |
| keywords[15].display_name | Biology |
| keywords[16].id | https://openalex.org/keywords/image |
| keywords[16].score | 0.09148150682449341 |
| keywords[16].display_name | Image (mathematics) |
| language | en |
| locations[0].id | doi:10.1093/nargab/lqac062 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4210241000 |
| locations[0].source.issn | 2631-9268 |
| locations[0].source.type | journal |
| locations[0].source.is_oa | True |
| locations[0].source.issn_l | 2631-9268 |
| locations[0].source.is_core | True |
| locations[0].source.is_in_doaj | True |
| locations[0].source.display_name | NAR Genomics and Bioinformatics |
| locations[0].source.host_organization | https://openalex.org/P4310311648 |
| locations[0].source.host_organization_name | Oxford University Press |
| locations[0].source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| locations[0].source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| locations[0].license | cc-by |
| locations[0].pdf_url | https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf |
| locations[0].version | publishedVersion |
| locations[0].raw_type | journal-article |
| locations[0].license_id | https://openalex.org/licenses/cc-by |
| locations[0].is_accepted | True |
| locations[0].is_published | True |
| locations[0].raw_source_name | NAR Genomics and Bioinformatics |
| locations[0].landing_page_url | https://doi.org/10.1093/nargab/lqac062 |
| locations[1].id | pmid:36071721 |
| locations[1].is_oa | False |
| locations[1].source.id | https://openalex.org/S4306525036 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | PubMed |
| locations[1].source.host_organization | https://openalex.org/I1299303238 |
| locations[1].source.host_organization_name | National Institutes of Health |
| locations[1].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[1].license | |
| locations[1].pdf_url | |
| locations[1].version | publishedVersion |
| locations[1].raw_type | |
| locations[1].license_id | |
| locations[1].is_accepted | True |
| locations[1].is_published | True |
| locations[1].raw_source_name | NAR genomics and bioinformatics |
| locations[1].landing_page_url | https://pubmed.ncbi.nlm.nih.gov/36071721 |
| locations[2].id | pmh:oai:pubmedcentral.nih.gov:9442500 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S2764455111 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | PubMed Central |
| locations[2].source.host_organization | https://openalex.org/I1299303238 |
| locations[2].source.host_organization_name | National Institutes of Health |
| locations[2].source.host_organization_lineage | https://openalex.org/I1299303238 |
| locations[2].license | other-oa |
| locations[2].pdf_url | |
| locations[2].version | submittedVersion |
| locations[2].raw_type | Text |
| locations[2].license_id | https://openalex.org/licenses/other-oa |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | NAR Genom Bioinform |
| locations[2].landing_page_url | https://www.ncbi.nlm.nih.gov/pmc/articles/9442500 |
| indexed_in | crossref, doaj, pubmed |
| authorships[0].author.id | https://openalex.org/A5000794908 |
| authorships[0].author.orcid | https://orcid.org/0000-0002-1418-4440 |
| authorships[0].author.display_name | Martin Swain |
| authorships[0].countries | GB |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I16038530 |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Life Sciences, Aberystwyth University , Penglais, Aberystwyth, Ceredigion, SY23 3DA, UK |
| authorships[0].institutions[0].id | https://openalex.org/I16038530 |
| authorships[0].institutions[0].ror | https://ror.org/015m2p889 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I16038530 |
| authorships[0].institutions[0].country_code | GB |
| authorships[0].institutions[0].display_name | Aberystwyth University |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Martin T Swain |
| authorships[0].is_corresponding | True |
| authorships[0].raw_affiliation_strings | Department of Life Sciences, Aberystwyth University , Penglais, Aberystwyth, Ceredigion, SY23 3DA, UK |
| authorships[1].author.id | https://openalex.org/A5046851463 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-1543-4827 |
| authorships[1].author.display_name | Martin Vickers |
| authorships[1].countries | GB |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I100288624, https://openalex.org/I2799300731 |
| authorships[1].affiliations[0].raw_affiliation_string | The John Innes Centre, Norwich Research Park , Norwich NR4 7UH, UK |
| authorships[1].institutions[0].id | https://openalex.org/I100288624 |
| authorships[1].institutions[0].ror | https://ror.org/055zmrh94 |
| authorships[1].institutions[0].type | facility |
| authorships[1].institutions[0].lineage | https://openalex.org/I100288624, https://openalex.org/I2799693246, https://openalex.org/I4210087105 |
| authorships[1].institutions[0].country_code | GB |
| authorships[1].institutions[0].display_name | John Innes Centre |
| authorships[1].institutions[1].id | https://openalex.org/I2799300731 |
| authorships[1].institutions[1].ror | https://ror.org/0062dz060 |
| authorships[1].institutions[1].type | archive |
| authorships[1].institutions[1].lineage | https://openalex.org/I2799300731 |
| authorships[1].institutions[1].country_code | GB |
| authorships[1].institutions[1].display_name | Norwich Research Park |
| authorships[1].author_position | last |
| authorships[1].raw_author_name | Martin Vickers |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | The John Innes Centre, Norwich Research Park , Norwich NR4 7UH, UK |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf |
| open_access.oa_status | gold |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Interpreting alignment-free sequence comparison: what makes a score a good score? |
| has_fulltext | False |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10015 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9998999834060669 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Genomics and Phylogenetic Studies |
| related_works | https://openalex.org/W2051969447, https://openalex.org/W2111937814, https://openalex.org/W2162923930, https://openalex.org/W1482324242, https://openalex.org/W2133116680, https://openalex.org/W2029514038, https://openalex.org/W2149492307, https://openalex.org/W1985408726, https://openalex.org/W187239587, https://openalex.org/W2141411672 |
| cited_by_count | 6 |
| counts_by_year[0].year | 2025 |
| counts_by_year[0].cited_by_count | 3 |
| counts_by_year[1].year | 2024 |
| counts_by_year[1].cited_by_count | 1 |
| counts_by_year[2].year | 2023 |
| counts_by_year[2].cited_by_count | 2 |
| locations_count | 3 |
| best_oa_location.id | doi:10.1093/nargab/lqac062 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4210241000 |
| best_oa_location.source.issn | 2631-9268 |
| best_oa_location.source.type | journal |
| best_oa_location.source.is_oa | True |
| best_oa_location.source.issn_l | 2631-9268 |
| best_oa_location.source.is_core | True |
| best_oa_location.source.is_in_doaj | True |
| best_oa_location.source.display_name | NAR Genomics and Bioinformatics |
| best_oa_location.source.host_organization | https://openalex.org/P4310311648 |
| best_oa_location.source.host_organization_name | Oxford University Press |
| best_oa_location.source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| best_oa_location.source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| best_oa_location.license | cc-by |
| best_oa_location.pdf_url | https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf |
| best_oa_location.version | publishedVersion |
| best_oa_location.raw_type | journal-article |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | True |
| best_oa_location.raw_source_name | NAR Genomics and Bioinformatics |
| best_oa_location.landing_page_url | https://doi.org/10.1093/nargab/lqac062 |
| primary_location.id | doi:10.1093/nargab/lqac062 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4210241000 |
| primary_location.source.issn | 2631-9268 |
| primary_location.source.type | journal |
| primary_location.source.is_oa | True |
| primary_location.source.issn_l | 2631-9268 |
| primary_location.source.is_core | True |
| primary_location.source.is_in_doaj | True |
| primary_location.source.display_name | NAR Genomics and Bioinformatics |
| primary_location.source.host_organization | https://openalex.org/P4310311648 |
| primary_location.source.host_organization_name | Oxford University Press |
| primary_location.source.host_organization_lineage | https://openalex.org/P4310311648, https://openalex.org/P4310311647 |
| primary_location.source.host_organization_lineage_names | Oxford University Press, University of Oxford |
| primary_location.license | cc-by |
| primary_location.pdf_url | https://academic.oup.com/nargab/article-pdf/4/3/lqac062/45690656/lqac062.pdf |
| primary_location.version | publishedVersion |
| primary_location.raw_type | journal-article |
| primary_location.license_id | https://openalex.org/licenses/cc-by |
| primary_location.is_accepted | True |
| primary_location.is_published | True |
| primary_location.raw_source_name | NAR Genomics and Bioinformatics |
| primary_location.landing_page_url | https://doi.org/10.1093/nargab/lqac062 |
| publication_date | 2022-07-09 |
| publication_year | 2022 |
| referenced_works | https://openalex.org/W2158714788, https://openalex.org/W2074231493, https://openalex.org/W2087064593, https://openalex.org/W2171963266, https://openalex.org/W2761430568, https://openalex.org/W2774657098, https://openalex.org/W2734297307, https://openalex.org/W2150208009, https://openalex.org/W2104267362, https://openalex.org/W2065128082, https://openalex.org/W2795696821, https://openalex.org/W6802954568, https://openalex.org/W2110332627, https://openalex.org/W2962807110, https://openalex.org/W3209469105, https://openalex.org/W2166162901, https://openalex.org/W2048721314, https://openalex.org/W2140211206, https://openalex.org/W6749712838, https://openalex.org/W6842402754, https://openalex.org/W1986908504, https://openalex.org/W2015843229, https://openalex.org/W2094890728, https://openalex.org/W2088287032, https://openalex.org/W1968363797, https://openalex.org/W2160445378, https://openalex.org/W2124636403, https://openalex.org/W2001660237, https://openalex.org/W2739113951, https://openalex.org/W2147618390, https://openalex.org/W2165897980, https://openalex.org/W2118931211, https://openalex.org/W2991142785, https://openalex.org/W2105039688, https://openalex.org/W2116402707, https://openalex.org/W2153283265, https://openalex.org/W4230501579, https://openalex.org/W2141652419, https://openalex.org/W2945282335, https://openalex.org/W1972166243, https://openalex.org/W2142678031, https://openalex.org/W2075996757, https://openalex.org/W2097892623, https://openalex.org/W2124626190, https://openalex.org/W2468251096, https://openalex.org/W2568830712, https://openalex.org/W2082521980, https://openalex.org/W2129103781, https://openalex.org/W1966443928, https://openalex.org/W2125628797, https://openalex.org/W2016007423, https://openalex.org/W2997624599, https://openalex.org/W1971985263, https://openalex.org/W2068308871, https://openalex.org/W2968450569, https://openalex.org/W2159954944, https://openalex.org/W2950589160, https://openalex.org/W3128249532, https://openalex.org/W2611554670, https://openalex.org/W2096116532, https://openalex.org/W2009571806, https://openalex.org/W2952597168, https://openalex.org/W2122210493, https://openalex.org/W2127036970, https://openalex.org/W2161336914, https://openalex.org/W6684430665, https://openalex.org/W6607976765, https://openalex.org/W2136145671, https://openalex.org/W3209528471, https://openalex.org/W2793666472, https://openalex.org/W4293196852, https://openalex.org/W195533127 |
| referenced_works_count | 72 |
| abstract_inverted_index.a | 21, 70, 187 |
| abstract_inverted_index.In | 103 |
| abstract_inverted_index.We | 31, 114 |
| abstract_inverted_index.an | 16, 84 |
| abstract_inverted_index.be | 69, 161 |
| abstract_inverted_index.if | 132 |
| abstract_inverted_index.in | 140, 195 |
| abstract_inverted_index.is | 20, 28, 135 |
| abstract_inverted_index.of | 26, 80, 118, 142, 151, 171, 180 |
| abstract_inverted_index.on | 198 |
| abstract_inverted_index.to | 5, 35, 56, 74, 166 |
| abstract_inverted_index.we | 93, 105, 155, 184 |
| abstract_inverted_index.DNA | 89 |
| abstract_inverted_index.The | 13 |
| abstract_inverted_index.and | 37, 59, 90, 130 |
| abstract_inverted_index.are | 3, 48 |
| abstract_inverted_index.can | 110, 160, 175 |
| abstract_inverted_index.for | 50, 193 |
| abstract_inverted_index.how | 107, 157 |
| abstract_inverted_index.may | 68, 123, 146 |
| abstract_inverted_index.not | 29 |
| abstract_inverted_index.the | 24, 78, 112, 133, 137, 143, 149, 169, 178 |
| abstract_inverted_index.use | 194 |
| abstract_inverted_index.KAST | 192 |
| abstract_inverted_index.This | 54, 174 |
| abstract_inverted_index.both | 88 |
| abstract_inverted_index.data | 11, 165 |
| abstract_inverted_index.even | 131 |
| abstract_inverted_index.from | 15, 40, 65 |
| abstract_inverted_index.gain | 75 |
| abstract_inverted_index.have | 185 |
| abstract_inverted_index.into | 77 |
| abstract_inverted_index.mean | 128 |
| abstract_inverted_index.pair | 145 |
| abstract_inverted_index.show | 115, 156 |
| abstract_inverted_index.test | 164 |
| abstract_inverted_index.that | 44, 116 |
| abstract_inverted_index.tool | 190 |
| abstract_inverted_index.true | 66, 119, 172 |
| abstract_inverted_index.used | 162 |
| abstract_inverted_index.when | 8 |
| abstract_inverted_index.with | 87, 126, 163 |
| abstract_inverted_index.Linux | 199 |
| abstract_inverted_index.Using | 83 |
| abstract_inverted_index.leads | 55 |
| abstract_inverted_index.pairs | 122 |
| abstract_inverted_index.score | 61, 97 |
| abstract_inverted_index.sets. | 12 |
| abstract_inverted_index.their | 127 |
| abstract_inverted_index.those | 64 |
| abstract_inverted_index.under | 100 |
| abstract_inverted_index.weak, | 136 |
| abstract_inverted_index.which | 27 |
| abstract_inverted_index.affect | 111 |
| abstract_inverted_index.called | 191 |
| abstract_inverted_index.length | 109, 141 |
| abstract_inverted_index.method | 73 |
| abstract_inverted_index.noting | 43 |
| abstract_inverted_index.output | 14 |
| abstract_inverted_index.reduce | 148 |
| abstract_inverted_index.score, | 23 |
| abstract_inverted_index.scores | 117 |
| abstract_inverted_index.simple | 72 |
| abstract_inverted_index.insight | 76 |
| abstract_inverted_index.length; | 129 |
| abstract_inverted_index.methods | 2, 7 |
| abstract_inverted_index.outputs | 39 |
| abstract_inverted_index.propose | 32 |
| abstract_inverted_index.protein | 91 |
| abstract_inverted_index.scores. | 113 |
| abstract_inverted_index.utility | 179 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.Finally, | 183 |
| abstract_inverted_index.approach | 86 |
| abstract_inverted_index.estimate | 168 |
| abstract_inverted_index.increase | 177 |
| abstract_inverted_index.metrics. | 82, 153 |
| abstract_inverted_index.positive | 120 |
| abstract_inverted_index.relative | 138 |
| abstract_inverted_index.sequence | 10, 18, 108, 121, 144 |
| abstract_inverted_index.software | 189 |
| abstract_inverted_index.calibrate | 38 |
| abstract_inverted_index.clusters. | 200 |
| abstract_inverted_index.comparing | 60 |
| abstract_inverted_index.contexts. | 53 |
| abstract_inverted_index.correlate | 124 |
| abstract_inverted_index.developed | 186 |
| abstract_inverted_index.different | 45, 51, 81, 95, 101 |
| abstract_inverted_index.empirical | 85 |
| abstract_inverted_index.functions | 34, 47, 159 |
| abstract_inverted_index.generated | 99 |
| abstract_inverted_index.including | 63 |
| abstract_inverted_index.interpret | 36 |
| abstract_inverted_index.necessary | 49 |
| abstract_inverted_index.objective | 33, 46, 158 |
| abstract_inverted_index.searches, | 42 |
| abstract_inverted_index.searching | 9 |
| abstract_inverted_index.workflows | 197 |
| abstract_inverted_index.accurately | 167 |
| abstract_inverted_index.biological | 52 |
| abstract_inverted_index.comparison | 19 |
| abstract_inverted_index.difference | 139 |
| abstract_inverted_index.positives, | 67 |
| abstract_inverted_index.positives. | 173 |
| abstract_inverted_index.relatively | 71 |
| abstract_inverted_index.sequences, | 92 |
| abstract_inverted_index.similarity | 22, 96 |
| abstract_inverted_index.advantages: | 57 |
| abstract_inverted_index.approaches. | 182 |
| abstract_inverted_index.correlation | 134 |
| abstract_inverted_index.demonstrate | 106 |
| abstract_inverted_index.parameters. | 102 |
| abstract_inverted_index.particular, | 104 |
| abstract_inverted_index.performance | 79 |
| abstract_inverted_index.probability | 170 |
| abstract_inverted_index.visualising | 58 |
| abstract_inverted_index.Importantly, | 154 |
| abstract_inverted_index.alternatives | 4 |
| abstract_inverted_index.characterise | 94 |
| abstract_inverted_index.distributions | 98 |
| abstract_inverted_index.effectiveness | 150 |
| abstract_inverted_index.significantly | 125, 147, 176 |
| abstract_inverted_index.Alignment-free | 1 |
| abstract_inverted_index.alignment-free | 17, 41, 152, 181 |
| abstract_inverted_index.distributions, | 62 |
| abstract_inverted_index.interpretation | 25 |
| abstract_inverted_index.alignment-based | 6 |
| abstract_inverted_index.general-purpose | 188 |
| abstract_inverted_index.high-throughput | 196 |
| abstract_inverted_index.straightforward. | 30 |
| cited_by_percentile_year.max | 97 |
| cited_by_percentile_year.min | 90 |
| corresponding_author_ids | https://openalex.org/A5000794908 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 2 |
| corresponding_institution_ids | https://openalex.org/I16038530 |
| citation_normalized_percentile.value | 0.64432181 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |
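
The `abstract_inverted_index.*` rows in the payload store the abstract as a map from each word to the positions where it occurs. A minimal sketch of turning such a map back into running text is shown below. The function name `rebuild_abstract` is hypothetical, and the sample dictionary is only a small subset of the rows above (positions 0 to 7), assumed to have been parsed into a Python dict first.

```python
from typing import Dict, List


def rebuild_abstract(inverted_index: Dict[str, List[int]]) -> str:
    """Rebuild text from a word -> positions map by sorting on position."""
    words_by_position = {}
    for word, positions in inverted_index.items():
        for position in positions:
            words_by_position[position] = word
    return " ".join(words_by_position[p] for p in sorted(words_by_position))


# Subset of the payload rows, restricted to positions 0..7.
sample = {
    "Abstract": [0],
    "Alignment-free": [1],
    "methods": [2, 7],
    "are": [3],
    "alternatives": [4],
    "to": [5],
    "alignment-based": [6],
}
print(rebuild_abstract(sample))
```

Joining on single spaces works here because punctuation is stored attached to its word (for example `sets.` and `score,`), so no separate detokenisation step is needed.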