GaKCo: a Fast Gappedk-mer string Kernel using Counting Article Swipe
YOU?
·
· 2018
· Open Access
·
· DOI: https://doi.org/10.1101/329425
String Kernel (SK) techniques, especially those using gapped k -mers as features (gk), have obtained great success in classifying sequences like DNA, protein, and text. However, the state-of-the-art gk-SK runs extremely slow when we increase the dictionary size ( Σ ) or allow more mismatches ( M ). This is because current gk-SK uses a trie-based algorithm to calculate co-occurrence of mismatched substrings resulting in a time cost proportional to O ( Σ M ). We propose a fast algorithm for calculating Ga pped k -mer K ernel using Co unting (GaKCo). GaKCo uses associative arrays to calculate the co-occurrence of substrings using cumulative counting. This algorithm is fast, scalable to larger Σ and M , and naturally parallelizable. We provide a rigorous asymptotic analysis that compares GaKCo with the state-of-the-art gk-SK. Theoretically, the time cost of GaKCo is independent of the Σ M term that slows down the trie-based approach. Experimentally, we observe that GaKCo achieves the same accuracy as the state-of-the-art and outperforms its speed by factors of 2, 100, and 4, on classifying sequences of DNA (5 datasets), protein (12 datasets), and character-based English text (2 datasets). 1
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- https://doi.org/10.1101/329425
- https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdf
- OA Status
- green
- Cited By
- 4
- References
- 33
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W2950956043
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W2950956043Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1101/329425Digital Object Identifier
- Title
-
GaKCo: a Fast Gappedk-mer string Kernel using CountingWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2018Year of publication
- Publication date
-
2018-05-24Full publication date if available
- Authors
-
Ritambhara Singh, Arshdeep Sekhon, Jack Lanchantin, Kamran Kowsari, Beilun Wang, Yanjun QiList of authors in order
- Landing page
-
https://doi.org/10.1101/329425Publisher landing page
- PDF URL
-
https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdfDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdfDirect OA link when available
- Concepts
-
Substring, Trie, Kernel (algebra), String (physics), Computer science, k-mer, Scalability, Parallelizable manifold, Algorithm, State (computer science), Character (mathematics), Mathematics, Combinatorics, Data structure, DNA, DNA sequencing, Geometry, Database, Biology, Programming language, Genetics, Mathematical physicsTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
4Total citation count in OpenAlex
- Citations by year (recent)
-
2023: 1, 2020: 2, 2019: 1Per-year citation counts (last 5 years)
- References (count)
-
33Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W2950956043 |
|---|---|
| doi | https://doi.org/10.1101/329425 |
| ids.doi | https://doi.org/10.1101/329425 |
| ids.mag | 2950956043 |
| ids.openalex | https://openalex.org/W2950956043 |
| fwci | 0.32131384 |
| type | preprint |
| title | GaKCo: a Fast Gappedk-mer string Kernel using Counting |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10015 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9983000159263611 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Genomics and Phylogenetic Studies |
| topics[1].id | https://openalex.org/T10521 |
| topics[1].field.id | https://openalex.org/fields/13 |
| topics[1].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[1].score | 0.9983000159263611 |
| topics[1].domain.id | https://openalex.org/domains/1 |
| topics[1].domain.display_name | Life Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1312 |
| topics[1].subfield.display_name | Molecular Biology |
| topics[1].display_name | RNA and protein synthesis mechanisms |
| topics[2].id | https://openalex.org/T10222 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9972000122070312 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1312 |
| topics[2].subfield.display_name | Molecular Biology |
| topics[2].display_name | Genomics and Chromatin Dynamics |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C182407805 |
| concepts[0].level | 3 |
| concepts[0].score | 0.9733496308326721 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q2626534 |
| concepts[0].display_name | Substring |
| concepts[1].id | https://openalex.org/C190290938 |
| concepts[1].level | 3 |
| concepts[1].score | 0.7296486496925354 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q387015 |
| concepts[1].display_name | Trie |
| concepts[2].id | https://openalex.org/C74193536 |
| concepts[2].level | 2 |
| concepts[2].score | 0.6802997589111328 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q574844 |
| concepts[2].display_name | Kernel (algebra) |
| concepts[3].id | https://openalex.org/C157486923 |
| concepts[3].level | 2 |
| concepts[3].score | 0.6591174602508545 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q1376436 |
| concepts[3].display_name | String (physics) |
| concepts[4].id | https://openalex.org/C41008148 |
| concepts[4].level | 0 |
| concepts[4].score | 0.5779857635498047 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[4].display_name | Computer science |
| concepts[5].id | https://openalex.org/C2279292 |
| concepts[5].level | 4 |
| concepts[5].score | 0.5544005036354065 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q6322851 |
| concepts[5].display_name | k-mer |
| concepts[6].id | https://openalex.org/C48044578 |
| concepts[6].level | 2 |
| concepts[6].score | 0.5485230684280396 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q727490 |
| concepts[6].display_name | Scalability |
| concepts[7].id | https://openalex.org/C148047603 |
| concepts[7].level | 2 |
| concepts[7].score | 0.5311328768730164 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q1014612 |
| concepts[7].display_name | Parallelizable manifold |
| concepts[8].id | https://openalex.org/C11413529 |
| concepts[8].level | 1 |
| concepts[8].score | 0.525844395160675 |
| concepts[8].wikidata | https://www.wikidata.org/wiki/Q8366 |
| concepts[8].display_name | Algorithm |
| concepts[9].id | https://openalex.org/C48103436 |
| concepts[9].level | 2 |
| concepts[9].score | 0.494582861661911 |
| concepts[9].wikidata | https://www.wikidata.org/wiki/Q599031 |
| concepts[9].display_name | State (computer science) |
| concepts[10].id | https://openalex.org/C2780861071 |
| concepts[10].level | 2 |
| concepts[10].score | 0.4430028796195984 |
| concepts[10].wikidata | https://www.wikidata.org/wiki/Q1062934 |
| concepts[10].display_name | Character (mathematics) |
| concepts[11].id | https://openalex.org/C33923547 |
| concepts[11].level | 0 |
| concepts[11].score | 0.2988877296447754 |
| concepts[11].wikidata | https://www.wikidata.org/wiki/Q395 |
| concepts[11].display_name | Mathematics |
| concepts[12].id | https://openalex.org/C114614502 |
| concepts[12].level | 1 |
| concepts[12].score | 0.24151629209518433 |
| concepts[12].wikidata | https://www.wikidata.org/wiki/Q76592 |
| concepts[12].display_name | Combinatorics |
| concepts[13].id | https://openalex.org/C162319229 |
| concepts[13].level | 2 |
| concepts[13].score | 0.16964566707611084 |
| concepts[13].wikidata | https://www.wikidata.org/wiki/Q175263 |
| concepts[13].display_name | Data structure |
| concepts[14].id | https://openalex.org/C552990157 |
| concepts[14].level | 2 |
| concepts[14].score | 0.11794257164001465 |
| concepts[14].wikidata | https://www.wikidata.org/wiki/Q7430 |
| concepts[14].display_name | DNA |
| concepts[15].id | https://openalex.org/C51679486 |
| concepts[15].level | 3 |
| concepts[15].score | 0.11357802152633667 |
| concepts[15].wikidata | https://www.wikidata.org/wiki/Q380546 |
| concepts[15].display_name | DNA sequencing |
| concepts[16].id | https://openalex.org/C2524010 |
| concepts[16].level | 1 |
| concepts[16].score | 0.09141889214515686 |
| concepts[16].wikidata | https://www.wikidata.org/wiki/Q8087 |
| concepts[16].display_name | Geometry |
| concepts[17].id | https://openalex.org/C77088390 |
| concepts[17].level | 1 |
| concepts[17].score | 0.0 |
| concepts[17].wikidata | https://www.wikidata.org/wiki/Q8513 |
| concepts[17].display_name | Database |
| concepts[18].id | https://openalex.org/C86803240 |
| concepts[18].level | 0 |
| concepts[18].score | 0.0 |
| concepts[18].wikidata | https://www.wikidata.org/wiki/Q420 |
| concepts[18].display_name | Biology |
| concepts[19].id | https://openalex.org/C199360897 |
| concepts[19].level | 1 |
| concepts[19].score | 0.0 |
| concepts[19].wikidata | https://www.wikidata.org/wiki/Q9143 |
| concepts[19].display_name | Programming language |
| concepts[20].id | https://openalex.org/C54355233 |
| concepts[20].level | 1 |
| concepts[20].score | 0.0 |
| concepts[20].wikidata | https://www.wikidata.org/wiki/Q7162 |
| concepts[20].display_name | Genetics |
| concepts[21].id | https://openalex.org/C37914503 |
| concepts[21].level | 1 |
| concepts[21].score | 0.0 |
| concepts[21].wikidata | https://www.wikidata.org/wiki/Q156495 |
| concepts[21].display_name | Mathematical physics |
| keywords[0].id | https://openalex.org/keywords/substring |
| keywords[0].score | 0.9733496308326721 |
| keywords[0].display_name | Substring |
| keywords[1].id | https://openalex.org/keywords/trie |
| keywords[1].score | 0.7296486496925354 |
| keywords[1].display_name | Trie |
| keywords[2].id | https://openalex.org/keywords/kernel |
| keywords[2].score | 0.6802997589111328 |
| keywords[2].display_name | Kernel (algebra) |
| keywords[3].id | https://openalex.org/keywords/string |
| keywords[3].score | 0.6591174602508545 |
| keywords[3].display_name | String (physics) |
| keywords[4].id | https://openalex.org/keywords/computer-science |
| keywords[4].score | 0.5779857635498047 |
| keywords[4].display_name | Computer science |
| keywords[5].id | https://openalex.org/keywords/k-mer |
| keywords[5].score | 0.5544005036354065 |
| keywords[5].display_name | k-mer |
| keywords[6].id | https://openalex.org/keywords/scalability |
| keywords[6].score | 0.5485230684280396 |
| keywords[6].display_name | Scalability |
| keywords[7].id | https://openalex.org/keywords/parallelizable-manifold |
| keywords[7].score | 0.5311328768730164 |
| keywords[7].display_name | Parallelizable manifold |
| keywords[8].id | https://openalex.org/keywords/algorithm |
| keywords[8].score | 0.525844395160675 |
| keywords[8].display_name | Algorithm |
| keywords[9].id | https://openalex.org/keywords/state |
| keywords[9].score | 0.494582861661911 |
| keywords[9].display_name | State (computer science) |
| keywords[10].id | https://openalex.org/keywords/character |
| keywords[10].score | 0.4430028796195984 |
| keywords[10].display_name | Character (mathematics) |
| keywords[11].id | https://openalex.org/keywords/mathematics |
| keywords[11].score | 0.2988877296447754 |
| keywords[11].display_name | Mathematics |
| keywords[12].id | https://openalex.org/keywords/combinatorics |
| keywords[12].score | 0.24151629209518433 |
| keywords[12].display_name | Combinatorics |
| keywords[13].id | https://openalex.org/keywords/data-structure |
| keywords[13].score | 0.16964566707611084 |
| keywords[13].display_name | Data structure |
| keywords[14].id | https://openalex.org/keywords/dna |
| keywords[14].score | 0.11794257164001465 |
| keywords[14].display_name | DNA |
| keywords[15].id | https://openalex.org/keywords/dna-sequencing |
| keywords[15].score | 0.11357802152633667 |
| keywords[15].display_name | DNA sequencing |
| keywords[16].id | https://openalex.org/keywords/geometry |
| keywords[16].score | 0.09141889214515686 |
| keywords[16].display_name | Geometry |
| language | en |
| locations[0].id | doi:10.1101/329425 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306402567 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| locations[0].source.host_organization | https://openalex.org/I2750212522 |
| locations[0].source.host_organization_name | Cold Spring Harbor Laboratory |
| locations[0].source.host_organization_lineage | https://openalex.org/I2750212522 |
| locations[0].license | cc-by-nc-nd |
| locations[0].pdf_url | https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdf |
| locations[0].version | acceptedVersion |
| locations[0].raw_type | posted-content |
| locations[0].license_id | https://openalex.org/licenses/cc-by-nc-nd |
| locations[0].is_accepted | True |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.1101/329425 |
| indexed_in | crossref |
| authorships[0].author.id | https://openalex.org/A5054482830 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-9857-1415 |
| authorships[0].author.display_name | Ritambhara Singh |
| authorships[0].countries | US |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[0].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[0].institutions[0].id | https://openalex.org/I51556381 |
| authorships[0].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[0].institutions[0].country_code | US |
| authorships[0].institutions[0].display_name | University of Virginia |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Ritambhara Singh |
| authorships[0].is_corresponding | True |
| authorships[0].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| authorships[1].author.id | https://openalex.org/A5048094604 |
| authorships[1].author.orcid | |
| authorships[1].author.display_name | Arshdeep Sekhon |
| authorships[1].countries | US |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[1].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[1].institutions[0].id | https://openalex.org/I51556381 |
| authorships[1].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[1].institutions[0].country_code | US |
| authorships[1].institutions[0].display_name | University of Virginia |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Arshdeep Sekhon |
| authorships[1].is_corresponding | True |
| authorships[1].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| authorships[2].author.id | https://openalex.org/A5016503379 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-0811-0944 |
| authorships[2].author.display_name | Jack Lanchantin |
| authorships[2].countries | US |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[2].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[2].institutions[0].id | https://openalex.org/I51556381 |
| authorships[2].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[2].institutions[0].country_code | US |
| authorships[2].institutions[0].display_name | University of Virginia |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Jack Lanchantin |
| authorships[2].is_corresponding | True |
| authorships[2].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| authorships[3].author.id | https://openalex.org/A5001354047 |
| authorships[3].author.orcid | https://orcid.org/0000-0002-6451-4786 |
| authorships[3].author.display_name | Kamran Kowsari |
| authorships[3].countries | US |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[3].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[3].institutions[0].id | https://openalex.org/I51556381 |
| authorships[3].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[3].institutions[0].country_code | US |
| authorships[3].institutions[0].display_name | University of Virginia |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Kamran Kowsari |
| authorships[3].is_corresponding | True |
| authorships[3].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| authorships[4].author.id | https://openalex.org/A5009957868 |
| authorships[4].author.orcid | https://orcid.org/0000-0002-2646-1492 |
| authorships[4].author.display_name | Beilun Wang |
| authorships[4].countries | US |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[4].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[4].institutions[0].id | https://openalex.org/I51556381 |
| authorships[4].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[4].institutions[0].country_code | US |
| authorships[4].institutions[0].display_name | University of Virginia |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Beilun Wang |
| authorships[4].is_corresponding | True |
| authorships[4].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| authorships[5].author.id | https://openalex.org/A5101887931 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-5796-7453 |
| authorships[5].author.display_name | Yanjun Qi |
| authorships[5].countries | US |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I51556381 |
| authorships[5].affiliations[0].raw_affiliation_string | Department of Computer Science, University of Virginia () |
| authorships[5].institutions[0].id | https://openalex.org/I51556381 |
| authorships[5].institutions[0].ror | https://ror.org/0153tk833 |
| authorships[5].institutions[0].type | education |
| authorships[5].institutions[0].lineage | https://openalex.org/I51556381 |
| authorships[5].institutions[0].country_code | US |
| authorships[5].institutions[0].display_name | University of Virginia |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | Yanjun Qi |
| authorships[5].is_corresponding | True |
| authorships[5].raw_affiliation_strings | Department of Computer Science, University of Virginia () |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdf |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | GaKCo: a Fast Gappedk-mer string Kernel using Counting |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10015 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9983000159263611 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Genomics and Phylogenetic Studies |
| related_works | https://openalex.org/W2196636117, https://openalex.org/W2067956663, https://openalex.org/W2051304374, https://openalex.org/W4286787697, https://openalex.org/W1505605100, https://openalex.org/W3153057271, https://openalex.org/W3043769852, https://openalex.org/W2014450733, https://openalex.org/W2411730464, https://openalex.org/W2950956043 |
| cited_by_count | 4 |
| counts_by_year[0].year | 2023 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2020 |
| counts_by_year[1].cited_by_count | 2 |
| counts_by_year[2].year | 2019 |
| counts_by_year[2].cited_by_count | 1 |
| locations_count | 1 |
| best_oa_location.id | doi:10.1101/329425 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306402567 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| best_oa_location.source.host_organization | https://openalex.org/I2750212522 |
| best_oa_location.source.host_organization_name | Cold Spring Harbor Laboratory |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I2750212522 |
| best_oa_location.license | cc-by-nc-nd |
| best_oa_location.pdf_url | https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdf |
| best_oa_location.version | acceptedVersion |
| best_oa_location.raw_type | posted-content |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by-nc-nd |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.1101/329425 |
| primary_location.id | doi:10.1101/329425 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306402567 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| primary_location.source.host_organization | https://openalex.org/I2750212522 |
| primary_location.source.host_organization_name | Cold Spring Harbor Laboratory |
| primary_location.source.host_organization_lineage | https://openalex.org/I2750212522 |
| primary_location.license | cc-by-nc-nd |
| primary_location.pdf_url | https://www.biorxiv.org/content/biorxiv/early/2018/05/24/329425.full.pdf |
| primary_location.version | acceptedVersion |
| primary_location.raw_type | posted-content |
| primary_location.license_id | https://openalex.org/licenses/cc-by-nc-nd |
| primary_location.is_accepted | True |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.1101/329425 |
| publication_date | 2018-05-24 |
| publication_year | 2018 |
| referenced_works | https://openalex.org/W1019830208, https://openalex.org/W3137072739, https://openalex.org/W1989338936, https://openalex.org/W1498183065, https://openalex.org/W3120421331, https://openalex.org/W2113592823, https://openalex.org/W2259938310, https://openalex.org/W2051520360, https://openalex.org/W1988581590, https://openalex.org/W2013887116, https://openalex.org/W2283504545, https://openalex.org/W1995918845, https://openalex.org/W2108837639, https://openalex.org/W1991315287, https://openalex.org/W2119290215, https://openalex.org/W2020816856, https://openalex.org/W2359716859, https://openalex.org/W1480643256, https://openalex.org/W2081930221, https://openalex.org/W2096634299, https://openalex.org/W2171111703, https://openalex.org/W2126486632, https://openalex.org/W2089026820, https://openalex.org/W370702354, https://openalex.org/W2512192886, https://openalex.org/W2250966211, https://openalex.org/W2124225821, https://openalex.org/W2170240176, https://openalex.org/W4285719527, https://openalex.org/W2950518324, https://openalex.org/W2148603752, https://openalex.org/W2251939518, https://openalex.org/W2153635508 |
| referenced_works_count | 33 |
| abstract_inverted_index.( | 39, 46, 72 |
| abstract_inverted_index.) | 41 |
| abstract_inverted_index., | 116 |
| abstract_inverted_index.1 | 191 |
| abstract_inverted_index.K | 87 |
| abstract_inverted_index.M | 47, 74, 115, 144 |
| abstract_inverted_index.O | 71 |
| abstract_inverted_index.a | 55, 66, 78, 122 |
| abstract_inverted_index.k | 9, 85 |
| abstract_inverted_index.(2 | 189 |
| abstract_inverted_index.(5 | 180 |
| abstract_inverted_index.). | 48, 75 |
| abstract_inverted_index.2, | 171 |
| abstract_inverted_index.4, | 174 |
| abstract_inverted_index.Co | 90 |
| abstract_inverted_index.Ga | 83 |
| abstract_inverted_index.We | 76, 120 |
| abstract_inverted_index.as | 11, 161 |
| abstract_inverted_index.by | 168 |
| abstract_inverted_index.in | 18, 65 |
| abstract_inverted_index.is | 50, 108, 139 |
| abstract_inverted_index.of | 61, 101, 137, 141, 170, 178 |
| abstract_inverted_index.on | 175 |
| abstract_inverted_index.or | 42 |
| abstract_inverted_index.to | 58, 70, 97, 111 |
| abstract_inverted_index.we | 34, 153 |
| abstract_inverted_index.Σ | 40, 73, 113, 143 |
| abstract_inverted_index.(12 | 183 |
| abstract_inverted_index.DNA | 179 |
| abstract_inverted_index.and | 24, 114, 117, 164, 173, 185 |
| abstract_inverted_index.for | 81 |
| abstract_inverted_index.its | 166 |
| abstract_inverted_index.the | 27, 36, 99, 130, 134, 142, 149, 158, 162 |
| abstract_inverted_index.(SK) | 3 |
| abstract_inverted_index.-mer | 86 |
| abstract_inverted_index.100, | 172 |
| abstract_inverted_index.DNA, | 22 |
| abstract_inverted_index.This | 49, 106 |
| abstract_inverted_index.cost | 68, 136 |
| abstract_inverted_index.down | 148 |
| abstract_inverted_index.fast | 79 |
| abstract_inverted_index.have | 14 |
| abstract_inverted_index.like | 21 |
| abstract_inverted_index.more | 44 |
| abstract_inverted_index.pped | 84 |
| abstract_inverted_index.runs | 30 |
| abstract_inverted_index.same | 159 |
| abstract_inverted_index.size | 38 |
| abstract_inverted_index.slow | 32 |
| abstract_inverted_index.term | 145 |
| abstract_inverted_index.text | 188 |
| abstract_inverted_index.that | 126, 146, 155 |
| abstract_inverted_index.time | 67, 135 |
| abstract_inverted_index.uses | 54, 94 |
| abstract_inverted_index.when | 33 |
| abstract_inverted_index.with | 129 |
| abstract_inverted_index.(gk), | 13 |
| abstract_inverted_index.-mers | 10 |
| abstract_inverted_index.GaKCo | 93, 128, 138, 156 |
| abstract_inverted_index.allow | 43 |
| abstract_inverted_index.ernel | 88 |
| abstract_inverted_index.fast, | 109 |
| abstract_inverted_index.gk-SK | 29, 53 |
| abstract_inverted_index.great | 16 |
| abstract_inverted_index.slows | 147 |
| abstract_inverted_index.speed | 167 |
| abstract_inverted_index.text. | 25 |
| abstract_inverted_index.those | 6 |
| abstract_inverted_index.using | 7, 89, 103 |
| abstract_inverted_index.Kernel | 2 |
| abstract_inverted_index.String | 1 |
| abstract_inverted_index.arrays | 96 |
| abstract_inverted_index.gapped | 8 |
| abstract_inverted_index.gk-SK. | 132 |
| abstract_inverted_index.larger | 112 |
| abstract_inverted_index.unting | 91 |
| abstract_inverted_index.English | 187 |
| abstract_inverted_index.because | 51 |
| abstract_inverted_index.current | 52 |
| abstract_inverted_index.factors | 169 |
| abstract_inverted_index.observe | 154 |
| abstract_inverted_index.propose | 77 |
| abstract_inverted_index.protein | 182 |
| abstract_inverted_index.provide | 121 |
| abstract_inverted_index.success | 17 |
| abstract_inverted_index.(GaKCo). | 92 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.However, | 26 |
| abstract_inverted_index.accuracy | 160 |
| abstract_inverted_index.achieves | 157 |
| abstract_inverted_index.analysis | 125 |
| abstract_inverted_index.compares | 127 |
| abstract_inverted_index.features | 12 |
| abstract_inverted_index.increase | 35 |
| abstract_inverted_index.obtained | 15 |
| abstract_inverted_index.protein, | 23 |
| abstract_inverted_index.rigorous | 123 |
| abstract_inverted_index.scalable | 110 |
| abstract_inverted_index.algorithm | 57, 80, 107 |
| abstract_inverted_index.approach. | 151 |
| abstract_inverted_index.calculate | 59, 98 |
| abstract_inverted_index.counting. | 105 |
| abstract_inverted_index.extremely | 31 |
| abstract_inverted_index.naturally | 118 |
| abstract_inverted_index.resulting | 64 |
| abstract_inverted_index.sequences | 20, 177 |
| abstract_inverted_index.asymptotic | 124 |
| abstract_inverted_index.cumulative | 104 |
| abstract_inverted_index.datasets), | 181, 184 |
| abstract_inverted_index.datasets). | 190 |
| abstract_inverted_index.dictionary | 37 |
| abstract_inverted_index.especially | 5 |
| abstract_inverted_index.mismatched | 62 |
| abstract_inverted_index.mismatches | 45 |
| abstract_inverted_index.substrings | 63, 102 |
| abstract_inverted_index.trie-based | 56, 150 |
| abstract_inverted_index.associative | 95 |
| abstract_inverted_index.calculating | 82 |
| abstract_inverted_index.classifying | 19, 176 |
| abstract_inverted_index.independent | 140 |
| abstract_inverted_index.outperforms | 165 |
| abstract_inverted_index.techniques, | 4 |
| abstract_inverted_index.proportional | 69 |
| abstract_inverted_index.co-occurrence | 60, 100 |
| abstract_inverted_index.Theoretically, | 133 |
| abstract_inverted_index.Experimentally, | 152 |
| abstract_inverted_index.character-based | 186 |
| abstract_inverted_index.parallelizable. | 119 |
| abstract_inverted_index.state-of-the-art | 28, 131, 163 |
| cited_by_percentile_year.max | 96 |
| cited_by_percentile_year.min | 89 |
| corresponding_author_ids | https://openalex.org/A5016503379, https://openalex.org/A5001354047, https://openalex.org/A5009957868, https://openalex.org/A5054482830, https://openalex.org/A5048094604, https://openalex.org/A5101887931 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 6 |
| corresponding_institution_ids | https://openalex.org/I51556381 |
| citation_normalized_percentile.value | 0.59051749 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |