Sparse Binary Relation Representations for Genome Graph Annotation Article Swipe
YOU?
·
· 2018
· Open Access
·
· DOI: https://doi.org/10.1101/468512
High-throughput DNA sequencing data is accumulating in public repositories, and efficient approaches for storing and indexing such data are in high demand. In recent research, several graph data structures have been proposed to represent large sets of sequencing data and to allow for efficient querying of sequences. In particular, the concept of labeled de Bruijn graphs has been explored by several groups. While there has been good progress towards representing the sequence graph in small space, methods for storing a set of labels on top of such graphs are still not sufficiently explored. It is also currently not clear how characteristics of the input data, such as the sparsity and correlations of labels, can help to inform the choice of method to compress the graph labeling. In this work, we present a new compression approach, Multi-BRWT , which is adaptive to different kinds of input data. We show an up to 29% improvement in compression performance over the basic BRWT method, and up to a 68% improvement over the current state-of-the-art for de Bruijn graph label compression. To put our results into perspective, we present a systematic analysis of five different state-of-the-art annotation compression schemes, evaluate key metrics on both artificial and real-world data and discuss how different data characteristics influence the compression performance. We show that the improvements of our new method can be robustly reproduced for different representative real-world datasets.
Related Topics
- Type
- preprint
- Language
- en
- Landing Page
- https://doi.org/10.1101/468512
- https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdf
- OA Status
- green
- Cited By
- 4
- References
- 33
- Related Works
- 10
- OpenAlex ID
- https://openalex.org/W2900124800
Raw OpenAlex JSON
- OpenAlex ID
-
https://openalex.org/W2900124800Canonical identifier for this work in OpenAlex
- DOI
-
https://doi.org/10.1101/468512Digital Object Identifier
- Title
-
Sparse Binary Relation Representations for Genome Graph AnnotationWork title
- Type
-
preprintOpenAlex work type
- Language
-
enPrimary language
- Publication year
-
2018Year of publication
- Publication date
-
2018-11-12Full publication date if available
- Authors
-
Mikhail Karasikov, Harun Mustafa, Amir Joudaki, Sara Javadzadeh, Gunnar Rätsch, André KahlesList of authors in order
- Landing page
-
https://doi.org/10.1101/468512Publisher landing page
- PDF URL
-
https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdfDirect link to full text PDF
- Open access
-
YesWhether a free full text is available
- OA status
-
greenOpen access status per OpenAlex
- OA URL
-
https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdfDirect OA link when available
- Concepts
-
De Bruijn graph, Computer science, Graph, Search engine indexing, Theoretical computer science, Data compression, Data mining, Artificial intelligenceTop concepts (fields/topics) attached by OpenAlex
- Cited by
-
4Total citation count in OpenAlex
- Citations by year (recent)
-
2023: 1, 2020: 1, 2019: 1, 2017: 1Per-year citation counts (last 5 years)
- References (count)
-
33Number of works referenced by this work
- Related works (count)
-
10Other works algorithmically related by OpenAlex
Full payload
| id | https://openalex.org/W2900124800 |
|---|---|
| doi | https://doi.org/10.1101/468512 |
| ids.doi | https://doi.org/10.1101/468512 |
| ids.mag | 2900124800 |
| ids.openalex | https://openalex.org/W2900124800 |
| fwci | 0.21420922 |
| type | preprint |
| title | Sparse Binary Relation Representations for Genome Graph Annotation |
| biblio.issue | |
| biblio.volume | |
| biblio.last_page | |
| biblio.first_page | |
| topics[0].id | https://openalex.org/T10015 |
| topics[0].field.id | https://openalex.org/fields/13 |
| topics[0].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[0].score | 0.9994999766349792 |
| topics[0].domain.id | https://openalex.org/domains/1 |
| topics[0].domain.display_name | Life Sciences |
| topics[0].subfield.id | https://openalex.org/subfields/1312 |
| topics[0].subfield.display_name | Molecular Biology |
| topics[0].display_name | Genomics and Phylogenetic Studies |
| topics[1].id | https://openalex.org/T11269 |
| topics[1].field.id | https://openalex.org/fields/17 |
| topics[1].field.display_name | Computer Science |
| topics[1].score | 0.9987999796867371 |
| topics[1].domain.id | https://openalex.org/domains/3 |
| topics[1].domain.display_name | Physical Sciences |
| topics[1].subfield.id | https://openalex.org/subfields/1702 |
| topics[1].subfield.display_name | Artificial Intelligence |
| topics[1].display_name | Algorithms and Data Compression |
| topics[2].id | https://openalex.org/T10885 |
| topics[2].field.id | https://openalex.org/fields/13 |
| topics[2].field.display_name | Biochemistry, Genetics and Molecular Biology |
| topics[2].score | 0.9958999752998352 |
| topics[2].domain.id | https://openalex.org/domains/1 |
| topics[2].domain.display_name | Life Sciences |
| topics[2].subfield.id | https://openalex.org/subfields/1312 |
| topics[2].subfield.display_name | Molecular Biology |
| topics[2].display_name | Gene expression and cancer classification |
| is_xpac | False |
| apc_list | |
| apc_paid | |
| concepts[0].id | https://openalex.org/C20218877 |
| concepts[0].level | 3 |
| concepts[0].score | 0.7664037942886353 |
| concepts[0].wikidata | https://www.wikidata.org/wiki/Q3066095 |
| concepts[0].display_name | De Bruijn graph |
| concepts[1].id | https://openalex.org/C41008148 |
| concepts[1].level | 0 |
| concepts[1].score | 0.7566231489181519 |
| concepts[1].wikidata | https://www.wikidata.org/wiki/Q21198 |
| concepts[1].display_name | Computer science |
| concepts[2].id | https://openalex.org/C132525143 |
| concepts[2].level | 2 |
| concepts[2].score | 0.5299913287162781 |
| concepts[2].wikidata | https://www.wikidata.org/wiki/Q141488 |
| concepts[2].display_name | Graph |
| concepts[3].id | https://openalex.org/C75165309 |
| concepts[3].level | 2 |
| concepts[3].score | 0.5097696185112 |
| concepts[3].wikidata | https://www.wikidata.org/wiki/Q2258979 |
| concepts[3].display_name | Search engine indexing |
| concepts[4].id | https://openalex.org/C80444323 |
| concepts[4].level | 1 |
| concepts[4].score | 0.48764312267303467 |
| concepts[4].wikidata | https://www.wikidata.org/wiki/Q2878974 |
| concepts[4].display_name | Theoretical computer science |
| concepts[5].id | https://openalex.org/C78548338 |
| concepts[5].level | 2 |
| concepts[5].score | 0.48764076828956604 |
| concepts[5].wikidata | https://www.wikidata.org/wiki/Q2493 |
| concepts[5].display_name | Data compression |
| concepts[6].id | https://openalex.org/C124101348 |
| concepts[6].level | 1 |
| concepts[6].score | 0.4756888151168823 |
| concepts[6].wikidata | https://www.wikidata.org/wiki/Q172491 |
| concepts[6].display_name | Data mining |
| concepts[7].id | https://openalex.org/C154945302 |
| concepts[7].level | 1 |
| concepts[7].score | 0.27772238850593567 |
| concepts[7].wikidata | https://www.wikidata.org/wiki/Q11660 |
| concepts[7].display_name | Artificial intelligence |
| keywords[0].id | https://openalex.org/keywords/de-bruijn-graph |
| keywords[0].score | 0.7664037942886353 |
| keywords[0].display_name | De Bruijn graph |
| keywords[1].id | https://openalex.org/keywords/computer-science |
| keywords[1].score | 0.7566231489181519 |
| keywords[1].display_name | Computer science |
| keywords[2].id | https://openalex.org/keywords/graph |
| keywords[2].score | 0.5299913287162781 |
| keywords[2].display_name | Graph |
| keywords[3].id | https://openalex.org/keywords/search-engine-indexing |
| keywords[3].score | 0.5097696185112 |
| keywords[3].display_name | Search engine indexing |
| keywords[4].id | https://openalex.org/keywords/theoretical-computer-science |
| keywords[4].score | 0.48764312267303467 |
| keywords[4].display_name | Theoretical computer science |
| keywords[5].id | https://openalex.org/keywords/data-compression |
| keywords[5].score | 0.48764076828956604 |
| keywords[5].display_name | Data compression |
| keywords[6].id | https://openalex.org/keywords/data-mining |
| keywords[6].score | 0.4756888151168823 |
| keywords[6].display_name | Data mining |
| keywords[7].id | https://openalex.org/keywords/artificial-intelligence |
| keywords[7].score | 0.27772238850593567 |
| keywords[7].display_name | Artificial intelligence |
| language | en |
| locations[0].id | doi:10.1101/468512 |
| locations[0].is_oa | True |
| locations[0].source.id | https://openalex.org/S4306402567 |
| locations[0].source.issn | |
| locations[0].source.type | repository |
| locations[0].source.is_oa | False |
| locations[0].source.issn_l | |
| locations[0].source.is_core | False |
| locations[0].source.is_in_doaj | False |
| locations[0].source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| locations[0].source.host_organization | https://openalex.org/I2750212522 |
| locations[0].source.host_organization_name | Cold Spring Harbor Laboratory |
| locations[0].source.host_organization_lineage | https://openalex.org/I2750212522 |
| locations[0].license | cc-by-nc |
| locations[0].pdf_url | https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdf |
| locations[0].version | acceptedVersion |
| locations[0].raw_type | posted-content |
| locations[0].license_id | https://openalex.org/licenses/cc-by-nc |
| locations[0].is_accepted | True |
| locations[0].is_published | False |
| locations[0].raw_source_name | |
| locations[0].landing_page_url | https://doi.org/10.1101/468512 |
| locations[1].id | pmh:oai:www.research-collection.ethz.ch:20.500.11850/315747 |
| locations[1].is_oa | True |
| locations[1].source.id | https://openalex.org/S4306402302 |
| locations[1].source.issn | |
| locations[1].source.type | repository |
| locations[1].source.is_oa | False |
| locations[1].source.issn_l | |
| locations[1].source.is_core | False |
| locations[1].source.is_in_doaj | False |
| locations[1].source.display_name | Repository for Publications and Research Data (ETH Zurich) |
| locations[1].source.host_organization | https://openalex.org/I35440088 |
| locations[1].source.host_organization_name | ETH Zurich |
| locations[1].source.host_organization_lineage | https://openalex.org/I35440088 |
| locations[1].license | other-oa |
| locations[1].pdf_url | http://hdl.handle.net/20.500.11850/315747 |
| locations[1].version | submittedVersion |
| locations[1].raw_type | info:eu-repo/semantics/workingPaper |
| locations[1].license_id | https://openalex.org/licenses/other-oa |
| locations[1].is_accepted | False |
| locations[1].is_published | False |
| locations[1].raw_source_name | bioRxiv |
| locations[1].landing_page_url | http://hdl.handle.net/20.500.11850/315747 |
| locations[2].id | pmh:oai:www.research-collection.ethz.ch:20.500.11850/441526 |
| locations[2].is_oa | True |
| locations[2].source.id | https://openalex.org/S4306402302 |
| locations[2].source.issn | |
| locations[2].source.type | repository |
| locations[2].source.is_oa | False |
| locations[2].source.issn_l | |
| locations[2].source.is_core | False |
| locations[2].source.is_in_doaj | False |
| locations[2].source.display_name | Repository for Publications and Research Data (ETH Zurich) |
| locations[2].source.host_organization | https://openalex.org/I35440088 |
| locations[2].source.host_organization_name | ETH Zurich |
| locations[2].source.host_organization_lineage | https://openalex.org/I35440088 |
| locations[2].license | |
| locations[2].pdf_url | http://hdl.handle.net/20.500.11850/441526 |
| locations[2].version | submittedVersion |
| locations[2].raw_type | info:eu-repo/semantics/publishedVersion |
| locations[2].license_id | |
| locations[2].is_accepted | False |
| locations[2].is_published | False |
| locations[2].raw_source_name | Journal of Computational Biology, 27 (4) |
| locations[2].landing_page_url | http://hdl.handle.net/20.500.11850/441526 |
| locations[3].id | pmh:oai:www.research-collection.ethz.ch:20.500.11850/393658 |
| locations[3].is_oa | True |
| locations[3].source.id | https://openalex.org/S4306402302 |
| locations[3].source.issn | |
| locations[3].source.type | repository |
| locations[3].source.is_oa | False |
| locations[3].source.issn_l | |
| locations[3].source.is_core | False |
| locations[3].source.is_in_doaj | False |
| locations[3].source.display_name | Repository for Publications and Research Data (ETH Zurich) |
| locations[3].source.host_organization | https://openalex.org/I35440088 |
| locations[3].source.host_organization_name | ETH Zurich |
| locations[3].source.host_organization_lineage | https://openalex.org/I35440088 |
| locations[3].license | cc-by-nc |
| locations[3].pdf_url | |
| locations[3].version | submittedVersion |
| locations[3].raw_type | info:eu-repo/semantics/article |
| locations[3].license_id | https://openalex.org/licenses/cc-by-nc |
| locations[3].is_accepted | False |
| locations[3].is_published | False |
| locations[3].raw_source_name | Journal of Computational Biology, 27 (4) |
| locations[3].landing_page_url | http://hdl.handle.net/20.500.11850/393658 |
| locations[4].id | doi:10.3929/ethz-b-000314581 |
| locations[4].is_oa | True |
| locations[4].source.id | https://openalex.org/S7407051236 |
| locations[4].source.type | repository |
| locations[4].source.is_oa | False |
| locations[4].source.issn_l | |
| locations[4].source.is_core | False |
| locations[4].source.is_in_doaj | False |
| locations[4].source.display_name | ETH Zürich Research Collection |
| locations[4].source.host_organization | |
| locations[4].source.host_organization_name | |
| locations[4].license | |
| locations[4].pdf_url | |
| locations[4].version | |
| locations[4].raw_type | article-journal |
| locations[4].license_id | |
| locations[4].is_accepted | False |
| locations[4].is_published | |
| locations[4].raw_source_name | |
| locations[4].landing_page_url | https://doi.org/10.3929/ethz-b-000314581 |
| locations[5].id | doi:10.3929/ethz-b-000393658 |
| locations[5].is_oa | True |
| locations[5].source.id | https://openalex.org/S7407051236 |
| locations[5].source.type | repository |
| locations[5].source.is_oa | False |
| locations[5].source.issn_l | |
| locations[5].source.is_core | False |
| locations[5].source.is_in_doaj | False |
| locations[5].source.display_name | ETH Zürich Research Collection |
| locations[5].source.host_organization | |
| locations[5].source.host_organization_name | |
| locations[5].license | |
| locations[5].pdf_url | |
| locations[5].version | |
| locations[5].raw_type | article-journal |
| locations[5].license_id | |
| locations[5].is_accepted | False |
| locations[5].is_published | |
| locations[5].raw_source_name | |
| locations[5].landing_page_url | https://doi.org/10.3929/ethz-b-000393658 |
| indexed_in | crossref, datacite |
| authorships[0].author.id | https://openalex.org/A5085609478 |
| authorships[0].author.orcid | https://orcid.org/0000-0001-6200-5972 |
| authorships[0].author.display_name | Mikhail Karasikov |
| authorships[0].countries | CH |
| authorships[0].affiliations[0].institution_ids | https://openalex.org/I12708293 |
| authorships[0].affiliations[0].raw_affiliation_string | SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland |
| authorships[0].affiliations[1].institution_ids | https://openalex.org/I35440088 |
| authorships[0].affiliations[1].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[0].affiliations[2].institution_ids | https://openalex.org/I4210100468 |
| authorships[0].affiliations[2].raw_affiliation_string | University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[0].institutions[0].id | https://openalex.org/I35440088 |
| authorships[0].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[0].institutions[0].type | education |
| authorships[0].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[0].institutions[0].country_code | CH |
| authorships[0].institutions[0].display_name | ETH Zurich |
| authorships[0].institutions[1].id | https://openalex.org/I12708293 |
| authorships[0].institutions[1].ror | https://ror.org/002n09z45 |
| authorships[0].institutions[1].type | nonprofit |
| authorships[0].institutions[1].lineage | https://openalex.org/I12708293 |
| authorships[0].institutions[1].country_code | CH |
| authorships[0].institutions[1].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[0].institutions[2].id | https://openalex.org/I4210100468 |
| authorships[0].institutions[2].ror | https://ror.org/01462r250 |
| authorships[0].institutions[2].type | healthcare |
| authorships[0].institutions[2].lineage | https://openalex.org/I4210100468 |
| authorships[0].institutions[2].country_code | CH |
| authorships[0].institutions[2].display_name | University Hospital of Zurich |
| authorships[0].author_position | first |
| authorships[0].raw_author_name | Mikhail Karasikov |
| authorships[0].is_corresponding | False |
| authorships[0].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland, University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[1].author.id | https://openalex.org/A5033347097 |
| authorships[1].author.orcid | https://orcid.org/0000-0002-2125-6086 |
| authorships[1].author.display_name | Harun Mustafa |
| authorships[1].countries | CH |
| authorships[1].affiliations[0].institution_ids | https://openalex.org/I4210100468 |
| authorships[1].affiliations[0].raw_affiliation_string | University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[1].affiliations[1].institution_ids | https://openalex.org/I35440088 |
| authorships[1].affiliations[1].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[1].affiliations[2].institution_ids | https://openalex.org/I12708293 |
| authorships[1].affiliations[2].raw_affiliation_string | SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland |
| authorships[1].institutions[0].id | https://openalex.org/I35440088 |
| authorships[1].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[1].institutions[0].type | education |
| authorships[1].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[1].institutions[0].country_code | CH |
| authorships[1].institutions[0].display_name | ETH Zurich |
| authorships[1].institutions[1].id | https://openalex.org/I12708293 |
| authorships[1].institutions[1].ror | https://ror.org/002n09z45 |
| authorships[1].institutions[1].type | nonprofit |
| authorships[1].institutions[1].lineage | https://openalex.org/I12708293 |
| authorships[1].institutions[1].country_code | CH |
| authorships[1].institutions[1].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[1].institutions[2].id | https://openalex.org/I4210100468 |
| authorships[1].institutions[2].ror | https://ror.org/01462r250 |
| authorships[1].institutions[2].type | healthcare |
| authorships[1].institutions[2].lineage | https://openalex.org/I4210100468 |
| authorships[1].institutions[2].country_code | CH |
| authorships[1].institutions[2].display_name | University Hospital of Zurich |
| authorships[1].author_position | middle |
| authorships[1].raw_author_name | Harun Mustafa |
| authorships[1].is_corresponding | False |
| authorships[1].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland, University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[2].author.id | https://openalex.org/A5077792383 |
| authorships[2].author.orcid | https://orcid.org/0000-0003-0095-6750 |
| authorships[2].author.display_name | Amir Joudaki |
| authorships[2].countries | CH |
| authorships[2].affiliations[0].institution_ids | https://openalex.org/I4210100468 |
| authorships[2].affiliations[0].raw_affiliation_string | University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[2].affiliations[1].institution_ids | https://openalex.org/I12708293 |
| authorships[2].affiliations[1].raw_affiliation_string | SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland |
| authorships[2].affiliations[2].institution_ids | https://openalex.org/I35440088 |
| authorships[2].affiliations[2].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[2].institutions[0].id | https://openalex.org/I35440088 |
| authorships[2].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[2].institutions[0].type | education |
| authorships[2].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[2].institutions[0].country_code | CH |
| authorships[2].institutions[0].display_name | ETH Zurich |
| authorships[2].institutions[1].id | https://openalex.org/I12708293 |
| authorships[2].institutions[1].ror | https://ror.org/002n09z45 |
| authorships[2].institutions[1].type | nonprofit |
| authorships[2].institutions[1].lineage | https://openalex.org/I12708293 |
| authorships[2].institutions[1].country_code | CH |
| authorships[2].institutions[1].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[2].institutions[2].id | https://openalex.org/I4210100468 |
| authorships[2].institutions[2].ror | https://ror.org/01462r250 |
| authorships[2].institutions[2].type | healthcare |
| authorships[2].institutions[2].lineage | https://openalex.org/I4210100468 |
| authorships[2].institutions[2].country_code | CH |
| authorships[2].institutions[2].display_name | University Hospital of Zurich |
| authorships[2].author_position | middle |
| authorships[2].raw_author_name | Amir Joudaki |
| authorships[2].is_corresponding | False |
| authorships[2].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland, University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[3].author.id | https://openalex.org/A5038620773 |
| authorships[3].author.orcid | https://orcid.org/0000-0001-8240-2860 |
| authorships[3].author.display_name | Sara Javadzadeh |
| authorships[3].countries | CH |
| authorships[3].affiliations[0].institution_ids | https://openalex.org/I35440088 |
| authorships[3].affiliations[0].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[3].institutions[0].id | https://openalex.org/I35440088 |
| authorships[3].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[3].institutions[0].type | education |
| authorships[3].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[3].institutions[0].country_code | CH |
| authorships[3].institutions[0].display_name | ETH Zurich |
| authorships[3].author_position | middle |
| authorships[3].raw_author_name | Sara Javadzadeh-No |
| authorships[3].is_corresponding | False |
| authorships[3].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[4].author.id | https://openalex.org/A5035416263 |
| authorships[4].author.orcid | https://orcid.org/0000-0001-5486-8532 |
| authorships[4].author.display_name | Gunnar Rätsch |
| authorships[4].countries | CH |
| authorships[4].affiliations[0].institution_ids | https://openalex.org/I35440088 |
| authorships[4].affiliations[0].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[4].affiliations[1].institution_ids | https://openalex.org/I12708293 |
| authorships[4].affiliations[1].raw_affiliation_string | SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland |
| authorships[4].affiliations[2].institution_ids | https://openalex.org/I4210100468 |
| authorships[4].affiliations[2].raw_affiliation_string | University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[4].institutions[0].id | https://openalex.org/I35440088 |
| authorships[4].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[4].institutions[0].type | education |
| authorships[4].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[4].institutions[0].country_code | CH |
| authorships[4].institutions[0].display_name | ETH Zurich |
| authorships[4].institutions[1].id | https://openalex.org/I12708293 |
| authorships[4].institutions[1].ror | https://ror.org/002n09z45 |
| authorships[4].institutions[1].type | nonprofit |
| authorships[4].institutions[1].lineage | https://openalex.org/I12708293 |
| authorships[4].institutions[1].country_code | CH |
| authorships[4].institutions[1].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[4].institutions[2].id | https://openalex.org/I4210100468 |
| authorships[4].institutions[2].ror | https://ror.org/01462r250 |
| authorships[4].institutions[2].type | healthcare |
| authorships[4].institutions[2].lineage | https://openalex.org/I4210100468 |
| authorships[4].institutions[2].country_code | CH |
| authorships[4].institutions[2].display_name | University Hospital of Zurich |
| authorships[4].author_position | middle |
| authorships[4].raw_author_name | Gunnar Rätsch |
| authorships[4].is_corresponding | False |
| authorships[4].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland, University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[5].author.id | https://openalex.org/A5070168682 |
| authorships[5].author.orcid | https://orcid.org/0000-0002-3411-0692 |
| authorships[5].author.display_name | André Kahles |
| authorships[5].countries | CH |
| authorships[5].affiliations[0].institution_ids | https://openalex.org/I35440088 |
| authorships[5].affiliations[0].raw_affiliation_string | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland |
| authorships[5].affiliations[1].institution_ids | https://openalex.org/I12708293 |
| authorships[5].affiliations[1].raw_affiliation_string | SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland |
| authorships[5].affiliations[2].institution_ids | https://openalex.org/I4210100468 |
| authorships[5].affiliations[2].raw_affiliation_string | University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| authorships[5].institutions[0].id | https://openalex.org/I35440088 |
| authorships[5].institutions[0].ror | https://ror.org/05a28rw58 |
| authorships[5].institutions[0].type | education |
| authorships[5].institutions[0].lineage | https://openalex.org/I2799323385, https://openalex.org/I35440088 |
| authorships[5].institutions[0].country_code | CH |
| authorships[5].institutions[0].display_name | ETH Zurich |
| authorships[5].institutions[1].id | https://openalex.org/I12708293 |
| authorships[5].institutions[1].ror | https://ror.org/002n09z45 |
| authorships[5].institutions[1].type | nonprofit |
| authorships[5].institutions[1].lineage | https://openalex.org/I12708293 |
| authorships[5].institutions[1].country_code | CH |
| authorships[5].institutions[1].display_name | SIB Swiss Institute of Bioinformatics |
| authorships[5].institutions[2].id | https://openalex.org/I4210100468 |
| authorships[5].institutions[2].ror | https://ror.org/01462r250 |
| authorships[5].institutions[2].type | healthcare |
| authorships[5].institutions[2].lineage | https://openalex.org/I4210100468 |
| authorships[5].institutions[2].country_code | CH |
| authorships[5].institutions[2].display_name | University Hospital of Zurich |
| authorships[5].author_position | last |
| authorships[5].raw_author_name | André Kahles |
| authorships[5].is_corresponding | False |
| authorships[5].raw_affiliation_strings | Department of Computer Science, ETH Zurich, Zurich 8092, Switzerland, SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland, University Hospital Zurich, Biomedical Informatics Research, Zurich 8091, Switzerland |
| has_content.pdf | True |
| has_content.grobid_xml | True |
| is_paratext | False |
| open_access.is_oa | True |
| open_access.oa_url | https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdf |
| open_access.oa_status | green |
| open_access.any_repository_has_fulltext | False |
| created_date | 2025-10-10T00:00:00 |
| display_name | Sparse Binary Relation Representations for Genome Graph Annotation |
| has_fulltext | True |
| is_retracted | False |
| updated_date | 2025-11-06T03:46:38.306776 |
| primary_topic.id | https://openalex.org/T10015 |
| primary_topic.field.id | https://openalex.org/fields/13 |
| primary_topic.field.display_name | Biochemistry, Genetics and Molecular Biology |
| primary_topic.score | 0.9994999766349792 |
| primary_topic.domain.id | https://openalex.org/domains/1 |
| primary_topic.domain.display_name | Life Sciences |
| primary_topic.subfield.id | https://openalex.org/subfields/1312 |
| primary_topic.subfield.display_name | Molecular Biology |
| primary_topic.display_name | Genomics and Phylogenetic Studies |
| related_works | https://openalex.org/W3024364549, https://openalex.org/W4206019083, https://openalex.org/W2048865712, https://openalex.org/W1976265003, https://openalex.org/W2370378377, https://openalex.org/W2130160813, https://openalex.org/W2122316710, https://openalex.org/W4301429973, https://openalex.org/W2951695076, https://openalex.org/W1964377951 |
| cited_by_count | 4 |
| counts_by_year[0].year | 2023 |
| counts_by_year[0].cited_by_count | 1 |
| counts_by_year[1].year | 2020 |
| counts_by_year[1].cited_by_count | 1 |
| counts_by_year[2].year | 2019 |
| counts_by_year[2].cited_by_count | 1 |
| counts_by_year[3].year | 2017 |
| counts_by_year[3].cited_by_count | 1 |
| locations_count | 6 |
| best_oa_location.id | doi:10.1101/468512 |
| best_oa_location.is_oa | True |
| best_oa_location.source.id | https://openalex.org/S4306402567 |
| best_oa_location.source.issn | |
| best_oa_location.source.type | repository |
| best_oa_location.source.is_oa | False |
| best_oa_location.source.issn_l | |
| best_oa_location.source.is_core | False |
| best_oa_location.source.is_in_doaj | False |
| best_oa_location.source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| best_oa_location.source.host_organization | https://openalex.org/I2750212522 |
| best_oa_location.source.host_organization_name | Cold Spring Harbor Laboratory |
| best_oa_location.source.host_organization_lineage | https://openalex.org/I2750212522 |
| best_oa_location.license | cc-by-nc |
| best_oa_location.pdf_url | https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdf |
| best_oa_location.version | acceptedVersion |
| best_oa_location.raw_type | posted-content |
| best_oa_location.license_id | https://openalex.org/licenses/cc-by-nc |
| best_oa_location.is_accepted | True |
| best_oa_location.is_published | False |
| best_oa_location.raw_source_name | |
| best_oa_location.landing_page_url | https://doi.org/10.1101/468512 |
| primary_location.id | doi:10.1101/468512 |
| primary_location.is_oa | True |
| primary_location.source.id | https://openalex.org/S4306402567 |
| primary_location.source.issn | |
| primary_location.source.type | repository |
| primary_location.source.is_oa | False |
| primary_location.source.issn_l | |
| primary_location.source.is_core | False |
| primary_location.source.is_in_doaj | False |
| primary_location.source.display_name | bioRxiv (Cold Spring Harbor Laboratory) |
| primary_location.source.host_organization | https://openalex.org/I2750212522 |
| primary_location.source.host_organization_name | Cold Spring Harbor Laboratory |
| primary_location.source.host_organization_lineage | https://openalex.org/I2750212522 |
| primary_location.license | cc-by-nc |
| primary_location.pdf_url | https://www.biorxiv.org/content/biorxiv/early/2019/01/25/468512.full.pdf |
| primary_location.version | acceptedVersion |
| primary_location.raw_type | posted-content |
| primary_location.license_id | https://openalex.org/licenses/cc-by-nc |
| primary_location.is_accepted | True |
| primary_location.is_published | False |
| primary_location.raw_source_name | |
| primary_location.landing_page_url | https://doi.org/10.1101/468512 |
| publication_date | 2018-11-12 |
| publication_year | 2018 |
| referenced_works | https://openalex.org/W4234797890, https://openalex.org/W2728987999, https://openalex.org/W2116258248, https://openalex.org/W2092138236, https://openalex.org/W1549020781, https://openalex.org/W1962019683, https://openalex.org/W2010361633, https://openalex.org/W2583363792, https://openalex.org/W2952379095, https://openalex.org/W2951157731, https://openalex.org/W202508549, https://openalex.org/W2949956533, https://openalex.org/W2173732482, https://openalex.org/W2809649683, https://openalex.org/W2884435343, https://openalex.org/W2104846587, https://openalex.org/W2039618894, https://openalex.org/W1599106035, https://openalex.org/W2096525273, https://openalex.org/W1907868928, https://openalex.org/W2011210839, https://openalex.org/W2030449718, https://openalex.org/W6247929, https://openalex.org/W2278452282, https://openalex.org/W1590719947, https://openalex.org/W2198888083, https://openalex.org/W2103441770, https://openalex.org/W2888300707, https://openalex.org/W1985174631, https://openalex.org/W2105656684, https://openalex.org/W1582690601, https://openalex.org/W2786984148, https://openalex.org/W1949346071 |
| referenced_works_count | 33 |
| abstract_inverted_index., | 137 |
| abstract_inverted_index.a | 80, 132, 165, 186 |
| abstract_inverted_index.In | 23, 48, 127 |
| abstract_inverted_index.It | 94 |
| abstract_inverted_index.To | 178 |
| abstract_inverted_index.We | 147, 215 |
| abstract_inverted_index.an | 149 |
| abstract_inverted_index.as | 107 |
| abstract_inverted_index.be | 225 |
| abstract_inverted_index.by | 60 |
| abstract_inverted_index.de | 54, 173 |
| abstract_inverted_index.in | 7, 20, 74, 154 |
| abstract_inverted_index.is | 5, 95, 139 |
| abstract_inverted_index.of | 37, 46, 52, 82, 86, 102, 112, 120, 144, 189, 220 |
| abstract_inverted_index.on | 84, 199 |
| abstract_inverted_index.to | 33, 41, 116, 122, 141, 151, 164 |
| abstract_inverted_index.up | 150, 163 |
| abstract_inverted_index.we | 130, 184 |
| abstract_inverted_index.29% | 152 |
| abstract_inverted_index.68% | 166 |
| abstract_inverted_index.DNA | 2 |
| abstract_inverted_index.and | 10, 15, 40, 110, 162, 202, 205 |
| abstract_inverted_index.are | 19, 89 |
| abstract_inverted_index.can | 114, 224 |
| abstract_inverted_index.for | 13, 43, 78, 172, 228 |
| abstract_inverted_index.has | 57, 65 |
| abstract_inverted_index.how | 100, 207 |
| abstract_inverted_index.key | 197 |
| abstract_inverted_index.new | 133, 222 |
| abstract_inverted_index.not | 91, 98 |
| abstract_inverted_index.our | 180, 221 |
| abstract_inverted_index.put | 179 |
| abstract_inverted_index.set | 81 |
| abstract_inverted_index.the | 50, 71, 103, 108, 118, 124, 158, 169, 212, 218 |
| abstract_inverted_index.top | 85 |
| abstract_inverted_index.BRWT | 160 |
| abstract_inverted_index.also | 96 |
| abstract_inverted_index.been | 31, 58, 66 |
| abstract_inverted_index.both | 200 |
| abstract_inverted_index.data | 4, 18, 28, 39, 204, 209 |
| abstract_inverted_index.five | 190 |
| abstract_inverted_index.good | 67 |
| abstract_inverted_index.have | 30 |
| abstract_inverted_index.help | 115 |
| abstract_inverted_index.high | 21 |
| abstract_inverted_index.into | 182 |
| abstract_inverted_index.over | 157, 168 |
| abstract_inverted_index.sets | 36 |
| abstract_inverted_index.show | 148, 216 |
| abstract_inverted_index.such | 17, 87, 106 |
| abstract_inverted_index.that | 217 |
| abstract_inverted_index.this | 128 |
| abstract_inverted_index.While | 63 |
| abstract_inverted_index.allow | 42 |
| abstract_inverted_index.basic | 159 |
| abstract_inverted_index.clear | 99 |
| abstract_inverted_index.data, | 105 |
| abstract_inverted_index.data. | 146 |
| abstract_inverted_index.graph | 27, 73, 125, 175 |
| abstract_inverted_index.input | 104, 145 |
| abstract_inverted_index.kinds | 143 |
| abstract_inverted_index.label | 176 |
| abstract_inverted_index.large | 35 |
| abstract_inverted_index.small | 75 |
| abstract_inverted_index.still | 90 |
| abstract_inverted_index.there | 64 |
| abstract_inverted_index.which | 138 |
| abstract_inverted_index.work, | 129 |
| abstract_inverted_index.Bruijn | 55, 174 |
| abstract_inverted_index.choice | 119 |
| abstract_inverted_index.graphs | 56, 88 |
| abstract_inverted_index.inform | 117 |
| abstract_inverted_index.labels | 83 |
| abstract_inverted_index.method | 121, 223 |
| abstract_inverted_index.public | 8 |
| abstract_inverted_index.recent | 24 |
| abstract_inverted_index.space, | 76 |
| abstract_inverted_index.concept | 51 |
| abstract_inverted_index.current | 170 |
| abstract_inverted_index.demand. | 22 |
| abstract_inverted_index.discuss | 206 |
| abstract_inverted_index.groups. | 62 |
| abstract_inverted_index.labeled | 53 |
| abstract_inverted_index.labels, | 113 |
| abstract_inverted_index.method, | 161 |
| abstract_inverted_index.methods | 77 |
| abstract_inverted_index.metrics | 198 |
| abstract_inverted_index.present | 131, 185 |
| abstract_inverted_index.results | 181 |
| abstract_inverted_index.several | 26, 61 |
| abstract_inverted_index.storing | 14, 79 |
| abstract_inverted_index.towards | 69 |
| abstract_inverted_index.Abstract | 0 |
| abstract_inverted_index.adaptive | 140 |
| abstract_inverted_index.analysis | 188 |
| abstract_inverted_index.compress | 123 |
| abstract_inverted_index.evaluate | 196 |
| abstract_inverted_index.explored | 59 |
| abstract_inverted_index.indexing | 16 |
| abstract_inverted_index.progress | 68 |
| abstract_inverted_index.proposed | 32 |
| abstract_inverted_index.querying | 45 |
| abstract_inverted_index.robustly | 226 |
| abstract_inverted_index.schemes, | 195 |
| abstract_inverted_index.sequence | 72 |
| abstract_inverted_index.sparsity | 109 |
| abstract_inverted_index.approach, | 135 |
| abstract_inverted_index.currently | 97 |
| abstract_inverted_index.datasets. | 232 |
| abstract_inverted_index.different | 142, 191, 208, 229 |
| abstract_inverted_index.efficient | 11, 44 |
| abstract_inverted_index.explored. | 93 |
| abstract_inverted_index.influence | 211 |
| abstract_inverted_index.labeling. | 126 |
| abstract_inverted_index.represent | 34 |
| abstract_inverted_index.research, | 25 |
| abstract_inverted_index.Multi-BRWT | 136 |
| abstract_inverted_index.annotation | 193 |
| abstract_inverted_index.approaches | 12 |
| abstract_inverted_index.artificial | 201 |
| abstract_inverted_index.real-world | 203, 231 |
| abstract_inverted_index.reproduced | 227 |
| abstract_inverted_index.sequences. | 47 |
| abstract_inverted_index.sequencing | 3, 38 |
| abstract_inverted_index.structures | 29 |
| abstract_inverted_index.systematic | 187 |
| abstract_inverted_index.compression | 134, 155, 194, 213 |
| abstract_inverted_index.improvement | 153, 167 |
| abstract_inverted_index.particular, | 49 |
| abstract_inverted_index.performance | 156 |
| abstract_inverted_index.accumulating | 6 |
| abstract_inverted_index.compression. | 177 |
| abstract_inverted_index.correlations | 111 |
| abstract_inverted_index.improvements | 219 |
| abstract_inverted_index.performance. | 214 |
| abstract_inverted_index.perspective, | 183 |
| abstract_inverted_index.representing | 70 |
| abstract_inverted_index.sufficiently | 92 |
| abstract_inverted_index.repositories, | 9 |
| abstract_inverted_index.representative | 230 |
| abstract_inverted_index.High-throughput | 1 |
| abstract_inverted_index.characteristics | 101, 210 |
| abstract_inverted_index.state-of-the-art | 171, 192 |
| cited_by_percentile_year.max | 94 |
| cited_by_percentile_year.min | 89 |
| countries_distinct_count | 1 |
| institutions_distinct_count | 6 |
| citation_normalized_percentile.value | 0.54483772 |
| citation_normalized_percentile.is_in_top_1_percent | False |
| citation_normalized_percentile.is_in_top_10_percent | False |