Explanipedia

CompactLTJ: Space & Time Efficient Leapfrog Triejoin on Graph Databases Open

Diego Arroyuelo, Daniela Campos, Adrián Gómez‐Brandón, Yuval Linker, Gonzalo Navarro, et al. · 2025

Leapfrog Triejoin (LTJ) is arguably the most practical and popular worst-case-optimal (wco) algorithm for solving basic graph patterns in graph databases. Its main drawback is that it needs the database triples (subject, predicate, object)…

Smallest Suffixient Sets: Effectiveness, Resilience, and Calculation Open

Gonzalo Navarro, Cristian Urbina · 2025

A suffixient set is a novel combinatorial object that captures the essential information of repetitive strings in a way that, provided with a random access mechanism, supports various forms of pattern matching. In this paper, we study the …

Practical Adaptive Dynamic Bitvectors Open

Gonzalo Navarro · 2025

Introduction: While operations rank and select on static bitvectors can be supported in constant time, lower bounds show that this is impossible when supporting updates; practical implementations offer time for the operations, which is clos…

Worst-Case-Optimal Joins on Graphs with Topological Relations Open

José Fuentes‐Sepúlveda, Adrián Gómez‐Brandón, Aidan Hogan, Ayleen Irribarra-Cortés, Gonzalo Navarro, et al. · 2025

PD155 RedETS Horizon Scanning: Impact In The Decision-Making Process Open

Janet Puñal-Riobóo, Ignacio López-Loureiro, María del Carmen Maceira Rozas, Beatriz Casal Acción, María José Faraldo Vallés, et al. · 2024

Introduction: The RedETS horizon scanning (HS) program in Spain is focused on identifying non-pharmaceutical emerging health technologies. HS is organized in three steps: (i) identification using different sources (PubMed, the biomedical pr…

Computing MEMs and Relatives on Repetitive Text Collections Open

Gonzalo Navarro · 2024

We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1\mathinner{..}m]$ on a large repetitive text collection $T[1\mathinner{..}n]$ over an alphabet of size $\sigma$, which is represented as …

Fast and Small Subsampled R-indexes Open

Dustin Cobas, Travis Gagie, Gonzalo Navarro · 2024

The $r$-index represented a breakthrough in compressed indexing of repetitive text collections, outperforming its alternatives by orders of magnitude in query time. Its space usage, $O(r)$ where $r$ is the number of runs in the Burrows--Wh…

Faster run-length compressed suffix arrays Open

Travis Gagie, Giovanni Manzini, Gonzalo Navarro, Marinella Sciortino · 2024

We first review how we can store a run-length compressed suffix array (RLCSA) for a text $T$ of length $n$ over an alphabet of size $σ$ whose Burrows-Wheeler Transform (BWT) consists of $r$ runs in $O \left( \rule{0ex}{2ex} r \log (n / r) …

New Compressed Indices for Multijoins on Graph Databases Open

Diego Arroyuelo, Fabrizio Barisione, Antonio Fariña, Adrián Gómez‐Brandón, Gonzalo Navarro · 2024

A recent surprising result in the implementation of worst-case-optimal (wco) multijoins in graph databases (specifically, basic graph patterns) is that they can be supported on graph representations that take even less space than a plain r…

Counting on General Run-Length Grammars Open

Gonzalo Navarro, Alejandro Pacheco · 2024

We introduce a data structure for counting pattern occurrences in texts compressed with any run-length context-free grammar. Our structure uses space proportional to the grammar size and counts the occurrences of a pattern of length $m$ in…

Generalized Straight-Line Programs Open

Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina · 2024

It was recently proved that any Straight-Line Program (SLP) generating a given string can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We generalize this proof to a general class of grammars we…

Faster Maximal Exact Matches with Lazy LCP Evaluation Open

Adrián Goga, Lore Depuydt, Nathaniel K. Brown, Jan Fostier, Travis Gagie, et al. · 2024

MONI (Rossi et al., JCB 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly repetitive text (usually a database of g…

A Textbook Solution for Dynamic Strings Open

Zsuzsanna Lipták, Francesco Masillo, Gonzalo Navarro · 2024

We consider the problem of maintaining a collection of strings while efficiently supporting splits and concatenations on them, as well as comparing two substrings, and computing the longest common prefix between two suffixes. This problem …

BAT-LZ Out of Hell Open

Zsuzsanna Lipták, Francesco Masillo, Gonzalo Navarro · 2024

Despite consistently yielding the best compression on repetitive text collections, the Lempel-Ziv parsing has resisted all attempts at offering relevant guarantees on the cost to access an arbitrary symbol. This makes it less attractive fo…

Worst-Case-Optimal Similarity Joins on Graph Databases Open

Diego Arroyuelo, Benjamín Bustos, Adrián Gómez‐Brandón, Aidan Hogan, Gonzalo Navarro, et al. · 2024

We extend the concept of worst-case optimal equijoins in graph databases to the case where some nodes are required to be within the k-nearest neighbors (kNN) of others under some similarity function. We model the problem by superimposing t…

Iterated Straight-Line Programs Open

Gonzalo Navarro, C. Urbina · 2024

We explore an extension to straight-line programs (SLPs) that outperforms, for some text families, the measure $δ$ based on substring complexity, a lower bound for most measures and compressors exploiting repetitiveness (which are crucial …

Stronger compact representations of object trajectories Open

Adrián Gómez‐Brandón, Gonzalo Navarro, José R. Paramá, Nieves R. Brisaboa, Travis Gagie · 2024

[Abstract]: GraCT and ContaCT were the first compressed data structures to represent object trajectories, demonstrating that it was possible to use orders of magnitude less space than classical indexes while staying competitive in query t…

Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests Open

Dominika Draesslerová, Omar Ahmed, Travis Gagie, Jan Holub, Ben Langmead, et al. · 2024

For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular c…

The Ring: Worst-case Optimal Joins in Graph Databases using (Almost) No Extra Space Open

Diego Arroyuelo, Adrián Gómez‐Brandón, Aidan Hogan, Gonzalo Navarro, Juan L. Reutter, et al. · 2024

We present an indexing scheme for triple-based graphs that supports join queries in worst-case optimal (wco) time within compact space. This scheme, called a ring, regards each triple as a cyclic string of length 3. Each rotation of the t…

Taxonomic Classification with Maximal Exact Matches in KATKA Kernels and Minimizer Digests Open

Dominika Draesslerová, Omar Ahmed, Travis Gagie, Jan Holub, Ben Langmead, et al. · 2024

For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular c…

Suffixient Sets Open

Omar Ahmed, Andrej Baláž, Nathaniel K. Brown, Lore Depuydt, Adrián Goga, et al. · 2023

We define a suffixient set for a text $T [1..n]$ to be a set $S$ of positions between 1 and $n$ such that, for any edge descending from a node $u$ to a node $v$ in the suffix tree of $T$, there is an element $s \in S$ such that $u$'s path …

Faster Maximal Exact Matches with Lazy LCP Evaluation Open

Adrián Goga, Lore Depuydt, Nathaniel K. Brown, Jan Fostier, Travis Gagie, et al. · 2023

MONI (Rossi et al., JCB 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly repetitive text (usually a database of ge…

Optimizing RPQs over a compact graph representation Open

Diego Arroyuelo, Adrián Gómez‐Brandón, Aidan Hogan, Gonzalo Navarro, Javiel Rojas-Ledesma · 2023

Efficient construction of the BWT for repetitive text using string compression Open

Diego Diaz-Domínguez, Gonzalo Navarro · 2023

Dynamic Compact Data Structure for Temporal Reachability with Unsorted Contact Insertions Open

Luiz Fernando Afra Brito, Marcelo Keese Albertini, Bruno Augusto Nassif Travençolo, Gonzalo Navarro · 2023

Temporal graphs represent interactions between entities over time. Deciding whether entities can reach each other through temporal paths is useful for various applications such as in communication networks and epidemiology. Previous works …

Wheeler maps Open

Andrej Baláž, Travis Gagie, Adrián Goga, Simon Heumos, Gonzalo Navarro, et al. · 2023

Motivated by challenges in pangenomic read alignment, we propose a generalization of Wheeler graphs that we call Wheeler maps. A Wheeler map stores a text $T[1..n]$ and an assignment of tags to the characters of $T$ such that we can prepro…

Evaluating Regular Path Queries on Compressed Adjacency Matrices Open

Diego Arroyuelo, Adrián Gómez‐Brandón, Gonzalo Navarro · 2023

Regular Path Queries (RPQs), which are essentially regular expressions to be matched against the labels of paths in labeled graphs, are at the core of graph database query languages like SPARQL. A way to solve RPQs is to translate them int…

MillenniumDB: An Open-Source Graph Database System Open

Domagoj Vrgoč, Carlos Rojas, Renzo Angles, Marcelo Arenas, Diego Arroyuelo, et al. · 2023

In this systems paper, we present MillenniumDB: a novel graph database engine that is modular, persistent, and open source. MillenniumDB is based on a graph data model, which we call domain graphs, that provides a simple abstraction upon w…

Maintaining the cycle structure of dynamic permutations Open

Zsuzsanna Lipták, Francesco Masillo, Gonzalo Navarro · 2023

We present a new data structure for maintaining dynamic permutations, which we call a forest of splay trees (FST). The FST allows one to efficiently maintain the cycle structure of a permutation $π$ when the allowed updates are …

Compact representations of spatial hierarchical structures with support for topological queries Open

José Fuentes‐Sepúlveda, Diego Gatica, Gonzalo Navarro, M. Andrea Rodríguez, Diego Seco · 2023
