Packed Compact Tries: A Fast and Efficient Data Structure for Online String Processing
February 01, 2016 Β· Declared Dead Β· π International Workshop on Combinatorial Algorithms
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Takuya Takagi, Shunsuke Inenaga, Kunihiko Sadakane, Hiroki Arimura
arXiv ID
1602.00422
Category
cs.DS: Data Structures & Algorithms
Citations
22
Venue
International Workshop on Combinatorial Algorithms
Last Checked
3 months ago
Abstract
In this paper, we present a new data structure called the packed compact trie (packed c-trie) which stores a set $S$ of $k$ strings of total length $n$ in $n \logΟ+ O(k \log n)$ bits of space and supports fast pattern matching queries and updates, where $Ο$ is the size of an alphabet. Assume that $Ξ±= \log_Οn$ letters are packed in a single machine word on the standard word RAM model, and let $f(k,n)$ denote the query and update times of the dynamic predecessor/successor data structure of our choice which stores $k$ integers from universe $[1,n]$ in $O(k \log n)$ bits of space. Then, given a string of length $m$, our packed c-tries support pattern matching queries and insert/delete operations in $O(\frac{m}Ξ± f(k,n))$ worst-case time and in $O(\frac{m}Ξ± + f(k,n))$ expected time. Our experiments show that our packed c-tries are faster than the standard compact tries (a.k.a. Patricia trees) on real data sets. As an application of our packed c-trie, we show that the sparse suffix tree for a string of length $n$ over prefix codes with $k$ sampled positions, such as evenly-spaced and word delimited sparse suffix trees, can be constructed online in $O((\frac{n}Ξ± + k) f(k,n))$ worst-case time and $O(\frac{n}Ξ± + k f(k,n))$ expected time with $n \log Ο+ O(k \log n)$ bits of space. When $k = O(\frac{n}Ξ±)$, by using the state-of-the-art dynamic predecessor/successor data structures, we obtain sub-linear time construction algorithms using only $O(\frac{n}Ξ±)$ bits of space in both cases. We also discuss an application of our packed c-tries to online LZD factorization.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted