Space-Efficient Construction of Compressed Suffix Trees

August 12, 2019 Β· Declared Dead Β· πŸ› Theoretical Computer Science

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Nicola Prezza, Giovanna Rosone arXiv ID 1908.04686 Category cs.DS: Data Structures & Algorithms Citations 11 Venue Theoretical Computer Science Last Checked 4 months ago
Abstract
We show how to build several data structures of central importance to string processing, taking as input the Burrows-Wheeler transform (BWT) and using small extra working space. Let $n$ be the text length and $Οƒ$ be the alphabet size. We first provide two algorithms that enumerate all LCP values and suffix tree intervals in $O(n\logΟƒ)$ time using just $o(n\logΟƒ)$ bits of working space on top of the input BWT. Using these algorithms as building blocks, for any parameter $0 < Ξ΅\leq 1$ we show how to build the PLCP bitvector and the balanced parentheses representation of the suffix tree topology in $O\left(n(\logΟƒ+ Ξ΅^{-1}\cdot \log\log n)\right)$ time using at most $n\logΟƒ\cdot(Ξ΅+ o(1))$ bits of working space on top of the input BWT and the output. In particular, this implies that we can build a compressed suffix tree from the BWT using just succinct working space (i.e. $o(n\logΟƒ)$ bits) and any time in $Θ(n\logΟƒ) + Ο‰(n\log\log n)$. This improves the previous most space-efficient algorithms, which worked in $O(n)$ bits and $O(n\log n)$ time. We also consider the problem of merging BWTs of string collections, and provide a solution running in $O(n\logΟƒ)$ time and using just $o(n\logΟƒ)$ bits of working space. An efficient implementation of our LCP construction and BWT merge algorithms use (in RAM) as few as $n$ bits on top of a packed representation of the input/output and process data as fast as $2.92$ megabases per second.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Data Structures & Algorithms

Died the same way β€” πŸ‘» Ghosted