A resource-frugal probabilistic dictionary and applications in (meta)genomics

May 26, 2016 Β· Declared Dead Β· πŸ› Prague Stringology Conference

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Camille Marchet, Antoine Limasset, Lucie Bittner, Pierre Peterlongo arXiv ID 1605.08319 Category cs.DS: Data Structures & Algorithms Cross-listed q-bio.GN Citations 12 Venue Prague Stringology Conference Last Checked 4 months ago
Abstract
Genomic and metagenomic fields, generating huge sets of short genomic sequences, brought their own share of high performance problems. To extract relevant pieces of information from the huge data sets generated by current sequencing techniques, one must rely on extremely scalable methods and solutions. Indexing billions of objects is a task considered too expensive while being a fundamental need in this field. In this paper we propose a straightforward indexing structure that scales to billions of element and we propose two direct applications in genomics and metagenomics. We show that our proposal solves problem instances for which no other known solution scales-up. We believe that many tools and applications could benefit from either the fundamental data structure we provide or from the applications developed from this structure.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Data Structures & Algorithms

Died the same way β€” πŸ‘» Ghosted