GateKeeper: A New Hardware Architecture for Accelerating Pre-Alignment in DNA Short Read Mapping
April 06, 2016 · Entered Twilight · Bioinform.
"Last commit was 6.0 years ago (≥5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: FPGA Application, FPGA_Application_v2, Host Application, Host_Application_v2, LICENSE, README.md
Authors
Mohammed Alser, Hasan Hassan, Hongyi Xin, Oğuz Ergin, Onur Mutlu, Can Alkan
arXiv ID
1604.01789
Category
q-bio.GN
Cross-listed
cs.AR, cs.DS
Citations
116
Venue
Bioinform.
Repository
https://github.com/BilkentCompGen/GateKeeper
⭐ 21
Last Checked
1 month ago
Abstract
Motivation: High throughput DNA sequencing (HTS) technologies generate an excessive number of small DNA segments -- called short reads -- that cause significant computational burden. To analyze the entire genome, each of the billions of short reads must be mapped to a reference genome based on the similarity between a read and "candidate" locations in that reference genome. The similarity measurement, called alignment, formulated as an approximate string matching problem, is the computational bottleneck because: (1) it is implemented using quadratic-time dynamic programming algorithms, and (2) the majority of candidate locations in the reference genome do not align with a given read due to high dissimilarity. Calculating the alignment of such incorrect candidate locations consumes an overwhelming majority of a modern read mapper's execution time. Therefore, it is crucial to develop a fast and effective filter that can detect incorrect candidate locations and eliminate them before invoking computationally costly alignment operations.
Results: We propose GateKeeper, a new hardware accelerator that functions as a pre-alignment step that quickly filters out most incorrect candidate locations. GateKeeper is the first design to accelerate pre-alignment using Field-Programmable Gate Arrays (FPGAs), which can perform pre-alignment much faster than software. GateKeeper can be integrated with any mapper that performs sequence alignment for verification. When implemented on a single FPGA chip, GateKeeper maintains high accuracy (on average >96%) while providing up to 90-fold and 130-fold speedup over the state-of-the-art software pre-alignment techniques, Adjacency Filter and Shifted Hamming Distance (SHD), respectively. The addition of GateKeeper as a pre-alignment step can reduce the verification time of the mrFAST mapper by a factor of 10.
Availability: https://github.com/BilkentCompGen/GateKeeper
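The filter-then-verify idea described in the abstract can be illustrated with a small software sketch. This is not GateKeeper's FPGA design and not the Adjacency Filter or SHD algorithms. It assumes a deliberately simple stand-in filter (the bag distance, a known lower bound on edit distance) placed in front of a quadratic-time dynamic programming verification step. The function names `edit_distance`, `bag_distance`, and `map_read`, and the `max_edits` parameter, are hypothetical and chosen only for illustration.

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Quadratic-time dynamic programming (Levenshtein) edit distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # match or substitution
        prev = curr
    return prev[-1]


def bag_distance(a: str, b: str) -> int:
    """Cheap lower bound on edit distance based on character counts.

    Stand-in filter for illustration only; assumed here, not taken
    from the paper.
    """
    ca, cb = Counter(a), Counter(b)
    return max(sum((ca - cb).values()), sum((cb - ca).values()))


def map_read(read: str, candidates: list[str], max_edits: int):
    """Return (accepted candidates, number of costly verifications run)."""
    accepted = []
    verified = 0
    for cand in candidates:
        # Pre-alignment step: a lower bound above the threshold proves
        # the candidate cannot align, so the DP call is skipped.
        if bag_distance(read, cand) > max_edits:
            continue
        verified += 1
        if edit_distance(read, cand) <= max_edits:
            accepted.append(cand)
    return accepted, verified


if __name__ == "__main__":
    read = "ACGTACGT"
    candidates = ["ACGTACGT", "ACGTTCGT", "GGGGGGGG", "TTTTAAAA"]
    print(map_read(read, candidates, max_edits=2))
```

Because the bag distance never exceeds the true edit distance, this particular filter cannot reject a candidate that would have passed verification; it can only let some incorrect candidates through to the slower step. A filter of this kind pays off when most candidates are dissimilar, which is the situation the abstract describes.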
Similar Papers
In the same crypt: q-bio.GN
R.I.P. · Ghosted
R.I.P. · Ghosted
Accurate Genomic Prediction Of Human Height
R.I.P. · Ghosted
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
R.I.P. · Ghosted
Tasks, Techniques, and Tools for Genomic Data Visualization
Old Age
Spaced seeds improve k-mer-based metagenomic classification
R.I.P. · Ghosted