Efficient Random Sampling -- Parallel, Vectorized, Cache-Efficient, and Online

October 17, 2016 Β· Declared Dead Β· πŸ› ACM Transactions on Mathematical Software

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Peter Sanders, Sebastian Lamm, Lorenz HΓΌbschle-Schneider, Emanuel Schrade, Carsten Dachsbacher arXiv ID 1610.05141 Category cs.DS: Data Structures & Algorithms Cross-listed cs.DC, cs.MS Citations 30 Venue ACM Transactions on Mathematical Software Last Checked 3 months ago
Abstract
We consider the problem of sampling $n$ numbers from the range $\{1,\ldots,N\}$ without replacement on modern architectures. The main result is a simple divide-and-conquer scheme that makes sequential algorithms more cache efficient and leads to a parallel algorithm running in expected time $\mathcal{O}(n/p+\log p)$ on $p$ processors, i.e., scales to massively parallel machines even for moderate values of $n$. The amount of communication between the processors is very small (at most $\mathcal{O}(\log p)$) and independent of the sample size. We also discuss modifications needed for load balancing, online sampling, sampling with replacement, Bernoulli sampling, and vectorization on SIMD units or GPUs.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Data Structures & Algorithms

Died the same way β€” πŸ‘» Ghosted