Speculative Segmented Sum for Sparse Matrix-Vector Multiplication on Heterogeneous Processors

April 24, 2015 ยท Entered Twilight ยท ๐Ÿ› Parallel Computing

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 10.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, spmv_cuda, spmv_opencl_amd, spmv_opencl_intel

Authors Weifeng Liu, Brian Vinter arXiv ID 1504.06474 Category cs.MS: Mathematical Software Cross-listed cs.DC, math.NA Citations 70 Venue Parallel Computing Repository https://github.com/bhSPARSE/Benchmark_SpMV_using_CSR โญ 26 Last Checked 1 month ago
Abstract
Sparse matrix-vector multiplication (SpMV) is a central building block for scientific software and graph applications. Recently, heterogeneous processors composed of different types of cores attracted much attention because of their flexible core configuration and high energy efficiency. In this paper, we propose a compressed sparse row (CSR) format based SpMV algorithm utilizing both types of cores in a CPU-GPU heterogeneous processor. We first speculatively execute segmented sum operations on the GPU part of a heterogeneous processor and generate a possibly incorrect results. Then the CPU part of the same chip is triggered to re-arrange the predicted partial sums for a correct resulting vector. On three heterogeneous processors from Intel, AMD and nVidia, using 20 sparse matrices as a benchmark suite, the experimental results show that our method obtains significant performance improvement over the best existing CSR-based SpMV algorithms. The source code of this work is downloadable at https://github.com/bhSPARSE/Benchmark_SpMV_using_CSR
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Mathematical Software