Overcoming Limitations of GPGPU-Computing in Scientific Applications
May 10, 2019 ยท Declared Dead ยท ๐ IEEE Conference on High Performance Extreme Computing
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Connor Kenyon, Glenn Volkema, Gaurav Khanna
arXiv ID
1905.05175
Category
physics.comp-ph
Cross-listed
cs.DC
Citations
1
Venue
IEEE Conference on High Performance Extreme Computing
Last Checked
1 month ago
Abstract
The performance of discrete general purpose graphics processing units (GPGPUs) has been improving at a rapid pace. The PCIe interconnect that controls the communication of data between the system host memory and the GPU has not improved as quickly, leaving a gap in performance due to GPU downtime while waiting for PCIe data transfer. In this article, we explore two alternatives to the limited PCIe bandwidth, NVIDIA NVLink interconnect, and zero-copy algorithms for shared memory Heterogeneous System Architecture (HSA) devices. The OpenCL SHOC benchmark suite is used to measure the performance of each device on various scientific application kernels.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ physics.comp-ph
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
Deep Potential Molecular Dynamics: a scalable model with the accuracy of quantum mechanics
R.I.P.
๐ป
Ghosted
Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS
R.I.P.
๐ป
Ghosted
By-passing the Kohn-Sham equations with machine learning
R.I.P.
๐ป
Ghosted
Machine Learning of coarse-grained Molecular Dynamics Force Fields
R.I.P.
๐ป
Ghosted
Towards Physics-informed Deep Learning for Turbulent Flow Prediction
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Language Models are Few-Shot Learners
R.I.P.
๐ป
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted