Acceleration of tensor-product operations for high-order finite element methods
November 02, 2017 Β· Declared Dead Β· π The international journal of high performance computing applications
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Kasia Εwirydowicz, Noel Chalmers, Ali Karakus, Timothy Warburton
arXiv ID
1711.00903
Category
cs.MS: Mathematical Software
Cross-listed
cs.DC,
cs.PF,
math.NA
Citations
65
Venue
The international journal of high performance computing applications
Last Checked
1 month ago
Abstract
This paper is devoted to GPU kernel optimization and performance analysis of three tensor-product operators arising in finite element methods. We provide a mathematical background to these operations and implementation details. Achieving close-to-the-peak performance for these operators requires extensive optimization because of the operators' properties: low arithmetic intensity, tiered structure, and the need to store intermediate results inside the kernel. We give a guided overview of optimization strategies and we present a performance model that allows us to compare the efficacy of these optimizations against an empirically calibrated roofline.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Mathematical Software
π
π
Old Age
π
π
Old Age
CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication
R.I.P.
π»
Ghosted
Mathematical Foundations of the GraphBLAS
R.I.P.
π»
Ghosted
The DUNE Framework: Basic Concepts and Recent Developments
R.I.P.
π»
Ghosted
Format Abstraction for Sparse Tensor Algebra Compilers
R.I.P.
π»
Ghosted
AMReX: Block-Structured Adaptive Mesh Refinement for Multiphysics Applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Language Models are Few-Shot Learners
R.I.P.
π»
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
π»
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
π»
Ghosted