GPU Optimization of Lattice Boltzmann Method with Local Ensemble Transform Kalman Filter

August 07, 2023 · Declared Dead · 🏛 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH)

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Yuta Hasegawa, Toshiyuki Imamura, Takuya Ina, Naoyuki Onodera, Yuuichi Asahi, Yasuhiro Idomura arXiv ID 2308.03310 Category physics.flu-dyn Cross-listed cs.DC, physics.comp-ph Citations 2 Venue 2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH) Last Checked 1 month ago

Abstract

The ensemble data assimilation of computational fluid dynamics simulations based on the lattice Boltzmann method (LBM) and the local ensemble transform Kalman filter (LETKF) is implemented and optimized on a GPU supercomputer based on NVIDIA A100 GPUs. To connect the LBM and LETKF parts, data transpose communication is optimized by overlapping computation, file I/O, and communication based on data dependency in each LETKF kernel. In two dimensional forced isotropic turbulence simulations with the ensemble size of $M=64$ and the number of grid points of $N_x=128^2$, the optimized implementation achieved $\times3.80$ speedup from the naive implementation, in which the LETKF part is not parallelized. The main computing kernel of the local problem is the eigenvalue decomposition (EVD) of $M\times M$ real symmetric dense matrices, which is computed by a newly developed batched EVD in $\verb|EigenG|$. The batched EVD in $\verb|EigenG|$ outperforms that in $\verb|cuSOLVER|$, and $\times65.3$ speedup was achieved.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — physics.flu-dyn

R.I.P. 👻 Ghosted

Deep Learning of Vortex Induced Vibrations

Maziar Raissi, Zhicheng Wang, ... (+2 more)

physics.flu-dyn 🏛 J.FM 📚 419 cites 7 years ago

R.I.P. 👻 Ghosted

Efficient collective swimming by harnessing vortices through deep reinforcement learning

Siddhartha Verma, Guido Novati, Petros Koumoutsakos

physics.flu-dyn 🏛 PNAS 📚 405 cites 8 years ago

R.I.P. 👻 Ghosted

NVIDIA SimNet^{TM}: an AI-accelerated multi-physics simulation framework

Oliver Hennigh, Susheela Narasimhan, ... (+8 more)

physics.flu-dyn 🏛 arXiv 📚 135 cites 5 years ago

R.I.P. 👻 Ghosted

Teaching the Incompressible Navier-Stokes Equations to Fast Neural Surrogate Models in 3D

Nils Wandel, Michael Weinmann, Reinhard Klein

physics.flu-dyn 🏛 The Physics of Fluids 📚 60 cites 5 years ago

R.I.P. 👻 Ghosted

Prediction of Reynolds Stresses in High-Mach-Number Turbulent Boundary Layers using Physics-Informed Machine Learning

Jian-Xun Wang, Junji Huang, ... (+2 more)

physics.flu-dyn 🏛 Theoretical and Computational Fluid Dynamics 📚 59 cites 7 years ago

R.I.P. 👻 Ghosted

From Deep to Physics-Informed Learning of Turbulence: Diagnostics

Ryan King, Oliver Hennigh, ... (+2 more)

physics.flu-dyn 🏛 arXiv 📚 58 cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago