R.I.P.
๐ป
Ghosted
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
October 08, 2023 ยท Entered Twilight ยท ๐ International Conference on Machine Learning
Repo contents: .gitignore, CMakeLists.txt, Dockerfile, README.md, api.md, compile.sh, examples, gear, icon.jpg, include, libgear, requirements.txt, setup.py, src, tests, third-party
Authors
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai
arXiv ID
2310.05205
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.DC
Citations
5
Venue
International Conference on Machine Learning
Repository
https://github.com/bigrl-team/gear
โญ 18
Last Checked
1 month ago
Abstract
This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed to perform scalable reinforcement learning (RL) with large sequence models (such as transformers). With such models, existing systems such as Reverb face considerable bottlenecks in memory, computation, and communication. GEAR, however, optimizes memory efficiency by enabling the memory resources on GPU servers (including host memory and device memory) to manage trajectory data. Furthermore, it facilitates decentralized GPU devices to expedite various trajectory selection strategies, circumventing computational bottlenecks. GEAR is equipped with GPU kernels capable of collecting trajectories using zero-copy access to host memory, along with remote-directed-memory access over InfiniBand, improving communication efficiency. Cluster experiments have shown that GEAR can achieve performance levels up to 6x greater than Reverb when training state-of-the-art large RL models. GEAR is open-sourced at https://github.com/bigrl-team/gear.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
R.I.P.
๐ป
Ghosted
Semi-Supervised Classification with Graph Convolutional Networks
R.I.P.
๐ป
Ghosted
Proximal Policy Optimization Algorithms
R.I.P.
๐ป
Ghosted