Novelty-based Sample Reuse for Continuous Robotics Control

October 17, 2024 ยท Declared Dead ยท ๐Ÿ› IEEE International Conference on Robotics and Biomimetics

๐Ÿ’€ CAUSE OF DEATH: 404 Not Found
Code link is broken/dead
Authors Ke Duan, Kai Yang, Houde Liu, Xueqian Wang arXiv ID 2410.13490 Category cs.RO: Robotics Cross-listed cs.LG Citations 0 Venue IEEE International Conference on Robotics and Biomimetics Repository https://github.com/ppksigs/NSR-DDPG-HER Last Checked 2 months ago
Abstract
In reinforcement learning, agents collect state information and rewards through environmental interactions, essential for policy refinement. This process is notably time-consuming, especially in complex robotic simulations and real-world applications. Traditional algorithms usually re-engage with the environment after processing a single batch of samples, thereby failing to fully capitalize on historical data. However, frequently observed states, with reliable value estimates, require minimal updates; in contrast, rare observed states necessitate more intensive updates for achieving accurate value estimations. To address uneven sample utilization, we propose Novelty-guided Sample Reuse (NSR). NSR provides extra updates for infrequent, novel states and skips additional updates for frequent states, maximizing sample use before interacting with the environment again. Our experiments show that NSR improves the convergence rate and success rate of algorithms without significantly increasing time consumption. Our code is publicly available at https://github.com/ppksigs/NSR-DDPG-HER.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Robotics

Died the same way โ€” ๐Ÿ’€ 404 Not Found