R.I.P.
👻
Ghosted
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
October 30, 2023 · Entered Twilight · International Conference on Machine Learning
Repo contents: .gitignore, LICENSE, README.md, assets, environment.yml, multiworld, scripts, setup.py, vcrl
Authors
Seongun Kim, Kyowoon Lee, Jaesik Choi
arXiv ID
2310.19424
Category
cs.LG: Machine Learning
Cross-listed
cs.AI, cs.RO
Citations
16
Venue
International Conference on Machine Learning
Repository
https://github.com/seongun-kim/vcrl
⭐ 12
Last Checked
1 month ago
Abstract
Mutual information-based reinforcement learning (RL) has been proposed as a promising framework for retrieving complex skills autonomously without a task-oriented reward function through mutual information (MI) maximization or variational empowerment. However, learning complex skills is still challenging, due to the fact that the order of training skills can largely affect sample efficiency. Inspired by this, we recast variational empowerment as curriculum learning in goal-conditioned RL with an intrinsic reward function, which we name Variational Curriculum RL (VCRL). From this perspective, we propose a novel approach to unsupervised skill discovery based on information theory, called Value Uncertainty Variational Curriculum (VUVC). We prove that, under regularity conditions, VUVC accelerates the increase of entropy in the visited states compared to the uniform curriculum. We validate the effectiveness of our approach on complex navigation and robotic manipulation tasks in terms of sample efficiency and state coverage speed. We also demonstrate that the skills discovered by our method successfully complete a real-world robot navigation task in a zero-shot setup and that incorporating these skills with a global planner further increases the performance.
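The abstract frames skill discovery as a curriculum over goals and contrasts a value-uncertainty curriculum with a uniform one. The toy sketch below illustrates only that general idea: goals are sampled in proportion to how much an ensemble of value estimates disagrees about them. It is not the authors' VUVC formulation. The function names, the use of standard deviation as the uncertainty measure, and the sample data are all assumptions made for illustration.

```python
import random
import statistics


def value_uncertainty(ensemble_values):
    """Population standard deviation of one goal's value estimates across an ensemble."""
    return statistics.pstdev(ensemble_values)


def curriculum_weights(values_per_goal, eps=1e-8):
    """Normalize per-goal uncertainty into a sampling distribution.

    eps (an assumed smoothing constant) keeps every goal reachable and
    avoids dividing by zero when the ensemble agrees on every goal.
    """
    scores = [value_uncertainty(v) + eps for v in values_per_goal]
    total = sum(scores)
    return [s / total for s in scores]


def sample_goal(goals, weights, rng):
    """Draw one training goal according to the curriculum weights."""
    return rng.choices(goals, weights=weights, k=1)[0]


# Hypothetical data: three candidate goals, each scored by a 3-member value ensemble.
goals = ["near", "mid", "far"]
values_per_goal = [
    [0.90, 0.90, 0.90],  # ensemble agrees
    [0.40, 0.60, 0.50],  # moderate disagreement
    [0.00, 0.50, 1.00],  # strong disagreement
]

weights = curriculum_weights(values_per_goal)
rng = random.Random(0)  # seeded so repeated runs draw the same goal
goal = sample_goal(goals, weights, rng)
```

A uniform curriculum would correspond to equal weights for all three goals. Here the "far" goal, where the hypothetical ensemble disagrees most, receives the largest sampling weight.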
Similar Papers
In the same crypt: Machine Learning
XGBoost: A Scalable Tree Boosting System
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Semi-Supervised Classification with Graph Convolutional Networks
Proximal Policy Optimization Algorithms