Energy-based Legged Robots Terrain Traversability Modeling via Deep Inverse Reinforcement Learning

July 07, 2022 ยท Entered Twilight ยท ๐Ÿ› IEEE Robotics and Automation Letters

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: .gitignore, LICENSE, README.md, baseline_test.py, baseline_train.py, demo.ipynb, demo.py, environment.yml, example_data, loader, maxent_kinematic.py, maxent_nonlinear_offroad.py, maxent_nonlinear_offroad_rank.py, mdp, network, requirements.txt, scripts, test.py, test_rank.py, train.py, train_rank.py, viz.py

Authors Lu Gan, Jessy W. Grizzle, Ryan M. Eustice, Maani Ghaffari arXiv ID 2207.03034 Category cs.RO: Robotics Citations 35 Venue IEEE Robotics and Automation Letters Repository https://github.com/ganlumomo/minicheetah-traversability-irl โญ 22 Last Checked 1 month ago
Abstract
This work reports on developing a deep inverse reinforcement learning method for legged robots terrain traversability modeling that incorporates both exteroceptive and proprioceptive sensory data. Existing works use robot-agnostic exteroceptive environmental features or handcrafted kinematic features; instead, we propose to also learn robot-specific inertial features from proprioceptive sensory data for reward approximation in a single deep neural network. Incorporating the inertial features can improve the model fidelity and provide a reward that depends on the robot's state during deployment. We train the reward network using the Maximum Entropy Deep Inverse Reinforcement Learning (MEDIRL) algorithm and propose simultaneously minimizing a trajectory ranking loss to deal with the suboptimality of legged robot demonstrations. The demonstrated trajectories are ranked by locomotion energy consumption, in order to learn an energy-aware reward function and a more energy-efficient policy than demonstration. We evaluate our method using a dataset collected by an MIT Mini-Cheetah robot and a Mini-Cheetah simulator. The code is publicly available at https://github.com/ganlumomo/minicheetah-traversability-irl.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Robotics