Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

April 24, 2023 · Declared Dead · 🏛 GECCO Companion

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Simón C. Smith, Bryan Lim, Hannah Janmohamed, Antoine Cully
arXiv ID: 2304.12080
Category: cs.RO: Robotics
Cross-listed: cs.AI, cs.NE
Citations: 3
Venue: GECCO Companion
Last Checked: 3 months ago

Abstract

Learning algorithms, like Quality-Diversity (QD), can be used to acquire repertoires of diverse robotics skills. This learning is commonly done via computer simulation due to the large number of evaluations required. However, training in a virtual environment generates a gap between simulation and reality. Here, we build upon the Reset-Free QD (RF-QD) algorithm to learn controllers directly on a physical robot. This method uses a dynamics model, learned from interactions between the robot and the environment, to predict the robot's behaviour and improve sample efficiency. A behaviour selection policy filters out uninteresting or unsafe policies predicted by the model. RF-QD also includes a recovery policy that returns the robot to a safe zone when it has walked outside of it, allowing continuous learning. We demonstrate that our method enables a physical quadruped robot to learn a repertoire of behaviours in two hours without human supervision. We successfully test the solution repertoire using a maze navigation task. Finally, we compare our approach to the MAP-Elites algorithm. We show that dynamics awareness and a recovery policy are required for training on a physical robot for optimal archive generation. Video available at https://youtu.be/BgGNvIsRh7Q
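The loop described in the abstract (a learned dynamics model, a selection step that filters candidates by their predicted behaviour, and a recovery step in place of manual resets) can be illustrated with a toy sketch. This is not the authors' implementation: the 2D point "robot", the per-axis gain model, the constants, and all names (`true_step`, `ScaleModel`, `predicted_ok`, `run`) are assumptions made for illustration, and the fitness (quality) side of QD is omitted so that the archive keeps only the first controller found per cell.

```python
import math
import random

SAFE_RADIUS = 2.0  # hypothetical safe-zone radius around the origin
CELL = 0.25        # hypothetical archive cell size in descriptor space


def true_step(theta, rng):
    """Stand-in for the physical robot: a scaled, noisy copy of theta."""
    return (0.8 * theta[0] + rng.gauss(0, 0.01),
            0.8 * theta[1] + rng.gauss(0, 0.01))


class ScaleModel:
    """Toy dynamics model: one gain per axis, fitted by least squares."""

    def __init__(self):
        self.sxy = [0.0, 0.0]  # running sum of theta_i * displacement_i
        self.sxx = [0.0, 0.0]  # running sum of theta_i ** 2

    def update(self, theta, disp):
        for i in range(2):
            self.sxy[i] += theta[i] * disp[i]
            self.sxx[i] += theta[i] ** 2

    def predict(self, theta):
        # Falls back to a gain of 1.0 before any data has been seen.
        gains = [self.sxy[i] / self.sxx[i] if self.sxx[i] > 0 else 1.0
                 for i in range(2)]
        return (gains[0] * theta[0], gains[1] * theta[1])


def cell_of(disp):
    """Map a displacement (the behaviour descriptor) to a grid cell."""
    return (math.floor(disp[0] / CELL), math.floor(disp[1] / CELL))


def in_safe_zone(pos):
    return math.hypot(pos[0], pos[1]) <= SAFE_RADIUS


def predicted_ok(model, archive, pos, theta):
    """Selection filter: predicted to stay safe and to fill an empty cell."""
    d = model.predict(theta)
    return in_safe_zone((pos[0] + d[0], pos[1] + d[1])) and cell_of(d) not in archive


def run(iterations=200, seed=0):
    rng = random.Random(seed)
    model, archive, pos, recoveries = ScaleModel(), {}, (0.0, 0.0), 0
    for _ in range(iterations):
        if not in_safe_zone(pos):
            # Recovery step: move one unit toward the centre, no human reset.
            norm = math.hypot(pos[0], pos[1])
            pos = (pos[0] - pos[0] / norm, pos[1] - pos[1] / norm)
            recoveries += 1
            continue
        # Propose candidates: mutate archived controllers or sample at random.
        candidates = []
        for _ in range(20):
            if archive and rng.random() < 0.7:
                parent = rng.choice(list(archive.values()))[0]
                candidates.append(tuple(p + rng.gauss(0, 0.3) for p in parent))
            else:
                candidates.append((rng.uniform(-1, 1), rng.uniform(-1, 1)))
        selected = [c for c in candidates if predicted_ok(model, archive, pos, c)]
        if not selected:
            continue  # nothing looked promising; resample on the next iteration
        theta = selected[0]
        disp = true_step(theta, rng)   # one "real" evaluation
        model.update(theta, disp)      # learn from the interaction
        pos = (pos[0] + disp[0], pos[1] + disp[1])
        archive.setdefault(cell_of(disp), (theta, disp))
    return archive, recoveries
```

Calling `run()` returns the archive (a dictionary from grid cell to a controller and its observed displacement) and the number of recovery steps taken. Only candidates that pass the model-based filter are executed on the simulated robot, which is the sample-efficiency idea the abstract attributes to dynamics awareness.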


