Environmental effects on emergent strategy in micro-scale multi-agent reinforcement learning

July 03, 2023 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Samuel Tovey, David Zimmer, Christoph Lohrmann, Tobias Merkt, Simon Koppenhoefer, Veit-Lorenz Heuthe, Clemens Bechinger, Christian Holm arXiv ID 2307.00994 Category physics.bio-ph Cross-listed cs.LG, cs.RO Citations 3 Venue arXiv.org Last Checked 1 month ago

Abstract

Multi-Agent Reinforcement Learning (MARL) is a promising candidate for realizing efficient control of microscopic particles, of which micro-robots are a subset. However, the microscopic particles' environment presents unique challenges, such as Brownian motion at sufficiently small length-scales. In this work, we explore the role of temperature in the emergence and efficacy of strategies in MARL systems using particle-based Langevin molecular dynamics simulations as a realistic representation of micro-scale environments. To this end, we perform experiments on two different multi-agent tasks in microscopic environments at different temperatures, detecting the source of a concentration gradient and rotation of a rod. We find that at higher temperatures, the RL agents identify new strategies for achieving these tasks, highlighting the importance of understanding this regime and providing insight into optimal training strategies for bridging the generalization gap between simulation and reality. We also introduce a novel Python package for studying microscopic agents using reinforcement learning (RL) to accompany our results.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — physics.bio-ph

R.I.P. 👻 Ghosted

Predictability and hierarchy in Drosophila behavior

Gordon J. Berman, William Bialek, Joshua W. Shaevitz

physics.bio-ph 🏛 PNAS 📚 193 cites 9 years ago

R.I.P. 👻 Ghosted

Passive wing deployment and retraction in beetles and flapping microrobots

Hoang-Vu Phan, Hoon Cheol Park, Dario Floreano

physics.bio-ph 🏛 Nature 📚 40 cites 1 year ago

R.I.P. 👻 Ghosted

Body-terrain interaction affects large bump traversal of insects and legged robots

Sean W. Gart, Chen Li

physics.bio-ph 🏛 Bioinspiration & Biomimetics 📚 34 cites 6 years ago

R.I.P. 👻 Ghosted

Comparison of Decision Tree Based Classification Strategies to Detect External Chemical Stimuli from Raw and Filtered Plant Electrical Response

Shre Kumar Chatterjee, Saptarshi Das, ... (+6 more)

physics.bio-ph 🏛 arXiv 📚 27 cites 8 years ago

R.I.P. 👻 Ghosted

First free-flight flow visualisation of a flapping-wing robot

Matěj Karásek, Mustafa Percin, ... (+5 more)

physics.bio-ph 🏛 International Journal of Micro Air Vehicles 📚 19 cites 9 years ago

R.I.P. 👻 Ghosted

Learning the shape of protein micro-environments with a holographic convolutional neural network

Michael N. Pun, Andrew Ivanov, ... (+6 more)

physics.bio-ph 🏛 bioRxiv 📚 16 cites 3 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago