Environmental effects on emergent strategy in micro-scale multi-agent reinforcement learning
July 03, 2023 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Samuel Tovey, David Zimmer, Christoph Lohrmann, Tobias Merkt, Simon Koppenhoefer, Veit-Lorenz Heuthe, Clemens Bechinger, Christian Holm
arXiv ID
2307.00994
Category
physics.bio-ph
Cross-listed
cs.LG,
cs.RO
Citations
3
Venue
arXiv.org
Last Checked
1 month ago
Abstract
Multi-Agent Reinforcement Learning (MARL) is a promising candidate for realizing efficient control of microscopic particles, of which micro-robots are a subset. However, the microscopic particles' environment presents unique challenges, such as Brownian motion at sufficiently small length-scales. In this work, we explore the role of temperature in the emergence and efficacy of strategies in MARL systems using particle-based Langevin molecular dynamics simulations as a realistic representation of micro-scale environments. To this end, we perform experiments on two different multi-agent tasks in microscopic environments at different temperatures, detecting the source of a concentration gradient and rotation of a rod. We find that at higher temperatures, the RL agents identify new strategies for achieving these tasks, highlighting the importance of understanding this regime and providing insight into optimal training strategies for bridging the generalization gap between simulation and reality. We also introduce a novel Python package for studying microscopic agents using reinforcement learning (RL) to accompany our results.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.bio-ph
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Passive wing deployment and retraction in beetles and flapping microrobots
R.I.P.
π»
Ghosted
Body-terrain interaction affects large bump traversal of insects and legged robots
R.I.P.
π»
Ghosted
Comparison of Decision Tree Based Classification Strategies to Detect External Chemical Stimuli from Raw and Filtered Plant Electrical Response
R.I.P.
π»
Ghosted
First free-flight flow visualisation of a flapping-wing robot
R.I.P.
π»
Ghosted
Learning the shape of protein micro-environments with a holographic convolutional neural network
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Language Models are Few-Shot Learners
R.I.P.
π»
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
π»
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
π»
Ghosted