Learning to Communicate in Multi-Agent Reinforcement Learning : A Review

November 13, 2019 · The Cartographer · 🏛 arXiv.org

"No code URL or promise found in abstract"
"Title-pattern auto-detect: Learning to Communicate in Multi-Agent Reinforcement Learning : A Review"

Evidence collected by the PWNC Scanner

Authors Mohamed Salah Zaïem, Etienne Bennequin arXiv ID 1911.05438 Category cs.LG: Machine Learning Cross-listed cs.MA, stat.ML Citations 17 Venue arXiv.org Last Checked 10 days ago

Abstract

We consider the issue of multiple agents learning to communicate through reinforcement learning within partially observable environments, with a focus on information asymmetry in the second part of our work. We provide a review of the recent algorithms developed to improve the agents' policy by allowing the sharing of information between agents and the learning of communication strategies, with a focus on Deep Recurrent Q-Network-based models. We also describe recent efforts to interpret the languages generated by these agents and study their properties in an attempt to generate human-language-like sentences. We discuss the metrics used to evaluate the generated communication strategies and propose a novel entropy-based evaluation metric. Finally, we address the issue of the cost of communication and introduce the idea of an experimental setup to expose this cost in cooperative-competitive game.