Evolutionary Action Selection for Gradient-based Policy Learning

January 12, 2022 · Declared Dead · 🏛 International Conference on Neural Information Processing

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li arXiv ID 2201.04286 Category cs.NE: Neural & Evolutionary Cross-listed cs.LG Citations 12 Venue International Conference on Neural Information Processing Last Checked 3 months ago

Abstract

Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have recently been integrated to take the advantage of the both methods for better exploration and exploitation.The evolutionary part in these hybrid methods maintains a population of policy networks.However, existing methods focus on optimizing the parameters of policy network, which is usually high-dimensional and tricky for EA.In this paper, we shift the target of evolution from high-dimensional parameter space to low-dimensional action space.We propose Evolutionary Action Selection-Twin Delayed Deep Deterministic Policy Gradient (EAS-TD3), a novel hybrid method of EA and DRL.In EAS, we focus on optimizing the action chosen by the policy network and attempt to obtain high-quality actions to promote policy learning through an evolutionary algorithm. We conduct several experiments on challenging continuous control tasks.The result shows that EAS-TD3 shows superior performance over other state-of-art methods.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Neural & Evolutionary

R.I.P. 👻 Ghosted

A Style-Based Generator Architecture for Generative Adversarial Networks

Tero Karras, Samuli Laine, Timo Aila

cs.NE 🏛 CVPR 📚 12.3K cites 7 years ago

R.I.P. 👻 Ghosted

Progressive Growing of GANs for Improved Quality, Stability, and Variation

Tero Karras, Timo Aila, ... (+2 more)

cs.NE 🏛 ICLR 📚 8.2K cites 8 years ago

R.I.P. 👻 Ghosted

Learning both Weights and Connections for Efficient Neural Networks

Song Han, Jeff Pool, ... (+2 more)

cs.NE 🏛 NeurIPS 📚 7.4K cites 10 years ago

R.I.P. 👻 Ghosted

LSTM: A Search Space Odyssey

Klaus Greff, Rupesh Kumar Srivastava, ... (+3 more)

cs.NE 🏛 IEEE TNNLS 📚 6.0K cites 11 years ago

R.I.P. 👻 Ghosted

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Dan Hendrycks, Kevin Gimpel

cs.NE 🏛 ICLR 📚 4.0K cites 9 years ago

R.I.P. 👻 Ghosted

An Introduction to Convolutional Neural Networks

Keiron O'Shea, Ryan Nash

cs.NE 🏛 arXiv 📚 3.8K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 6 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago