Primal-dual Learning for the Model-free Risk-constrained Linear Quadratic Regulator

November 22, 2020 · Declared Dead · 🏛 Conference on Learning for Dynamics & Control

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Feiran Zhao, Keyou You
arXiv ID: 2011.10931
Category: eess.SY: Systems & Control (EE)
Cross-listed: cs.LG
Citations: 22
Venue: Conference on Learning for Dynamics & Control
Last Checked: 1 month ago

Abstract

Risk-aware control, though promising for handling unexpected events, requires an exactly known dynamical model. In this work, we propose a model-free framework to learn a risk-aware controller, with a focus on linear systems. We formulate the task as a discrete-time infinite-horizon LQR problem with a state predictive variance constraint. To solve it, we parameterize the policy with a feedback gain pair and use primal-dual methods to optimize it from data alone. We first study the optimization landscape of the Lagrangian function and establish strong duality despite its non-convexity. We also find that the Lagrangian function enjoys an important local gradient dominance property, which we exploit to develop a convergent random search algorithm for learning the dual function. Furthermore, we propose a primal-dual algorithm with global convergence to learn the optimal policy-multiplier pair. Finally, we validate our results via simulations.
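The primal-dual idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm or code: it assumes a hypothetical scalar system, a single gain k instead of the paper's feedback gain pair, and a bound on the stationary state variance as a simplified stand-in for the state predictive variance constraint. The oracle `evaluate` returns closed-form stationary values in place of rollout estimates so the run is deterministic; the optimizer only ever queries it as a black box. All names and constants below are illustrative assumptions.

```python
import random

# Hypothetical scalar system x_{t+1} = A*x_t + B*u_t + w_t with Var(w_t) = W.
A, B, Q, R, W = 0.9, 1.0, 1.0, 1.0, 1.0
C_MAX = 1.05  # assumed bound on the stationary state variance (stand-in constraint)


def evaluate(k):
    """Black-box oracle: (LQR cost, state variance) for the policy u = -k*x.

    Closed-form stationary values stand in for noisy rollout estimates.
    """
    rho = A - B * k  # closed-loop pole
    if abs(rho) >= 1.0:
        return float("inf"), float("inf")  # gain is not stabilizing
    var = W / (1.0 - rho * rho)
    return (Q + R * k * k) * var, var


def lagrangian(k, lam):
    cost, var = evaluate(k)
    if var == float("inf"):
        return float("inf")
    return cost + lam * (var - C_MAX)


def primal_dual(k=0.5, lam=0.0, outer=60, inner=30,
                step_k=0.05, step_lam=10.0, radius=1e-3, seed=0):
    """Nested loop: zeroth-order descent on k, projected ascent on lam."""
    rng = random.Random(seed)
    for _ in range(outer):
        # Primal: approximately minimize the Lagrangian over k by random search.
        for _ in range(inner):
            # Random unit direction; in one dimension this is a random sign,
            # so the estimate reduces to a central finite difference.
            d = rng.choice((-1.0, 1.0))
            diff = lagrangian(k + radius * d, lam) - lagrangian(k - radius * d, lam)
            k -= step_k * d * diff / (2.0 * radius)
        # Dual: ascend on the constraint violation, projected onto lam >= 0.
        _, var = evaluate(k)
        lam = max(0.0, lam + step_lam * (var - C_MAX))
    return k, lam


k_opt, lam_opt = primal_dual()
cost_opt, var_opt = evaluate(k_opt)
print(f"k={k_opt:.4f} lambda={lam_opt:.4f} cost={cost_opt:.4f} var={var_opt:.4f}")
```

In this toy setting the constraint is active at the solution, so the multiplier settles at a positive value and the learned gain is larger than the unconstrained LQR gain: the controller pays extra cost to reduce state variance. The step sizes and iteration counts were chosen by hand for this example and carry no guarantee beyond it.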



📜 Similar Papers

In the same crypt — Systems & Control (EE)

R.I.P. 👻 Ghosted

A Tutorial on Modeling and Analysis of Dynamic Social Networks. Part I

Anton V. Proskurnikov, Roberto Tempo

eess.SY 🏛 Annual Reviews in Control 📚 560 cites 9 years ago

R.I.P. 👻 Ghosted

Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey

Dimitri P. Bertsekas

eess.SY 🏛 arXiv 📚 454 cites 10 years ago

R.I.P. 👻 Ghosted

Wireless Network Design for Control Systems: A Survey

Pangun Park, Sinem Coleri Ergen, ... (+3 more)

eess.SY 🏛 IEEE COMST 📚 447 cites 8 years ago

R.I.P. 👻 Ghosted

Learning-based Model Predictive Control for Safe Exploration

Torsten Koller, Felix Berkenkamp, ... (+2 more)

eess.SY 🏛 CDC 📚 412 cites 8 years ago

R.I.P. 👻 Ghosted

Safety-Critical Model Predictive Control with Discrete-Time Control Barrier Function

Jun Zeng, Bike Zhang, Koushil Sreenath

eess.SY 🏛 ACC 📚 388 cites 5 years ago

R.I.P. 👻 Ghosted

Novel Multidimensional Models of Opinion Dynamics in Social Networks

Sergey E. Parsegov, Anton V. Proskurnikov, ... (+2 more)

eess.SY 🏛 IEEE TAC 📚 372 cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago