Regret-optimal measurement-feedback control

November 24, 2020 · Declared Dead · 🏛 Conference on Learning for Dynamics & Control

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Gautam Goel, Babak Hassibi arXiv ID 2011.12785 Category eess.SY: Systems & Control (EE) Cross-listed cs.LG, math.OC Citations 25 Venue Conference on Learning for Dynamics & Control Last Checked 1 month ago

Abstract

We consider measurement-feedback control in linear dynamical systems from the perspective of regret minimization. Unlike most prior work in this area, we focus on the problem of designing an online controller which competes with the optimal dynamic sequence of control actions selected in hindsight, instead of the best controller in some specific class of controllers. This formulation of regret is attractive when the environment changes over time and no single controller achieves good performance over the entire time horizon. We show that in the measurement-feedback setting, unlike in the full-information setting, there is no single offline controller which outperforms every other offline controller on every disturbance, and propose a new $H_2$-optimal offline controller as a benchmark for the online controller to compete against. We show that the corresponding regret-optimal online controller can be found via a novel reduction to the classical Nehari problem from robust control and present a tight data-dependent bound on its regret.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Systems & Control (EE)

R.I.P. 👻 Ghosted

A Tutorial on Modeling and Analysis of Dynamic Social Networks. Part I

Anton V. Proskurnikov, Roberto Tempo

eess.SY 🏛 Annual Reviews in Control 📚 560 cites 9 years ago

R.I.P. 👻 Ghosted

Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey

Dimitri P. Bertsekas

eess.SY 🏛 arXiv 📚 454 cites 10 years ago

R.I.P. 👻 Ghosted

Wireless Network Design for Control Systems: A Survey

Pangun Park, Sinem Coleri Ergen, ... (+3 more)

eess.SY 🏛 IEEE COMST 📚 447 cites 8 years ago

R.I.P. 👻 Ghosted

Learning-based Model Predictive Control for Safe Exploration

Torsten Koller, Felix Berkenkamp, ... (+2 more)

eess.SY 🏛 CDC 📚 412 cites 8 years ago

R.I.P. 👻 Ghosted

Safety-Critical Model Predictive Control with Discrete-Time Control Barrier Function

Jun Zeng, Bike Zhang, Koushil Sreenath

eess.SY 🏛 ACC 📚 388 cites 5 years ago

R.I.P. 👻 Ghosted

Novel Multidimensional Models of Opinion Dynamics in Social Networks

Sergey E. Parsegov, Anton V. Proskurnikov, ... (+2 more)

eess.SY 🏛 IEEE TAC 📚 372 cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago