Tractable Reinforcement Learning of Signal Temporal Logic Objectives
January 26, 2020 ยท Entered Twilight ยท ๐ Conference on Learning for Dynamics & Control
"Last commit was 6.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: CompRobustness.m, CompRobustness_signal.m, ComputeRobustness.m, ComputeRobustness2.m, Create_Ts_adj.m, Indicator.m, README.md, STL_Learning__L4DC_.pdf, dynamics.m, eGreedy.m, full_fact.m, main.m, updateFlag.m
Authors
Harish Venkataraman, Derya Aksaray, Peter Seiler
arXiv ID
2001.09467
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.RO,
stat.ML
Citations
33
Venue
Conference on Learning for Dynamics & Control
Repository
https://github.com/kumaa001/Tractable_RL_for_STL_Objectives
Last Checked
1 month ago
Abstract
Signal temporal logic (STL) is an expressive language to specify time-bound real-world robotic tasks and safety specifications. Recently, there has been an interest in learning optimal policies to satisfy STL specifications via reinforcement learning (RL). Learning to satisfy STL specifications often needs a sufficient length of state history to compute reward and the next action. The need for history results in exponential state-space growth for the learning problem. Thus the learning problem becomes computationally intractable for most real-world applications. In this paper, we propose a compact means to capture state history in a new augmented state-space representation. An approximation to the objective (maximizing probability of satisfaction) is proposed and solved for in the new augmented state-space. We show the performance bound of the approximate solution and compare it with the solution of an existing technique via simulations.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
R.I.P.
๐ป
Ghosted
Semi-Supervised Classification with Graph Convolutional Networks
R.I.P.
๐ป
Ghosted
Proximal Policy Optimization Algorithms
R.I.P.
๐ป
Ghosted