Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning

November 15, 2019 ยท Declared Dead ยท ๐Ÿ› NeurIPS Datasets and Benchmarks

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Cameron Voloshin, Hoang M. Le, Nan Jiang, Yisong Yue arXiv ID 1911.06854 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.RO, stat.ML Citations 168 Venue NeurIPS Datasets and Benchmarks Last Checked 3 months ago
Abstract
We offer an experimental benchmark and empirical study for off-policy policy evaluation (OPE) in reinforcement learning, which is a key problem in many safety critical applications. Given the increasing interest in deploying learning-based methods, there has been a flurry of recent proposals for OPE method, leading to a need for standardized empirical analyses. Our work takes a strong focus on diversity of experimental design to enable stress testing of OPE methods. We provide a comprehensive benchmarking suite to study the interplay of different attributes on method performance. We distill the results into a summarized set of guidelines for OPE in practice. Our software package, the Caltech OPE Benchmarking Suite (COBS), is open-sourced and we invite interested researchers to further contribute to the benchmark.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted