Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

June 20, 2017 Β· Entered Twilight Β· πŸ› AAAI Conference on Artificial Intelligence

πŸŒ… TWILIGHT: Old Age
Predates the code-sharing era β€” a pioneer of its time

"Last commit was 8.0 years ago (β‰₯5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, BayesianNeuralNetwork.py, ExperienceReplay.py, HiPMDP.py, LICENSE.txt, PriorityQueue.py, Qnetwork.py, README.md, acrobot_simulator, grid_simulator, hiv_simulator, preset_parameters, toy_example.ipynb

Authors Taylor Killian, Samuel Daulton, George Konidaris, Finale Doshi-Velez arXiv ID 1706.06544 Category stat.ML: Machine Learning (Stat) Cross-listed cs.AI, cs.LG Citations 110 Venue AAAI Conference on Artificial Intelligence Repository https://github.com/dtak/hip-mdp-public ⭐ 30 Last Checked 1 month ago
Abstract
We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Machine Learning (Stat)

R.I.P. πŸ‘» Ghosted

Graph Attention Networks

Petar VeličkoviΔ‡, Guillem Cucurull, ... (+4 more)

stat.ML πŸ› ICLR πŸ“š 24.7K cites 8 years ago
R.I.P. πŸ‘» Ghosted

Layer Normalization

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

stat.ML πŸ› arXiv πŸ“š 12.0K cites 9 years ago