Propagation Networks for Model-Based Control Under Partial Observation

September 28, 2018 Β· Entered Twilight Β· πŸ› IEEE International Conference on Robotics and Automation

πŸŒ… TWILIGHT: Old Age
Predates the code-sharing era β€” a pioneer of its time

"No code URL or promise found in abstract"
"Code repo scraped from project page (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, README.md, data.py, data, dump_Box, dump_Cradle, dump_Rope, eval.py, imgs, make_gifs.py, models.py, mpc.py, physics_engine.py, scripts, train.py, utils.py

Authors Yunzhu Li, Jiajun Wu, Jun-Yan Zhu, Joshua B. Tenenbaum, Antonio Torralba, Russ Tedrake arXiv ID 1809.11169 Category cs.AI: Artificial Intelligence Cross-listed cs.LG, cs.RO Citations 151 Venue IEEE International Conference on Robotics and Automation Repository https://github.com/YunzhuLi/PropNet ⭐ 48 Last Checked 6 days ago
Abstract
There has been an increasing interest in learning dynamics simulators for model-based control. Compared with off-the-shelf physics engines, a learnable simulator can quickly adapt to unseen objects, scenes, and tasks. However, existing models like interaction networks only work for fully observable systems; they also only consider pairwise interactions within a single time step, both restricting their use in practical systems. We introduce Propagation Networks (PropNet), a differentiable, learnable dynamics model that handles partially observable scenarios and enables instantaneous propagation of signals beyond pairwise interactions. Experiments show that our propagation networks not only outperform current learnable physics engines in forward simulation, but also achieve superior performance on various control tasks. Compared with existing model-free deep reinforcement learning algorithms, model-based control with propagation networks is more accurate, efficient, and generalizable to new, partially observable scenes and tasks.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Artificial Intelligence