Recurrent Neural Networks for Driver Activity Anticipation via Sensory-Fusion Architecture

September 16, 2015 · Entered Twilight · 🏛 IEEE International Conference on Robotics and Automation

"No code URL or promise found in abstract"
"Code repo scraped from project page (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, anticipatory-rnn, structural_rnn

Authors Ashesh Jain, Avi Singh, Hema S Koppula, Shane Soh, Ashutosh Saxena arXiv ID 1509.05016 Category cs.CV: Computer Vision Cross-listed cs.AI, cs.RO Citations 272 Venue IEEE International Conference on Robotics and Automation Repository https://github.com/asheshjain399/RNNexp ⭐ 256 Last Checked 7 days ago

Abstract

Anticipating the future actions of a human is a widely studied problem in robotics that requires spatio-temporal reasoning. In this work we propose a deep learning approach for anticipation in sensory-rich robotics applications. We introduce a sensory-fusion architecture which jointly learns to anticipate and fuse information from multiple sensory streams. Our architecture consists of Recurrent Neural Networks (RNNs) that use Long Short-Term Memory (LSTM) units to capture long temporal dependencies. We train our architecture in a sequence-to-sequence prediction manner, and it explicitly learns to predict the future given only a partial temporal context. We further introduce a novel loss layer for anticipation which prevents over-fitting and encourages early anticipation. We use our architecture to anticipate driving maneuvers several seconds before they happen on a natural driving data set of 1180 miles. The context for maneuver anticipation comes from multiple sensors installed on the vehicle. Our approach shows significant improvement over the state-of-the-art in maneuver anticipation by increasing the precision from 77.4% to 90.5% and recall from 71.2% to 87.4%.