Looking Inside Out: Anticipating Driver Intent From Videos

December 03, 2023 ยท Entered Twilight ยท ๐Ÿ› IEEE International Conference on Robotics and Automation

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: .gitignore, README.md, basictrainedlstm.keras, config, dataloader, gazeproject.ipynb, lstm gaze.ipynb, models, test.py, train.py, transformer gaze.ipynb

Authors Yung-chi Kung, Arthur Zhang, Junmin Wang, Joydeep Biswas arXiv ID 2312.01444 Category cs.CV: Computer Vision Cross-listed cs.HC Citations 5 Venue IEEE International Conference on Robotics and Automation Repository https://github.com/ykung83/Driver-Intent-Prediction โญ 5 Last Checked 1 month ago
Abstract
Anticipating driver intention is an important task when vehicles of mixed and varying levels of human/machine autonomy share roadways. Driver intention can be leveraged to improve road safety, such as warning surrounding vehicles in the event the driver is attempting a dangerous maneuver. In this work, we propose a novel method of utilizing in-cabin and external camera data to improve state-of-the-art (SOTA) performance in predicting future driver actions. Compared to existing methods, our approach explicitly extracts object and road-level features from external camera data, which we demonstrate are important features for predicting driver intention. Using our handcrafted features as inputs for both a transformer and an LSTM-based architecture, we empirically show that jointly utilizing in-cabin and external features improves performance compared to using in-cabin features alone. Furthermore, our models predict driver maneuvers more accurately and earlier than existing approaches, with an accuracy of 87.5% and an average prediction time of 4.35 seconds before the maneuver takes place. We release our model configurations and training scripts on https://github.com/ykung83/Driver-Intent-Prediction
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision