Textual Explanations for Self-Driving Vehicles

July 30, 2018 ยท Entered Twilight ยท ๐Ÿ› European Conference on Computer Vision

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 7.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: LICENSE, README.md, Step0_download_BDDVdata.py, Step1_preprocessing.py, Step2_1_test_CNNonly.py, Step2_train_CNNonly.py, Step3_1_test_Attention.py, Step3_train_Attention.py, Step4_preprocessing_explanation.py, dask_generator.py, data, requirements.txt, server.py, src

Authors Jinkyu Kim, Anna Rohrbach, Trevor Darrell, John Canny, Zeynep Akata arXiv ID 1807.11546 Category cs.CV: Computer Vision Citations 410 Venue European Conference on Computer Vision Repository https://github.com/JinkyuKimUCB/explainable-deep-driving โญ 87 Last Checked 1 month ago
Abstract
Deep neural perception and control networks have become key components of self-driving vehicles. User acceptance is likely to benefit from easy-to-interpret textual explanations which allow end-users to understand what triggered a particular behavior. Explanations may be triggered by the neural controller, namely introspective explanations, or informed by the neural controller's output, namely rationalizations. We propose a new approach to introspective explanations which consists of two parts. First, we use a visual (spatial) attention model to train a convolutional network end-to-end from images to the vehicle control commands, i.e., acceleration and change of course. The controller's attention identifies image regions that potentially influence the network's output. Second, we use an attention-based video-to-text model to produce textual explanations of model actions. The attention maps of controller and explanation model are aligned so that explanations are grounded in the parts of the scene that mattered to the controller. We explore two approaches to attention alignment, strong- and weak-alignment. Finally, we explore a version of our model that generates rationalizations, and compare with introspective explanations on the same video segments. We evaluate these models on a novel driving dataset with ground-truth human explanations, the Berkeley DeepDrive eXplanation (BDD-X) dataset. Code is available at https://github.com/JinkyuKimUCB/explainable-deep-driving.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision