S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving

June 22, 2022 ยท Entered Twilight ยท ๐Ÿ› Asian Conference on Machine Learning

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: LICENSE, README.md, arg_parser.py, arg_types.py, checkpoints, config, data, data_processor.py, feeder.py, main.py, model.py, multi_attention_forward.py, prediction_result.txt, prediction_result.zip, requirements.txt, seq2seq_transformer.py, sttf_layer.py, utilies.py

Authors Weihuang Chen, Fangfang Wang, Hongbin Sun arXiv ID 2206.10902 Category cs.CV: Computer Vision Cross-listed cs.LG, cs.RO Citations 55 Venue Asian Conference on Machine Learning Repository https://github.com/chenghuang66/s2tnet โญ 47 Last Checked 1 month ago
Abstract
To safely and rationally participate in dense and heterogeneous traffic, autonomous vehicles require to sufficiently analyze the motion patterns of surrounding traffic-agents and accurately predict their future trajectories. This is challenging because the trajectories of traffic-agents are not only influenced by the traffic-agents themselves but also by spatial interaction with each other. Previous methods usually rely on the sequential step-by-step processing of Long Short-Term Memory networks (LSTMs) and merely extract the interactions between spatial neighbors for single type traffic-agents. We propose the Spatio-Temporal Transformer Networks (S2TNet), which models the spatio-temporal interactions by spatio-temporal Transformer and deals with the temporel sequences by temporal Transformer. We input additional category, shape and heading information into our networks to handle the heterogeneity of traffic-agents. The proposed methods outperforms state-of-the-art methods on ApolloScape Trajectory dataset by more than 7\% on both the weighted sum of Average and Final Displacement Error. Our code is available at https://github.com/chenghuang66/s2tnet.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision