S2TNet: Spatio-Temporal Transformer Networks for Trajectory Prediction in Autonomous Driving
June 22, 2022 ยท Entered Twilight ยท ๐ Asian Conference on Machine Learning
Repo contents: LICENSE, README.md, arg_parser.py, arg_types.py, checkpoints, config, data, data_processor.py, feeder.py, main.py, model.py, multi_attention_forward.py, prediction_result.txt, prediction_result.zip, requirements.txt, seq2seq_transformer.py, sttf_layer.py, utilies.py
Authors
Weihuang Chen, Fangfang Wang, Hongbin Sun
arXiv ID
2206.10902
Category
cs.CV: Computer Vision
Cross-listed
cs.LG,
cs.RO
Citations
55
Venue
Asian Conference on Machine Learning
Repository
https://github.com/chenghuang66/s2tnet
โญ 47
Last Checked
1 month ago
Abstract
To safely and rationally participate in dense and heterogeneous traffic, autonomous vehicles require to sufficiently analyze the motion patterns of surrounding traffic-agents and accurately predict their future trajectories. This is challenging because the trajectories of traffic-agents are not only influenced by the traffic-agents themselves but also by spatial interaction with each other. Previous methods usually rely on the sequential step-by-step processing of Long Short-Term Memory networks (LSTMs) and merely extract the interactions between spatial neighbors for single type traffic-agents. We propose the Spatio-Temporal Transformer Networks (S2TNet), which models the spatio-temporal interactions by spatio-temporal Transformer and deals with the temporel sequences by temporal Transformer. We input additional category, shape and heading information into our networks to handle the heterogeneity of traffic-agents. The proposed methods outperforms state-of-the-art methods on ApolloScape Trajectory dataset by more than 7\% on both the weighted sum of Average and Final Displacement Error. Our code is available at https://github.com/chenghuang66/s2tnet.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computer Vision
๐
๐
Old Age
๐
๐
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
๐ป
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
๐
๐
Old Age
SSD: Single Shot MultiBox Detector
๐
๐
Old Age
Squeeze-and-Excitation Networks
R.I.P.
๐ป
Ghosted