Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments

September 26, 2019 · Entered Twilight · 🏛 IEEE Workshop/Winter Conference on Applications of Computer Vision

"Last commit was 5.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, citywalks.gif, datasets.py, evaluate_outputs.py, metrics.py, models.py, mof.jpg, requirements.txt, sted.jpg, sted_evaluate.py, sted_train.py, trainer.py, utils.py

Authors Olly Styles, Tanaya Guha, Victor Sanchez arXiv ID 1909.11944 Category cs.CV: Computer Vision Citations 43 Venue IEEE Workshop/Winter Conference on Applications of Computer Vision Repository https://github.com/olly-styles/Multiple-Object-Forecasting ⭐ 60 Last Checked 1 month ago

Abstract

This paper introduces the problem of multiple object forecasting (MOF), in which the goal is to predict future bounding boxes of tracked objects. In contrast to existing works on object trajectory forecasting which primarily consider the problem from a birds-eye perspective, we formulate the problem from an object-level perspective and call for the prediction of full object bounding boxes, rather than trajectories alone. Towards solving this task, we introduce the Citywalks dataset, which consists of over 200k high-resolution video frames. Citywalks comprises of footage recorded in 21 cities from 10 European countries in a variety of weather conditions and over 3.5k unique pedestrian trajectories. For evaluation, we adapt existing trajectory forecasting methods for MOF and confirm cross-dataset generalizability on the MOT-17 dataset without fine-tuning. Finally, we present STED, a novel encoder-decoder architecture for MOF. STED combines visual and temporal features to model both object-motion and ego-motion, and outperforms existing approaches for MOF. Code & dataset link: https://github.com/olly-styles/Multiple-Object-Forecasting