PAPER DIGEST
Most Influential ECCV 2020 Paper · 2026-03 edition

Tracking Objects As Points

Xingyi Zhou; Vladlen Koltun; Philipp Krähenbühl

Venue
European Conference on Computer Vision (ECCV) 2020
Recognition
Most Influential ECCV 2020 Paper (Rank No. 10)
Edition
2026-03
Impact factor
8
Certificate ID
03c0d1ed10c3b4f6

Abstract

Tracking has traditionally been the art of following interest points through space and time. This changed with the rise of powerful deep networks. Nowadays, tracking is dominated by pipelines that perform object detection followed by temporal association, also known as tracking-by-detection. In this paper, we present a simultaneous detection and tracking algorithm that is simpler, faster, and more accurate than the state of the art. Our tracker, CenterTrack, applies a detection model to a pair of images and detections from the prior frame. Given this minimal input, CenterTrack localizes objects and predicts their associations with the previous frame. That's it. CenterTrack is simple, online (no peeking into the future), and real-time. It achieves 67.3% MOTA on the MOT17 challenge at 17 FPS and 89.4% MOTA on the KITTI tracking benchmark at 12 FPS, setting a new state of the art on both datasets. CenterTrack is easily extended to monocular 3D tracking by regressing additional 3D attributes. Using monocular video input, it achieves 28.3% AMOTA@0.2 on the newly released nuScenes 3D tracking benchmark, substantially outperforming the monocular baseline on this benchmark while running at 22 FPS.
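
As a rough illustration of the association step the abstract describes, the sketch below links current-frame detections to prior-frame tracks using a per-object displacement. It is not the authors' implementation: the function name `greedy_associate`, the offset sign convention (current center plus offset gives the estimated prior-frame position), the `max_dist` threshold, and the greedy nearest-neighbor matching are all assumptions made for this example. In a real system the centers and offsets would come from the detection network.

```python
import math


def greedy_associate(curr_centers, offsets, prev_tracks, max_dist=50.0, next_id=0):
    """Hypothetical helper: assign a track ID to each current detection.

    curr_centers: list of (x, y) object centers in the current frame.
    offsets:      list of (dx, dy), assumed to point from each current
                  center back to the object's position in the prior frame.
    prev_tracks:  dict mapping track ID -> (x, y) center in the prior frame.
    max_dist:     assumed matching radius in pixels.
    next_id:      first unused track ID.

    Returns (ids, next_id), where ids[i] is the track ID of detection i.
    """
    unmatched = dict(prev_tracks)  # prior tracks not yet claimed
    ids = []
    for (x, y), (dx, dy) in zip(curr_centers, offsets):
        # Estimated location of this object in the prior frame.
        px, py = x + dx, y + dy
        best_id, best_dist = None, max_dist
        for track_id, (tx, ty) in unmatched.items():
            dist = math.hypot(px - tx, py - ty)
            if dist < best_dist:
                best_id, best_dist = track_id, dist
        if best_id is None:
            # No prior track close enough: start a new track.
            best_id = next_id
            next_id += 1
        else:
            # Each prior track can be matched at most once.
            del unmatched[best_id]
        ids.append(best_id)
    return ids, next_id


# Toy example: two objects persist across frames, one is new.
prev_tracks = {0: (10.0, 10.0), 1: (100.0, 100.0)}
curr_centers = [(103.0, 101.0), (12.0, 11.0), (300.0, 300.0)]
offsets = [(-3.0, -1.0), (-2.0, -1.0), (0.0, 0.0)]
ids, next_id = greedy_associate(curr_centers, offsets, prev_tracks, next_id=2)
print(ids, next_id)
```

In this toy input, the first two detections map back onto existing tracks and the third, which has no nearby prior track, receives a fresh ID. Processing detections in list order is a simplification; ordering by detection confidence would be a natural variation.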
