MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

2023-07-28 · ICCV 2023 · Code Available

Ruopeng Gao, LiMin Wang

Code

github.com/mcg-nju/memotr
Official · In paper · PyTorch · ★ 219

Abstract

As a video task, Multiple Object Tracking (MOT) is expected to capture temporal information of targets effectively. Unfortunately, most existing methods only explicitly exploit the object features between adjacent frames, while lacking the capacity to model long-term temporal information. In this paper, we propose MeMOTR, a long-term memory-augmented Transformer for multi-object tracking. Our method is able to make the same object's track embedding more stable and distinguishable by leveraging long-term memory injection with a customized memory-attention layer. This significantly improves the target association ability of our model. Experimental results on DanceTrack show that MeMOTR impressively surpasses the state-of-the-art method by 7.9% and 13.0% on HOTA and AssA metrics, respectively. Furthermore, our model also outperforms other Transformer-based methods on association performance on MOT17 and generalizes well on BDD100K. Code is available at https://github.com/MCG-NJU/MeMOTR.
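The abstract does not say how the long-term memory is maintained, so the following is only a minimal sketch under a stated assumption: each track keeps a memory vector that is updated as an exponential moving average of its per-frame embedding. The function name `update_long_term_memory` and the `rate` parameter are hypothetical and are not taken from the paper or its repository. The sketch is meant to illustrate why a slowly updated memory can keep one object's representation stable across frames.

```python
from typing import List


def update_long_term_memory(
    memory: List[float], embedding: List[float], rate: float = 0.01
) -> List[float]:
    """Return a new memory vector blended with the current-frame embedding.

    Hypothetical helper: an exponential moving average, chosen only for
    illustration. The actual update rule used by MeMOTR may differ.
    """
    if len(memory) != len(embedding):
        raise ValueError("memory and embedding must have the same length")
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must lie in [0, 1]")
    # A small rate lets the memory drift slowly, so a single noisy frame
    # (for example, during an occlusion) has limited influence on it.
    return [(1.0 - rate) * m + rate * e for m, e in zip(memory, embedding)]


# Toy track: the embedding is stable except for one outlier frame.
frames = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
memory = frames[0]
for embedding in frames[1:]:
    memory = update_long_term_memory(memory, embedding, rate=0.1)
print(memory)
```

In this toy run the outlier frame moves the memory only a short distance from the stable embedding, whereas a tracker that used the raw per-frame embedding would jump to the outlier. How the memory is then injected through the memory-attention layer is not covered by this sketch.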

Tasks

Multi-Object Tracking, Multiple Object Tracking, Object, Object Tracking

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
DanceTrack	MeMOTR	HOTA	68.5	—	Unverified
DanceTrack	MeMOTR (Deformable DETR)	HOTA	63.4	—	Unverified
SportsMOT	MeMOTR	HOTA	70	—	Unverified
SportsMOT	MeMOTR (Deformable DETR)	HOTA	68.8	—	Unverified
