Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

2022-11-07arXiv 2022Unverified0· sign in to hype

Qiang Chen, Jian Wang, Chuchu Han, Shan Zhang, Zexian Li, Xiaokang Chen, Jiahui Chen, Xiaodi Wang, Shuming Han, Gang Zhang, Haocheng Feng, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

arXiv PDF

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We present a strong object detector with encoder-decoder pretraining and finetuning. Our method, called Group DETR v2, is built upon a vision transformer encoder ViT-Huge~dosovitskiy2020image, a DETR variant DINO~zhang2022dino, and an efficient DETR training method Group DETR~chen2022group. The training process consists of self-supervised pretraining and finetuning a ViT-Huge encoder on ImageNet-1K, pretraining the detector on Object365, and finally finetuning it on COCO. Group DETR v2 achieves 64.5 mAP on COCO test-dev, and establishes a new SoTA on the COCO leaderboard https://paperswithcode.com/sota/object-detection-on-coco

Tasks

Decoder Object object-detection Object Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
COCO test-dev	Group DETR v2	box mAP	64.5	—	Unverified

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

Abstract

Tasks

Benchmark Results

Reproductions