RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning

2025-04-12Code Available1· sign in to hype

Feng Lv, Chunlong Xia, Shuo Wang, Huo Cao

Code Available — Be the first to reproduce this paper.

Code

github.com/Jeremy-lf/RT-DATR
paddle★ 18

Abstract

Despite domain-adaptive object detectors based on CNN and transformers have made significant progress in cross-domain detection tasks, it is regrettable that domain adaptation for real-time transformer-based detectors has not yet been explored. Directly applying existing domain adaptation algorithms has proven to be suboptimal. In this paper, we propose RT-DATR, a simple and efficient real-time domain adaptive detection transformer. Building on RT-DETR as our base detector, we first introduce a local object-level feature alignment module to significantly enhance the feature representation of domain invariance during object transfer. Additionally, we introduce a scene semantic feature alignment module designed to boost cross-domain detection performance by aligning scene semantic features. Finally, we introduced a domain query and decoupled it from the object query to further align the instance feature distribution within the decoder layer, reduce the domain gap, and maintain discriminative ability. Experimental results on various benchmarks demonstrate that our method outperforms current state-of-the-art approaches. Our code will be released soon.

Tasks

Domain Adaptation Domain Generalization Object Detection Real-Time Object Detection Unsupervised Domain Adaptation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
BDD100k to Cityscapes	RT-DATR(real-time, 640x640，R-34)	mAP	46.5	—	Unverified
Cityscapes-to-Foggy Cityscapes	RT-DATR(640x640, real-time)	mAP@0.5	52.7	—	Unverified
Kitti to Cityscapes	RT-DATR(real-time, 640x640)	mAP@0.5	50.3	—	Unverified
SIM10K to Cityscapes	RT-DATR(real-time, 640x640)	mAP@0.5	67.2	—	Unverified

RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions