SOTAVerified

Object Detection

Papers

Showing 326–350 of 10957 papers

| Title | Status | Hype |
|---|---|---|
| VkD: Improving Knowledge Distillation using Orthogonal Projections | Code | 2 |
| Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator | Code | 2 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Code | 2 |
| Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline | Code | 2 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Code | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Code | 2 |
| Segment and Caption Anything | Code | 2 |
| TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models | Code | 2 |
| TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Code | 2 |
| Adapter is All You Need for Tuning Visual Tasks | Code | 2 |
| FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin | Code | 2 |
| TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition | Code | 2 |
| Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks | Code | 2 |
| GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment | Code | 2 |
| UniPAD: A Universal Pre-training Paradigm for Autonomous Driving | Code | 2 |
| CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection | Code | 2 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Code | 2 |
| You Only Look at Once for Real-time and Generic Multi-Task | Code | 2 |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Code | 2 |
| Detect Everything with Few Examples | Code | 2 |
| EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization | Code | 2 |
| RMT: Retentive Networks Meet Vision Transformers | Code | 2 |
| RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud | Code | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Code | 2 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Code | 2 |

Benchmark Results

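| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |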