SOTAVerified

Object Detection

Papers

Showing 150 of 10957 papers

TitleStatusHype
YOLOv9: Learning What You Want to Learn Using Programmable Gradient InformationCode16
YOLOv10: Real-Time End-to-End Object DetectionCode11
LW-DETR: A Transformer Replacement to YOLO for Real-Time DetectionCode9
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisCode8
DETRs Beat YOLOs on Real-time Object DetectionCode8
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
Visual-RFT: Visual Reinforcement Fine-TuningCode7
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsCode7
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyCode7
MambaVision: A Hybrid Mamba-Transformer Vision BackboneCode7
MambaOut: Do We Really Need Mamba for Vision?Code7
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Slicing Aided Hyper Inference and Fine-tuning for Small Object DetectionCode5
Infinite Photorealistic Worlds using Procedural GenerationCode5
Retinexformer: One-stage Retinex-based Transformer for Low-light Image EnhancementCode5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and MoreCode5
DEIM: DETR with Improved Matching for Fast ConvergenceCode5
EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network DesignCode5
YOLOv6 v3.0: A Full-Scale ReloadingCode5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryCode5
YOLOR-Based Multi-Task LearningCode5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual PerceptionCode5
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion HeadCode5
A ConvNet for the 2020sCode5
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
YOLOv6: A Single-Stage Object Detection Framework for Industrial ApplicationsCode5
GCoNet+: A Stronger Group Collaborative Co-Salient Object DetectorCode4
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive ReinforcementCode4
FG-CLIP: Fine-Grained Visual and Textual AlignmentCode4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View RepresentationCode4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionCode4
Strip R-CNN: Large Strip Convolution for Remote Sensing Object DetectionCode4
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense PredictionCode4
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual ModelsCode4
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
DN-DETR: Accelerate DETR Training by Introducing Query DeNoisingCode4
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingCode4
RSAR: Restricted State Angle Resolver and Rotated SAR BenchmarkCode4
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic KernelsCode4
DiffusionDet: Diffusion Model for Object DetectionCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for RoboticsCode4
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNNCode4
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationCode4
Deep Residual Learning for Image RecognitionCode4
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object DetectionCode4
Mamba YOLO: A Simple Baseline for Object Detection with State Space ModelCode4
Show:102550
← PrevPage 1 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified