SOTAVerified

Object Detection

Papers

Showing 150 of 10957 papers

TitleStatusHype
YOLOv9: Learning What You Want to Learn Using Programmable Gradient InformationCode16
YOLOv10: Real-Time End-to-End Object DetectionCode11
LW-DETR: A Transformer Replacement to YOLO for Real-Time DetectionCode9
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
DETRs Beat YOLOs on Real-time Object DetectionCode8
DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisCode8
Visual-RFT: Visual Reinforcement Fine-TuningCode7
MambaVision: A Hybrid Mamba-Transformer Vision BackboneCode7
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
MambaOut: Do We Really Need Mamba for Vision?Code7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyCode7
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsCode7
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual PerceptionCode5
DEIM: DETR with Improved Matching for Fast ConvergenceCode5
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryCode5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and MoreCode5
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion HeadCode5
YOLOR-Based Multi-Task LearningCode5
Infinite Photorealistic Worlds using Procedural GenerationCode5
Retinexformer: One-stage Retinex-based Transformer for Low-light Image EnhancementCode5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network DesignCode5
YOLOv6 v3.0: A Full-Scale ReloadingCode5
YOLOv6: A Single-Stage Object Detection Framework for Industrial ApplicationsCode5
Slicing Aided Hyper Inference and Fine-tuning for Small Object DetectionCode5
A ConvNet for the 2020sCode5
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection FrameworkCode4
FG-CLIP: Fine-Grained Visual and Textual AlignmentCode4
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive ReinforcementCode4
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic KernelsCode4
RSAR: Restricted State Angle Resolver and Rotated SAR BenchmarkCode4
Strip R-CNN: Large Strip Convolution for Remote Sensing Object DetectionCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
Mamba YOLO: A Simple Baseline for Object Detection with State Space ModelCode4
A Survey on Visual MambaCode4
LSKNet: A Foundation Lightweight Backbone for Remote SensingCode4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionCode4
TUMTraf V2X Cooperative Perception DatasetCode4
ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object DetectionCode4
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for RoboticsCode4
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingCode4
Memory-aided Contrastive Consensus Learning for Co-salient Object DetectionCode4
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
DAMO-YOLO : A Report on Real-Time Object Detection DesignCode4
DiffusionDet: Diffusion Model for Object DetectionCode4
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsCode4
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D DetectionCode4
Show:102550
← PrevPage 1 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified