| YOLOv10: Real-Time End-to-End Object Detection | May 23, 2024 | 2D Object DetectionData Augmentation | CodeCode Available | 11 |
| DETRs Beat YOLOs on Real-time Object Detection | Apr 17, 2023 | 2D Object DetectionDecoder | CodeCode Available | 8 |
| YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | Jul 6, 2022 | 2D Object DetectionGPU | CodeCode Available | 7 |
| VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning | May 17, 2025 | 2D Object DetectionObject Counting | CodeCode Available | 4 |
| Wavelet Convolutions for Large Receptive Fields | Jul 8, 2024 | 2D Object Detection2D Semantic Segmentation | CodeCode Available | 4 |
| DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images | Jun 5, 2024 | 2D Object DetectionDenoising | CodeCode Available | 4 |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Mar 11, 2024 | 2D Object Detection2k | CodeCode Available | 4 |
| OpenAgents: An Open Platform for Language Agents in the Wild | Oct 16, 2023 | 2D Object Detection | CodeCode Available | 4 |
| InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | Nov 10, 2022 | 2D Object DetectionClassification | CodeCode Available | 4 |
| GLIPv2: Unifying Localization and Vision-Language Understanding | Jun 12, 2022 | 2D Object DetectionContrastive Learning | CodeCode Available | 4 |
| PP-YOLOE: An evolved version of YOLO | Mar 30, 2022 | 2D Object DetectionDense Object Detection | CodeCode Available | 4 |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Jul 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 3 |
| Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Jun 4, 2024 | 2D Object Detection3D Instance Segmentation | CodeCode Available | 3 |
| SARATR-X: Toward Building A Foundation Model for SAR Target Recognition | May 15, 2024 | 2D Object DetectionEarth Observation | CodeCode Available | 3 |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Mar 24, 2024 | 2D Object DetectionComputational Efficiency | CodeCode Available | 3 |
| Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection | Nov 19, 2023 | 2D Object DetectionDeepFake Detection | CodeCode Available | 3 |
| Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | Jan 9, 2023 | 2D Object DetectionContrastive Learning | CodeCode Available | 3 |
| XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model | Jul 14, 2022 | 2D Human Pose Estimation2D Object Detection | CodeCode Available | 3 |
| Workshop on Autonomous Driving at CVPR 2021: Technical Report for Streaming Perception Challenge | Jul 27, 2021 | 2D Object DetectionAutonomous Driving | CodeCode Available | 3 |
| Deformable DETR: Deformable Transformers for End-to-End Object Detection | Oct 8, 2020 | 2D Object DetectionObject Detection | CodeCode Available | 3 |
| Distributional Generalization: A New Kind of Generalization | Sep 17, 2020 | 2D Object Detection | CodeCode Available | 3 |
| SCoralDet: Efficient real-time underwater soft coral detection with YOLO | Dec 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 2 |
| PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds | May 8, 2023 | 2D Object Detection3D Object Detection | CodeCode Available | 2 |
| PACO: Parts and Attributes of Common Objects | Jan 4, 2023 | 2D Object DetectionAttribute | CodeCode Available | 2 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object DetectionImage Retrieval | CodeCode Available | 2 |
| Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection | Mar 30, 2022 | 2D Object DetectionBilevel Optimization | CodeCode Available | 2 |
| DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR | Jan 28, 2022 | 2D Object DetectionObject Detection | CodeCode Available | 2 |
| Grounded Language-Image Pre-training | Dec 7, 2021 | 2D Object DetectionDescribed Object Detection | CodeCode Available | 2 |
| 2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object Detection | Jun 16, 2021 | 2D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| 2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object Detection | Jun 16, 2021 | 2D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Sparse R-CNN: End-to-End Object Detection with Learnable Proposals | Nov 25, 2020 | 2D Object DetectionObject | CodeCode Available | 2 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object DetectionDistributed Computing | CodeCode Available | 1 |
| Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection | Dec 20, 2024 | 2D Object DetectionImage Enhancement | CodeCode Available | 1 |
| UAVDB: Trajectory-Guided Adaptable Bounding Boxes for UAV Detection | Sep 9, 2024 | 2D Object DetectionDiversity | CodeCode Available | 1 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object DetectionCode Generation | CodeCode Available | 1 |
| TXL-PBC: a freely accessible labeled peripheral blood cell dataset | Jul 18, 2024 | 2D Object Detection | CodeCode Available | 1 |
| SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers | Apr 5, 2024 | 2D Object Detection2D Tiny Object Detection | CodeCode Available | 1 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| BGF-YOLO: Enhanced YOLOv8 with Multiscale Attentional Feature Fusion for Brain Tumor Detection | Sep 22, 2023 | 2D Object DetectionMedical Diagnosis | CodeCode Available | 1 |
| RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection | Jul 31, 2023 | 2D Object DetectionMedical Diagnosis | CodeCode Available | 1 |
| C^2Former: Calibrated and Complementary Transformer for RGB-Infrared Object Detection | Jun 28, 2023 | 2D Object DetectionMultispectral Object Detection | CodeCode Available | 1 |
| CST-YOLO: A Novel Method for Blood Cell Detection Based on Improved YOLOv7 and CNN-Swin Transformer | Jun 26, 2023 | 2D Object DetectionBlood Cell Detection | CodeCode Available | 1 |
| A Gated Cross-domain Collaborative Network for Underwater Object Detection | Jun 25, 2023 | 2D Object DetectionImage Enhancement | CodeCode Available | 1 |
| Object Detection with Transformers: A Review | Jun 7, 2023 | 2D Object DetectionObject | CodeCode Available | 1 |
| Large, Complex, and Realistic Safety Clothing and Helmet Detection: Dataset and Method | Jun 3, 2023 | 2D Object Detectionobject-detection | CodeCode Available | 1 |
| FishEye8K: A Benchmark and Dataset for Fisheye Camera Object Detection | May 27, 2023 | 2D Object Detectionobject-detection | CodeCode Available | 1 |
| WEDGE: A multi-weather autonomous driving dataset built from generative vision-language models | May 12, 2023 | 2D Object DetectionAdversarial Robustness | CodeCode Available | 1 |
| Parcel3D: Shape Reconstruction from Single RGB Images for Applications in Transportation Logistics | Apr 18, 2023 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation | Apr 12, 2023 | 2D Object DetectionImage Retrieval | CodeCode Available | 1 |