| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural NetworkGaze Estimation | CodeCode Available | 2 |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | May 19, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Objectobject-detection | CodeCode Available | 2 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoisingmodel | CodeCode Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Dec 6, 2024 | Objectobject-detection | CodeCode Available | 2 |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Aug 18, 2022 | Objectobject-detection | CodeCode Available | 2 |
| CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Apr 25, 2024 | MambaMultispectral Object Detection | CodeCode Available | 2 |
| Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | May 27, 2025 | Concept Alignmentobject-detection | CodeCode Available | 2 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| ChainerCV: a Library for Deep Learning in Computer Vision | Aug 28, 2017 | Deep Learningobject-detection | CodeCode Available | 2 |
| CitySim: A Drone-Based Vehicle Trajectory Dataset for Safety Oriented Research and Digital Twins | Aug 23, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection | Apr 3, 2024 | Autonomous Vehiclesobject-detection | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |
| DetGPT: Detect What You Need via Reasoning | May 23, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | May 3, 2023 | Instance SegmentationObject | CodeCode Available | 2 |
| DETR Does Not Need Multi-Scale or Locality Design | Jan 1, 2023 | DecoderObject Detection | CodeCode Available | 2 |
| Scale-Aware Modulation Meet Transformer | Jul 17, 2023 | object-detectionObject Detection | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting | Apr 10, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs | Jun 21, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 |
| SCoralDet: Efficient real-time underwater soft coral detection with YOLO | Dec 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsamplingimage-classification | CodeCode Available | 2 |
| Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution | Jul 31, 2020 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| self-prompting analogical reasoning for uav object detection | Apr 11, 2025 | graph constructionobject-detection | CodeCode Available | 2 |
| Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Jan 23, 2024 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| Detection in Crowded Scenes: One Proposal, Multiple Predictions | Mar 20, 2020 | Object DetectionPedestrian Detection | CodeCode Available | 2 |
| COALA: A Practical and Vision-Centric Federated Learning Platform | Jul 23, 2024 | BenchmarkingContinual Learning | CodeCode Available | 2 |
| SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry | Jul 5, 2024 | Benchmarkingobject-detection | CodeCode Available | 2 |
| CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers | Jul 5, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts | Jul 24, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection | Oct 4, 2023 | 3D Object Detectioncross-modal alignment | CodeCode Available | 2 |
| Simplifying Object Segmentation with PixelLib Library | Jan 20, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Jun 3, 2025 | 3D Object DetectionAttribute | CodeCode Available | 2 |
| Detecting Everything in the Open World: Towards Universal Object Detection | Mar 21, 2023 | object-detectionObject Detection | CodeCode Available | 2 |
| Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Jan 22, 2024 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 |
| Detect Everything with Few Examples | Sep 22, 2023 | Binary ClassificationCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| SNIPER: Efficient Multi-Scale Training | May 23, 2018 | GPUimage-classification | CodeCode Available | 2 |
| SOLOv2: Dynamic and Fast Instance Segmentation | Mar 23, 2020 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection | Jul 1, 2024 | Objectobject-detection | CodeCode Available | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance SegmentationObject | CodeCode Available | 2 |
| Complex-YOLO: Real-time 3D Object Detection on Point Clouds | Mar 16, 2018 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Jun 9, 2023 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 2 |
| SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection | Apr 27, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Nov 23, 2024 | Data AugmentationDiversity | CodeCode Available | 2 |
| Dense Distinct Query for End-to-End Object Detection | Mar 22, 2023 | Objectobject-detection | CodeCode Available | 2 |