| Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details | Feb 1, 2021 | Benchmarkingobject-detection | CodeCode Available | 2 | 5 |
| 2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object Detection | Jun 16, 2021 | 2D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Mar 9, 2022 | Co-Salient Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data AugmentationDiversity | CodeCode Available | 2 | 5 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | BenchmarkingEfficient ViTs | CodeCode Available | 2 | 5 |
| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPUObject | CodeCode Available | 2 | 5 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Jan 15, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| AdaMixer: A Fast-Converging Query-Based Object Detector | Mar 30, 2022 | ObjectObject Detection | CodeCode Available | 2 | 5 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | AllObject | CodeCode Available | 2 | 5 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance SegmentationObject | CodeCode Available | 2 | 5 |
| FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Jan 1, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 | 5 |
| Focusing on Tracks for Online Multi-Object Tracking | Jun 15, 2025 | global-optimizationMulti-Object Tracking | CodeCode Available | 2 | 5 |
| Adapter is All You Need for Tuning Visual Tasks | Nov 25, 2023 | Allimage-classification | CodeCode Available | 2 | 5 |
| Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Mar 8, 2024 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 | 5 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoisingmodel | CodeCode Available | 2 | 5 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | DecoderInstance Segmentation | CodeCode Available | 2 | 5 |
| Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression | Nov 19, 2019 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | May 19, 2025 | Event-based visionObject | CodeCode Available | 2 | 5 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Jun 11, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 2 | 5 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 | 5 |