| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image Classification, Instance Segmentation | Code Available | 2 | 5 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Feb 23, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Apr 6, 2025 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 2 | 5 |
| DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | May 7, 2025 | object-detection, Object Detection | Code Available | 2 | 5 |
| ESOD: Efficient Small Object Detection on High-Resolution Images | Jul 23, 2024 | GPU, Object | Code Available | 2 | 5 |
| Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details | Feb 1, 2021 | Benchmarking, object-detection | Code Available | 2 | 5 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Mar 21, 2023 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 | 5 |
| A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space | Dec 19, 2024 | Computational Efficiency, object-detection | Code Available | 2 | 5 |
| FasterViT: Fast Vision Transformers with Hierarchical Attention | Jun 9, 2023 | Image Classification, object-detection | Code Available | 2 | 5 |
| Fast Vision Transformers with HiLo Attention | May 26, 2022 | Benchmarking, Efficient ViTs | Code Available | 2 | 5 |
| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPU, Object | Code Available | 2 | 5 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Jan 15, 2024 | Few-Shot Object Detection, Meta-Learning | Code Available | 2 | 5 |
| Fine-Grained Stochastic Architecture Search | Jun 17, 2020 | Neural Architecture Search, object-detection | Code Available | 2 | 5 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance Segmentation, Object | Code Available | 2 | 5 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene Classification, Building change detection for remote sensing images | Code Available | 2 | 5 |
| Dataset Quantization | Aug 21, 2023 | Dataset Distillation, object-detection | Code Available | 2 | 5 |
| Focal Modulation Networks | Mar 22, 2022 | image-classification, Image Classification | Code Available | 2 | 5 |
| Focusing on Tracks for Online Multi-Object Tracking | Jun 15, 2025 | global-optimization, Multi-Object Tracking | Code Available | 2 | 5 |
| FreeSOLO: Learning to Segment Objects without Annotations | Feb 24, 2022 | Instance Segmentation, object-detection | Code Available | 2 | 5 |
| DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | May 20, 2024 | Contrastive Learning, Domain Adaptation | Code Available | 2 | 5 |
| Frustratingly Simple Few-Shot Object Detection | Mar 16, 2020 | Cross-Domain Few-Shot Object Detection, Few-Shot Object Detection | Code Available | 2 | 5 |
| Center-based 3D Object Detection and Tracking | Jun 19, 2020 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 | 5 |
| CenterFormer: Center-based Transformer for 3D Object Detection | Sep 12, 2022 | 3D Object Detection, Object | Code Available | 2 | 5 |
| Decoupled Knowledge Distillation | Mar 16, 2022 | image-classification, Image Classification | Code Available | 2 | 5 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Object, object-detection | Code Available | 2 | 5 |
| MobileOne: An Improved One millisecond Mobile Backbone | Jun 8, 2022 | Efficient Neural Network, Gaze Estimation | Code Available | 2 | 5 |
| Generative Sparse Detection Networks for 3D Single-shot Object Detection | Jun 22, 2020 | 3D Object Detection, Decoder | Code Available | 2 | 5 |
| GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval | Jul 17, 2024 | Decoder, Image Enhancement | Code Available | 2 | 5 |
| DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction | Sep 30, 2024 | 3D Object Detection, 3D Semantic Occupancy Prediction | Code Available | 2 | 5 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classification, Image Classification | Code Available | 2 | 5 |
| Going Denser with Open-Vocabulary Part Segmentation | May 18, 2023 | Object, object-detection | Code Available | 2 | 5 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classification, Image Classification | Code Available | 2 | 5 |
| ChainerCV: a Library for Deep Learning in Computer Vision | Aug 28, 2017 | Deep Learning, object-detection | Code Available | 2 | 5 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Object, object-detection | Code Available | 2 | 5 |
| Grounded Language-Image Pre-training | Dec 7, 2021 | 2D Object Detection, Described Object Detection | Code Available | 2 | 5 |
| CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Apr 25, 2024 | Mamba, Multispectral Object Detection | Code Available | 2 | 5 |
| CGCOD: Class-Guided Camouflaged Object Detection | Dec 25, 2024 | Object, object-detection | Code Available | 2 | 5 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object Detection, Scene Understanding | Code Available | 2 | 5 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Feb 5, 2024 | Object, object-detection | Code Available | 2 | 5 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object Detection, Decoder | Code Available | 2 | 5 |
| Hierarchical Open-vocabulary Universal Image Segmentation | Jul 3, 2023 | Image Comprehension, Image Segmentation | Code Available | 2 | 5 |
| HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Jul 28, 2022 | Image Classification, Object Detection | Code Available | 2 | 5 |
| Detecting Everything in the Open World: Towards Universal Object Detection | Mar 21, 2023 | object-detection, Object Detection | Code Available | 2 | 5 |
| EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Mar 31, 2023 | 3D Object Detection, Depth Estimation | Code Available | 2 | 5 |
| ALBench: A Framework for Evaluating Active Learning in Object Detection | Jul 27, 2022 | Active Learning, image-classification | Code Available | 2 | 5 |
| ICDAR 2021 Competition on Scientific Literature Parsing | Jun 8, 2021 | document understanding, object-detection | Code Available | 2 | 5 |
| iFormer: Integrating ConvNet and Transformer for Mobile Application | Jan 26, 2025 | Instance Segmentation, object-detection | Code Available | 2 | 5 |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Jun 3, 2024 | 3D Object Detection, Image-to-Image Translation | Code Available | 2 | 5 |
| MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion | Apr 12, 2024 | Image Reconstruction, Mamba | Code Available | 2 | 5 |