| Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception | Jun 10, 2023 | 3D Object DetectionBenchmarking | CodeCode Available | 2 | 5 |
| AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Nov 23, 2024 | Data AugmentationDiversity | CodeCode Available | 2 | 5 |
| EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Jun 11, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 2 | 5 |
| EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization | Sep 20, 2023 | Knowledge Distillationobject-detection | CodeCode Available | 2 | 5 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Feb 23, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| EMOv2: Pushing 5M Vision Model Frontier | Dec 9, 2024 | Image Generationmodel | CodeCode Available | 2 | 5 |
| 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Oct 2, 2024 | 3DGS3D Object Detection | CodeCode Available | 2 | 5 |
| ESOD: Efficient Small Object Detection on High-Resolution Images | Jul 23, 2024 | GPUObject | CodeCode Available | 2 | 5 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignmentMultiple Object Tracking | CodeCode Available | 2 | 5 |
| Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Nov 4, 2024 | Earth ObservationObject | CodeCode Available | 2 | 5 |
| UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Sep 18, 2021 | Change DetectionDecoder | CodeCode Available | 2 | 5 |
| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| FasterViT: Fast Vision Transformers with Hierarchical Attention | Jun 9, 2023 | Image Classificationobject-detection | CodeCode Available | 2 | 5 |
| A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Mar 17, 2023 | 3D Object Detection3D Reconstruction | CodeCode Available | 2 | 5 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Mar 14, 2022 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Jan 15, 2024 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 2 | 5 |
| Fine-Grained Stochastic Architecture Search | Jun 17, 2020 | Neural Architecture Searchobject-detection | CodeCode Available | 2 | 5 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance SegmentationInteractive Segmentation | CodeCode Available | 2 | 5 |
| FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Aug 8, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 | 5 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Mar 14, 2024 | Autonomous DrivingObject | CodeCode Available | 2 | 5 |
| Frustratingly Simple Few-Shot Object Detection | Mar 16, 2020 | Cross-Domain Few-Shot Object DetectionFew-Shot Object Detection | CodeCode Available | 2 | 5 |
| Fully Test-Time Adaptation for Monocular 3D Object Detection | May 30, 2024 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 2 | 5 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 | 5 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Jun 21, 2022 | Image ClassificationObject Detection | CodeCode Available | 2 | 5 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Mar 8, 2022 | Image Classificationobject-detection | CodeCode Available | 2 | 5 |
| MogaNet: Multi-order Gated Aggregation Network | Nov 7, 2022 | 3D Human Pose EstimationImage Classification | CodeCode Available | 2 | 5 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Apr 9, 2025 | Contrastive Learningcounterfactual | CodeCode Available | 2 | 5 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | May 19, 2025 | Event-based visionObject | CodeCode Available | 2 | 5 |
| A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space | Dec 19, 2024 | Computational Efficiencyobject-detection | CodeCode Available | 2 | 5 |
| A Strong and Reproducible Object Detector with Only Public Datasets | Apr 25, 2023 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Jan 15, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| Global Context Networks | Dec 24, 2020 | Instance SegmentationObject Detection | CodeCode Available | 2 | 5 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Aug 15, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | May 10, 2024 | graph constructionimage-classification | CodeCode Available | 2 | 5 |
| EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Mar 31, 2023 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 | 5 |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Apr 4, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object DetectionScene Understanding | CodeCode Available | 2 | 5 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Feb 5, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| A Simple Aerial Detection Baseline of Multimodal Language Models | Jan 16, 2025 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Apr 3, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Jul 18, 2023 | Knowledge Distillationobject-detection | CodeCode Available | 2 | 5 |
| Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | May 23, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Oct 8, 2019 | Dimensionality Reductionimage-classification | CodeCode Available | 2 | 5 |
| Efficient Multi-Scale Attention Module with Cross-Spatial Learning | May 23, 2023 | Dimensionality Reductionimage-classification | CodeCode Available | 2 | 5 |
| Equalized Focal Loss for Dense Long-Tailed Object Detection | Jan 7, 2022 | Long-tailed Object DetectionObject | CodeCode Available | 2 | 5 |
| Improving CLIP Fine-tuning Performance | Jan 1, 2023 | Diagnosticobject-detection | CodeCode Available | 2 | 5 |