| MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Jul 6, 2025 | 3D Object DetectionAttribute | CodeCode Available | 2 |
| Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning | Jun 27, 2025 | Foreground Segmentationobject-detection | CodeCode Available | 2 |
| Focusing on Tracks for Online Multi-Object Tracking | Jun 15, 2025 | global-optimizationMulti-Object Tracking | CodeCode Available | 2 |
| Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Jun 3, 2025 | 3D Object DetectionAttribute | CodeCode Available | 2 |
| Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | May 27, 2025 | Concept Alignmentobject-detection | CodeCode Available | 2 |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | May 19, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | May 19, 2025 | Event-based visionObject | CodeCode Available | 2 |
| DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | May 7, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results | Apr 14, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Apr 13, 2025 | Domain AdaptationLanguage Modeling | CodeCode Available | 2 |
| self-prompting analogical reasoning for uav object detection | Apr 11, 2025 | graph constructionobject-detection | CodeCode Available | 2 |
| P2Object: Single Point Supervised Object Detection and Instance Segmentation | Apr 10, 2025 | Instance SegmentationMultiple Instance Learning | CodeCode Available | 2 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Apr 9, 2025 | Contrastive Learningcounterfactual | CodeCode Available | 2 |
| Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Apr 6, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection | Mar 29, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection | Mar 18, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 2 |
| RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Mar 13, 2025 | Computational EfficiencyMamba | CodeCode Available | 2 |
| Referring to Any Person | Mar 11, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 2 |
| MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Mar 3, 2025 | Object Detection | CodeCode Available | 2 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Feb 12, 2025 | Earth Observationobject-detection | CodeCode Available | 2 |
| MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection | Feb 7, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| iFormer: Integrating ConvNet and Transformer for Mobile Application | Jan 26, 2025 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection | Jan 23, 2025 | object-detectionObject Detection | CodeCode Available | 2 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Jan 23, 2025 | Multi-Object Trackingobject-detection | CodeCode Available | 2 |