| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPUObject | CodeCode Available | 2 |
| SSD: Single Shot MultiBox Detector | Dec 8, 2015 | LIDAR Semantic SegmentationLow-Light Image Enhancement | CodeCode Available | 2 |
| Fast R-CNN | Apr 30, 2015 | ObjectObject Detection | CodeCode Available | 2 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Jul 7, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Jun 30, 2025 | Autonomous NavigationComputational Efficiency | CodeCode Available | 1 |
| Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Jun 12, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Multiple Object Stitching for Unsupervised Representation Learning | Jun 9, 2025 | Contrastive LearningObject | CodeCode Available | 1 |
| Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Jun 4, 2025 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Jun 3, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| OD3: Optimization-free Dataset Distillation for Object Detection | Jun 2, 2025 | Dataset Distillationimage-classification | CodeCode Available | 1 |
| Adaptive Semantic Token Communication for Transformer-based Edge Inference | May 23, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | May 22, 2025 | Autonomous Vehiclesobject-detection | CodeCode Available | 1 |
| Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | May 20, 2025 | Few-Shot Object DetectionInstance Segmentation | CodeCode Available | 1 |
| AGI-Elo: How Far Are We From Mastering A Task? | May 19, 2025 | Code GenerationImage Classification | CodeCode Available | 1 |
| M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | May 16, 2025 | Benchmarkingobject-detection | CodeCode Available | 1 |
| StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | May 15, 2025 | Face RecognitionObject | CodeCode Available | 1 |
| M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | May 10, 2025 | Autonomous DrivingMotion Forecasting | CodeCode Available | 1 |
| DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | May 9, 2025 | Action DetectionDecoder | CodeCode Available | 1 |
| A Simple Detector with Frame Dynamics is a Strong Tracker | May 8, 2025 | Objectobject-detection | CodeCode Available | 1 |
| Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | May 7, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | May 3, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | May 2, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 1 |
| LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Apr 30, 2025 | In-Context LearningObject | CodeCode Available | 1 |
| E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Apr 25, 2025 | Foreground SegmentationIn-Context Learning | CodeCode Available | 1 |
| A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Apr 25, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Apr 22, 2025 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| Visual Consensus Prompting for Co-Salient Object Detection | Apr 19, 2025 | Co-Salient Object Detectionobject-detection | CodeCode Available | 1 |
| Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Apr 18, 2025 | 3D Object DetectionGPU | CodeCode Available | 1 |
| DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Apr 15, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| LEMUR Neural Network Dataset: Towards Seamless AutoML | Apr 14, 2025 | AutoMLBenchmarking | CodeCode Available | 1 |
| Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Apr 13, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning | Apr 12, 2025 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Apr 10, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline | Apr 3, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Apr 3, 2025 | 3D Object Detectioncross-modal alignment | CodeCode Available | 1 |
| Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Apr 3, 2025 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| CaLiV: LiDAR-to-Vehicle Calibration of Arbitrary Sensor Setups via Object Reconstruction | Mar 31, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Spectral-Adaptive Modulation Networks for Visual Perception | Mar 31, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Mar 30, 2025 | AttributeDisentanglement | CodeCode Available | 1 |
| Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Mar 27, 2025 | 3D Object DetectionObject | CodeCode Available | 1 |
| Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models | Mar 25, 2025 | BenchmarkingImage Captioning | CodeCode Available | 1 |
| Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery | Mar 24, 2025 | BenchmarkingHumanitarian | CodeCode Available | 1 |
| Superpowering Open-Vocabulary Object Detectors for X-ray Vision | Mar 21, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework | Mar 19, 2025 | Federated LearningObject | CodeCode Available | 1 |
| Robust Object Detection of Underwater Robot based on Domain Generalization | Mar 18, 2025 | Domain GeneralizationObject | CodeCode Available | 1 |
| Is Discretization Fusion All You Need for Collaborative Perception? | Mar 18, 2025 | Allobject-detection | CodeCode Available | 1 |
| State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Mar 18, 2025 | 3D Object DetectionDecoder | CodeCode Available | 1 |
| RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Mar 13, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection | Mar 13, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Mar 11, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |