| Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models | Mar 25, 2025 | BenchmarkingImage Captioning | CodeCode Available | 1 |
| BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction | Mar 25, 2025 | document understandingobject-detection | CodeCode Available | 0 |
| Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer | Mar 25, 2025 | Diagnosticobject-detection | —Unverified | 0 |
| Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception | Mar 25, 2025 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |
| MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection | Mar 25, 2025 | 3DGSobject-detection | —Unverified | 0 |
| Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery | Mar 24, 2025 | BenchmarkingHumanitarian | CodeCode Available | 1 |
| Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach | Mar 24, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Frequency Dynamic Convolution for Dense Image Prediction | Mar 24, 2025 | object-detectionObject Detection | CodeCode Available | 3 |
| CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection | Mar 24, 2025 | Objectobject-detection | CodeCode Available | 0 |
| Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection | Mar 24, 2025 | Autonomous DrivingData Augmentation | —Unverified | 0 |
| LGI-DETR: Local-Global Interaction for UAV Object Detection | Mar 24, 2025 | object-detectionObject Detection | —Unverified | 0 |
| MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability | Mar 22, 2025 | MambaObject | —Unverified | 0 |
| R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception | Mar 21, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection | Mar 21, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Superpowering Open-Vocabulary Object Detectors for X-ray Vision | Mar 21, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Should we pre-train a decoder in contrastive learning for dense prediction tasks? | Mar 21, 2025 | Contrastive LearningDecoder | —Unverified | 0 |
| Event-Based Crossing Dataset (EBCD) | Mar 21, 2025 | Event-based visionobject-detection | CodeCode Available | 0 |
| An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving | Mar 21, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes | Mar 21, 2025 | Few-Shot Object Detectionobject-detection | —Unverified | 0 |
| Region Masking to Accelerate Video Processing on Neuromorphic Hardware | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis | Mar 21, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection | Mar 20, 2025 | 3D Object DetectionActive Learning | —Unverified | 0 |
| MapGlue: Multimodal Remote Sensing Image Matching | Mar 20, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles | Mar 20, 2025 | Autonomous VehiclesDisentanglement | —Unverified | 0 |
| A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions | Mar 19, 2025 | Action RecognitionComputational Efficiency | —Unverified | 0 |
| Test-Time Backdoor Detection for Object Detection Models | Mar 19, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark | Mar 19, 2025 | Objectobject-detection | —Unverified | 0 |
| DCA: Dividing and Conquering Amnesia in Incremental Object Detection | Mar 19, 2025 | Knowledge Distillationobject-detection | CodeCode Available | 0 |
| UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework | Mar 19, 2025 | Federated LearningObject | CodeCode Available | 1 |
| Robust Object Detection of Underwater Robot based on Domain Generalization | Mar 18, 2025 | Domain GeneralizationObject | CodeCode Available | 1 |
| Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator | Mar 18, 2025 | object-detectionObject Detection | —Unverified | 0 |
| LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation | Mar 18, 2025 | DecoderObject | CodeCode Available | 0 |
| TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection | Mar 18, 2025 | GPUobject-detection | —Unverified | 0 |
| Is Discretization Fusion All You Need for Collaborative Perception? | Mar 18, 2025 | Allobject-detection | CodeCode Available | 1 |
| HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object Detection | Mar 18, 2025 | Objectobject-detection | CodeCode Available | 0 |
| LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection | Mar 18, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 2 |
| FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene | Mar 18, 2025 | Objectobject-detection | —Unverified | 0 |
| State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Mar 18, 2025 | 3D Object DetectionDecoder | CodeCode Available | 1 |
| PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Mar 18, 2025 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 0 |
| A Revisit to the Decoder for Camouflaged Object Detection | Mar 18, 2025 | Decoderobject-detection | —Unverified | 0 |
| SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | Mar 17, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection | Mar 17, 2025 | Disaster ResponseInstance Segmentation | —Unverified | 0 |
| MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | Mar 17, 2025 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| 8-Calves Image dataset | Mar 17, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization | Mar 17, 2025 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | Mar 16, 2025 | Change DetectionImage Captioning | —Unverified | 0 |