| MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection | May 22, 2025 | Object, object-detection | Unverified | 0 |
| CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving | May 22, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining | May 22, 2025 | object-detection, Object Detection | Unverified | 0 |
| Self-Classification Enhancement and Correction for Weakly Supervised Object Detection | May 22, 2025 | Binary Classification, Classification | Unverified | 0 |
| Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP | May 22, 2025 | Conformal Prediction, object-detection | Code Available | 0 |
| AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | May 22, 2025 | Autonomous Vehicles, object-detection | Code Available | 1 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPU, Natural Language Queries | Unverified | 0 |
| Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network | May 21, 2025 | object-detection, Object Detection | Unverified | 0 |
| Multispectral Detection Transformer with Infrared-Centric Sensor Fusion | May 21, 2025 | Multispectral Object Detection, Object | Code Available | 0 |
| SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | May 21, 2025 | image-classification, Image Classification | Code Available | 0 |
| SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation | May 20, 2025 | Document Layout Analysis, object-detection | Unverified | 0 |
| Automated Quality Evaluation of Cervical Cytopathology Whole Slide Images Based on Content Analysis | May 20, 2025 | Diagnostic, object-detection | Unverified | 0 |
| Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | May 20, 2025 | Few-Shot Object Detection, Instance Segmentation | Code Available | 1 |
| Scaling Vision Mamba Across Resolutions via Fractal Traversal | May 20, 2025 | Change Detection, image-classification | Unverified | 0 |
| InstanceBEV: Unifying Instance and BEV Representation for Global Modeling | May 20, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving | May 20, 2025 | Autonomous Driving, Bench2Drive | Unverified | 0 |
| Non-planar Object Detection and Identification by Features Matching and Triangulation Growth | May 19, 2025 | Image Retrieval, Industrial Robots | Unverified | 0 |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | May 19, 2025 | object-detection, Object Detection | Code Available | 2 |
| Enhancing Transformers Through Conditioned Embedded Tokens | May 19, 2025 | image-classification, Image Classification | Unverified | 0 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | May 19, 2025 | Event-based vision, Object | Code Available | 2 |
| AGI-Elo: How Far Are We From Mastering A Task? | May 19, 2025 | Code Generation, Image Classification | Code Available | 1 |
| VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection | May 19, 2025 | Autonomous Driving, Language Modeling | Unverified | 0 |
| EarthSynth: Generating Informative Earth Observation with Diffusion Models | May 17, 2025 | counterfactual, Diversity | Unverified | 0 |
| Experimental Study on Automatically Assembling Custom Catering Packages With a 3-DOF Delta Robot Using Deep Learning Methods | May 17, 2025 | object-detection, Object Detection | Unverified | 0 |
| Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study | May 16, 2025 | object-detection, Object Detection | Unverified | 0 |
| MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection | May 16, 2025 | 6D Pose Estimation, Event-based vision | Code Available | 0 |
| M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | May 16, 2025 | Benchmarking, object-detection | Code Available | 1 |
| A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation | May 16, 2025 | Object, object-detection | Unverified | 0 |
| Application of YOLOv8 in monocular downward multiple Car Target detection | May 15, 2025 | Autonomous Driving, object-detection | Unverified | 0 |
| Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data | May 15, 2025 | Defect Detection, object-detection | Unverified | 0 |
| StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | May 15, 2025 | Face Recognition, Object | Code Available | 1 |
| Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | May 14, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection | May 14, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection | May 14, 2025 | object-detection, Object Detection | Code Available | 0 |
| Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance | May 14, 2025 | Domain Adaptation, object-detection | Unverified | 0 |
| Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores | May 13, 2025 | Object, object-detection | Unverified | 0 |
| HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective | May 13, 2025 | Computational Efficiency, Object | Code Available | 0 |
| Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix | May 13, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| MoKD: Multi-Task Optimization for Knowledge Distillation | May 13, 2025 | image-classification, Image Classification | Unverified | 0 |
| DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection | May 12, 2025 | 3D Object Detection, object-detection | Unverified | 0 |
| Hybrid Spiking Vision Transformer for Object Detection with Event Cameras | May 12, 2025 | Event Detection, Object | Unverified | 0 |
| Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection | May 12, 2025 | Domain Generalization, Image Augmentation | Code Available | 0 |
| VALISENS: A Validated Innovative Multi-Sensor System for Cooperative Automated Driving | May 11, 2025 | Motion Forecasting, object-detection | Unverified | 0 |
| Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection | May 11, 2025 | Defect Detection, object-detection | Unverified | 0 |
| M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | May 10, 2025 | Autonomous Driving, Motion Forecasting | Code Available | 1 |
| Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search | May 10, 2025 | Neural Architecture Search, Object | Unverified | 0 |
| METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection | May 10, 2025 | Object, object-detection | Code Available | 0 |
| Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | May 9, 2025 | Autonomous Navigation, Autonomous Vehicles | Unverified | 0 |
| Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection | May 9, 2025 | object-detection, Object Detection | Unverified | 0 |
| DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | May 9, 2025 | Action Detection, Decoder | Code Available | 1 |