| A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains | Jul 17, 2025 | Action RecognitionHand-Object Interaction Detection | —Unverified | 0 |
| RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images | Jul 17, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis | Jul 17, 2025 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection | Jul 17, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | Jul 16, 2025 | Autonomous NavigationAutonomous Vehicles | —Unverified | 0 |
| Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping | Jul 15, 2025 | Instance Segmentationobject-detection | —Unverified | 1 |
| ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge | Jul 8, 2025 | Edge-computingObject | —Unverified | 0 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Jul 7, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Jul 6, 2025 | 3D Object DetectionAttribute | CodeCode Available | 2 |
| Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection | Jul 3, 2025 | Contrastive Learningobject-detection | CodeCode Available | 0 |
| Detection of Rail Line Track and Human Beings Near the Track to Avoid Accidents | Jul 3, 2025 | Line Detectionobject-detection | —Unverified | 0 |
| Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Jun 30, 2025 | Autonomous NavigationComputational Efficiency | CodeCode Available | 1 |
| Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning | Jun 27, 2025 | Foreground Segmentationobject-detection | CodeCode Available | 2 |
| Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Jun 26, 2025 | Objectobject-detection | CodeCode Available | 0 |
| DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic | Jun 26, 2025 | Autonomous DrivingAvg | —Unverified | 0 |
| A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario | Jun 26, 2025 | object-detectionObject Detection | —Unverified | 0 |
| LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection | Jun 26, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation | Jun 26, 2025 | Autonomous NavigationDepth Estimation | —Unverified | 0 |
| Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos | Jun 25, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 |
| TDiR: Transformer based Diffusion for Image Restoration Tasks | Jun 25, 2025 | DenoisingImage Enhancement | —Unverified | 0 |
| Feature Hallucination for Self-supervised Action Recognition | Jun 25, 2025 | Action RecognitionHallucination | —Unverified | 0 |
| From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents | Jun 25, 2025 | Document Layout Analysisobject-detection | —Unverified | 0 |
| A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Jun 24, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | Jun 22, 2025 | image-classificationImage Classification | —Unverified | 0 |
| YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception | Jun 21, 2025 | Computational Efficiencyobject-detection | CodeCode Available | 5 |
| Class Agnostic Instance-level Descriptor for Visual Instance Search | Jun 20, 2025 | Content-Based Image RetrievalImage Retrieval | —Unverified | 0 |
| Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation | Jun 19, 2025 | AstronomyMorphology classification | CodeCode Available | 0 |
| Retrospective Memory for Camouflaged Object Detection | Jun 18, 2025 | Objectobject-detection | —Unverified | 0 |
| VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning | Jun 17, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework | Jun 17, 2025 | Multispectral Object Detectionobject-detection | CodeCode Available | 4 |
| Comparison of Two Methods for Stationary Incident Detection Based on Background Image | Jun 17, 2025 | object-detectionObject Detection | —Unverified | 0 |
| How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection | Jun 16, 2025 | Domain Adaptationobject-detection | —Unverified | 0 |
| Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection | Jun 16, 2025 | Computational EfficiencyObject | —Unverified | 0 |
| UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data | Jun 16, 2025 | 3D Reconstructionobject-detection | —Unverified | 0 |
| FindMeIfYouCan: Bringing Open Set metrics to near , far and farther Out-of-Distribution Object Detection | Jun 16, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos | Jun 16, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| Focusing on Tracks for Online Multi-Object Tracking | Jun 15, 2025 | global-optimizationMulti-Object Tracking | CodeCode Available | 2 |
| MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction | Jun 14, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| Vision-based Lifting of 2D Object Detections for Automated Driving | Jun 13, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds | Jun 13, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion | Jun 12, 2025 | Infrared And Visible Image Fusionobject-detection | CodeCode Available | 0 |
| Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun 12, 2025 | cross-modal alignmentImage to text | —Unverified | 0 |
| Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Jun 12, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement | Jun 12, 2025 | Decoderobject-detection | —Unverified | 0 |
| DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos | Jun 11, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects | Jun 11, 2025 | object-detectionObject Detection | —Unverified | 0 |
| WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras | Jun 10, 2025 | Denoisingobject-detection | —Unverified | 0 |
| Data Augmentation For Small Object using Fast AutoAugment | Jun 10, 2025 | Data AugmentationObject | —Unverified | 0 |
| Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection | Jun 10, 2025 | Class-Incremental Object DetectionObject | —Unverified | 0 |
| ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations | Jun 10, 2025 | Objectobject-detection | —Unverified | 0 |