| Point Cloud Based Scene Segmentation: A Survey | Mar 16, 2025 | 3D Object Detection3D Semantic Segmentation | —Unverified | 0 |
| UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection | Mar 15, 2025 | 3D Object DetectionMamba | —Unverified | 0 |
| Falcon: A Remote Sensing Vision-Language Foundation Model | Mar 14, 2025 | Image Captioningimage-classification | CodeCode Available | 3 |
| FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection | Mar 14, 2025 | DecoderMamba | —Unverified | 0 |
| Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime | Mar 14, 2025 | Computational Efficiencyobject-detection | —Unverified | 0 |
| Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection | Mar 14, 2025 | object-detectionObject Detection | CodeCode Available | 0 |
| FLASHμ: Fast Localizing And Sizing of Holographic Microparticles | Mar 14, 2025 | object-detectionObject Detection | —Unverified | 0 |
| The Power of One: A Single Example is All it Takes for Segmentation in VLMs | Mar 13, 2025 | Allobject-detection | —Unverified | 0 |
| HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer | Mar 13, 2025 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |
| TARS: Traffic-Aware Radar Scene Flow Estimation | Mar 13, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Mar 13, 2025 | Computational EfficiencyMamba | CodeCode Available | 2 |
| Object detection characteristics in a learning factory environment using YOLOv8 | Mar 13, 2025 | object-detectionObject Detection | —Unverified | 0 |
| A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection | Mar 13, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection | Mar 13, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection | Mar 13, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Mar 13, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| How good are deep learning methods for automated road safety analysis using video data? An experimental study | Mar 12, 2025 | Multi-Object Trackingobject-detection | —Unverified | 0 |
| Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | Mar 12, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Fully-Synthetic Training for Visual Quality Inspection in Automotive Production | Mar 12, 2025 | Defect Detectionobject-detection | —Unverified | 0 |
| DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection | Mar 12, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | Mar 12, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection | Mar 12, 2025 | Edge Detectionobject-detection | —Unverified | 0 |
| CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Mar 12, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | Mar 12, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | Mar 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Mar 11, 2025 | 3D Object DetectionObject | CodeCode Available | 1 |
| Simulating Automotive Radar with Lidar and Camera Inputs | Mar 11, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Referring to Any Person | Mar 11, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 2 |
| SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | Mar 11, 2025 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Boundary Regression for Leitmotif Detection in Music Audio | Mar 11, 2025 | Event Detectionobject-detection | —Unverified | 0 |
| Physics-based AI methodology for Material Parameter Extraction from Optical Data | Mar 11, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Mar 11, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| VocalEyes: Enhancing Environmental Perception for the Visually Impaired through Vision-Language Models and Distance-Aware Object Detection | Mar 10, 2025 | NVIDIA Jetson Orin Nanoobject-detection | —Unverified | 0 |
| Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Mar 10, 2025 | 3D Object Detectioncross-modal alignment | —Unverified | 0 |
| A Light Perspective for 3D Object Detection | Mar 10, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection | Mar 10, 2025 | CPUobject-detection | —Unverified | 0 |
| Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | Mar 10, 2025 | Autonomous Drivingobject-detection | —Unverified | 0 |
| SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements | Mar 10, 2025 | Objectobject-detection | CodeCode Available | 1 |
| Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | Mar 10, 2025 | Hallucinationobject-detection | —Unverified | 0 |
| Semantic Communications with Computer Vision Sensing for Edge Video Transmission | Mar 10, 2025 | object-detectionObject Detection | —Unverified | 0 |
| RS2AD: End-to-End Autonomous Driving Data Generation from Roadside Sensor Observations | Mar 10, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | Mar 9, 2025 | image-classificationImage Classification | —Unverified | 0 |
| SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts | Mar 9, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | Mar 9, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Mar 9, 2025 | Domain GeneralizationObject Detection | CodeCode Available | 4 |
| AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection | Mar 9, 2025 | Backdoor AttackMulti-Task Learning | CodeCode Available | 0 |
| From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | Mar 8, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Accurate and Efficient Two-Stage Gun Detection in Video | Mar 8, 2025 | Anomaly DetectionObject | —Unverified | 0 |
| Improving SAM for Camouflaged Object Detection via Dual Stream Adapters | Mar 8, 2025 | Knowledge Distillationobject-detection | —Unverified | 0 |