| Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Feb 23, 2025 | object-detectionObject Detection | —Unverified | 0 |
| MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering | Feb 23, 2025 | Objectobject-detection | —Unverified | 0 |
| FeatSharp: Your Vision Model Features, Sharper | Feb 22, 2025 | modelobject-detection | —Unverified | 0 |
| Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Feb 21, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Generative AI Framework for 3D Object Generation in Augmented Reality | Feb 21, 2025 | 3D Generationobject-detection | —Unverified | 0 |
| KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph Generation | Feb 21, 2025 | Common Sense ReasoningGraph Generation | CodeCode Available | 0 |
| Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Feb 21, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | Feb 20, 2025 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Feb 20, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Feb 20, 2025 | Autonomous DrivingObject | —Unverified | 0 |
| YOLOv12: A Breakdown of the Key Architectural Features | Feb 20, 2025 | Computational Efficiencyobject-detection | —Unverified | 0 |
| MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection | Feb 19, 2025 | Objectobject-detection | —Unverified | 0 |
| Image compositing is all you need for data augmentation | Feb 19, 2025 | AllData Augmentation | —Unverified | 0 |
| GroundCap: A Visually Grounded Image Captioning Dataset | Feb 19, 2025 | Image CaptioningObject Detection | —Unverified | 0 |
| An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice | Feb 19, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation | Feb 18, 2025 | object-detectionObject Detection | —Unverified | 0 |
| RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection | Feb 18, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Feb 18, 2025 | 3D Object Detectionobject-detection | —Unverified | 0 |
| CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Feb 17, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection | Feb 17, 2025 | 6D Pose Estimation using RGBEdge Detection | —Unverified | 0 |
| DA-Mamba: Domain Adaptive Hybrid Mamba-Transformer Based One-Stage Object Detection | Feb 16, 2025 | Domain AdaptationKnowledge Distillation | CodeCode Available | 1 |
| CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | Feb 15, 2025 | DenoisingKnowledge Distillation | —Unverified | 0 |
| Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Feb 14, 2025 | 3D Object Detection3D visual grounding | CodeCode Available | 3 |
| Object Detection and Tracking | Feb 14, 2025 | Deep LearningObject | CodeCode Available | 0 |