| Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Feb 23, 2025 | Object Detection | Unverified | 0 |
| MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering | Feb 23, 2025 | Object, Object Detection | Unverified | 0 |
| FeatSharp: Your Vision Model Features, Sharper | Feb 22, 2025 | model, Object Detection | Unverified | 0 |
| Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection | Feb 21, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Generative AI Framework for 3D Object Generation in Augmented Reality | Feb 21, 2025 | 3D Generation, Object Detection | Unverified | 0 |
| Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection | Feb 21, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph Generation | Feb 21, 2025 | Common Sense Reasoning, Graph Generation | Code Available | 0 |
| Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios | Feb 20, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 0 |
| YOLOv12: A Breakdown of the Key Architectural Features | Feb 20, 2025 | Computational Efficiency, Object Detection | Unverified | 0 |
| ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Feb 20, 2025 | Autonomous Driving, Object | Unverified | 0 |
| LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | Feb 20, 2025 | 3D Object Detection, Depth Estimation | Unverified | 0 |
| MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection | Feb 19, 2025 | Object, Object Detection | Unverified | 0 |
| Image compositing is all you need for data augmentation | Feb 19, 2025 | All, Data Augmentation | Unverified | 0 |
| GroundCap: A Visually Grounded Image Captioning Dataset | Feb 19, 2025 | Image Captioning, Object Detection | Unverified | 0 |
| An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice | Feb 19, 2025 | Object Detection | Unverified | 0 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | Image Classification | Code Available | 2 |
| Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation | Feb 18, 2025 | Object Detection | Unverified | 0 |
| RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection | Feb 18, 2025 | 3D Object Detection, Object | Unverified | 0 |
| Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Feb 18, 2025 | 3D Object Detection, Object Detection | Unverified | 0 |
| CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Feb 17, 2025 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection | Feb 17, 2025 | 6D Pose Estimation using RGB, Edge Detection | Unverified | 0 |
| DA-Mamba: Domain Adaptive Hybrid Mamba-Transformer Based One-Stage Object Detection | Feb 16, 2025 | Domain Adaptation, Knowledge Distillation | Code Available | 1 |
| CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | Feb 15, 2025 | Denoising, Knowledge Distillation | Unverified | 0 |
| Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Feb 14, 2025 | 3D Object Detection, 3D visual grounding | Code Available | 3 |
| Object Detection and Tracking | Feb 14, 2025 | Deep Learning, Object | Code Available | 0 |
| Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection | Feb 13, 2025 | Object Detection | Unverified | 0 |
| Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection | Feb 13, 2025 | Object Detection | Code Available | 1 |
| Instance Segmentation of Scene Sketches Using Natural Image Priors | Feb 13, 2025 | Image Segmentation, Instance Segmentation | Unverified | 0 |
| Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception | Feb 12, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Knowledge Swapping via Learning and Unlearning | Feb 12, 2025 | Image Classification | Code Available | 0 |
| Plantation Monitoring Using Drone Images: A Dataset and Performance Review | Feb 12, 2025 | Object Detection | Unverified | 0 |
| SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Feb 12, 2025 | Earth Observation, Object Detection | Code Available | 2 |
| Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation | Feb 12, 2025 | Image Reconstruction, Object Detection | Unverified | 0 |
| Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection | Feb 12, 2025 | Object Detection | Code Available | 0 |
| A Survey on Mamba Architecture for Vision Applications | Feb 11, 2025 | Mamba, Object Detection | Unverified | 0 |
| SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer | Feb 11, 2025 | Object Detection | Unverified | 0 |
| Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Feb 11, 2025 | Autonomous Driving, Computational Efficiency | Unverified | 0 |
| Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m | Feb 11, 2025 | Object Detection | Unverified | 0 |
| Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems | Feb 11, 2025 | Image Reconstruction, Object Detection | Code Available | 0 |
| Dense Object Detection Based on De-homogenized Queries | Feb 11, 2025 | Dense Object Detection, Object | Unverified | 0 |
| Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection | Feb 10, 2025 | Image Classification | Code Available | 0 |
| From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning | Feb 9, 2025 | Object Detection | Unverified | 0 |
| Secure Visual Data Processing via Federated Learning | Feb 9, 2025 | Federated Learning, Management | Unverified | 0 |
| Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector | Feb 8, 2025 | Incremental Learning, Knowledge Distillation | Unverified | 0 |
| AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Feb 7, 2025 | Image Classification | Unverified | 0 |
| Counting Fish with Temporal Representations of Sonar Video | Feb 7, 2025 | Object Detection | Unverified | 0 |
| MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection | Feb 7, 2025 | Object Detection | Code Available | 2 |
| DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection | Feb 7, 2025 | 3D Object Detection, Object Detection | Unverified | 0 |
| LP-DETR: Layer-wise Progressive Relations for Object Detection | Feb 7, 2025 | Decoder, Object | Unverified | 0 |
| A Performance Analysis of You Only Look Once Models for Deployment on Constrained Computational Edge Devices in Drone Applications | Feb 6, 2025 | NVIDIA Jetson Orin Nano, Object Detection | Unverified | 0 |