| Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR | Mar 3, 2025 | Model Selectionobject-detection | —Unverified | 0 |
| Visual-RFT: Visual Reinforcement Fine-Tuning | Mar 3, 2025 | Few-Shot Object DetectionFine-Grained Image Classification | CodeCode Available | 7 |
| MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Mar 3, 2025 | Object Detection | CodeCode Available | 2 |
| A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data | Mar 2, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Unifying Light Field Perception with Field of Parallax | Mar 2, 2025 | Multi-Task Learningobject-detection | CodeCode Available | 0 |
| UniFa: A unified feature hallucination framework for any-shot object detection | Mar 1, 2025 | Generalized Zero-Shot Object DetectionHallucination | —Unverified | 0 |
| RFWNet: A Lightweight Remote Sensing Object Detector Integrating Multi-Scale Receptive Fields and Foreground Focus Mechanism | Mar 1, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 2025 | Feb 28, 2025 | object-detectionObject Detection | —Unverified | 0 |
| FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection | Feb 28, 2025 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Feb 27, 2025 | Image ClassificationInstance Segmentation | CodeCode Available | 4 |
| Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Feb 27, 2025 | 3D Object DetectionDecoder | —Unverified | 0 |
| BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | Feb 27, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Learning Mask Invariant Mutual Information for Masked Image Modeling | Feb 27, 2025 | Contrastive Learningimage-classification | —Unverified | 0 |
| WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation | Feb 27, 2025 | 2D Object DetectionObject Detection | CodeCode Available | 0 |
| Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies | Feb 26, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10 | Feb 26, 2025 | Benchmarkingobject-detection | —Unverified | 0 |
| Advanced YOLO-based Real-time Power Line Detection for Vegetation Management | Feb 26, 2025 | Line DetectionManagement | —Unverified | 0 |
| Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Feb 26, 2025 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads | Feb 25, 2025 | Autonomous NavigationAutonomous Vehicles | —Unverified | 0 |
| Progressive Local Alignment for Medical Multimodal Pre-training | Feb 25, 2025 | Contrastive LearningImage-text Retrieval | —Unverified | 0 |
| Multi-Perspective Data Augmentation for Few-shot Object Detection | Feb 25, 2025 | Data AugmentationFew-Shot Object Detection | CodeCode Available | 1 |
| LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR | Feb 24, 2025 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Experimental validation of UAV search and detection system in real wilderness environment | Feb 24, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Geometry-Aware 3D Salient Object Detection Network | Feb 23, 2025 | Objectobject-detection | —Unverified | 0 |
| Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment | Feb 23, 2025 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 1 |