| YOLO11 to Its Genesis: A Decadal and Comprehensive Review of The You Only Look Once (YOLO) Series | Jun 12, 2024 | Computational Efficiencyobject-detection | —Unverified | 0 |
| I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors | Jun 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer | Jun 12, 2024 | 3D Object DetectionDecoder | CodeCode Available | 0 |
| Transformation-Dependent Adversarial Attacks | Jun 12, 2024 | image-classificationImage Classification | —Unverified | 0 |
| MWIRSTD: A MWIR Small Target Detection Dataset | Jun 12, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing | Jun 12, 2024 | 3D Object DetectionAutonomous Navigation | CodeCode Available | 0 |
| A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Advancing Roadway Sign Detection with YOLO Models and Transfer Learning | Jun 11, 2024 | object-detectionObject Detection | —Unverified | 0 |
| A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7 | Jun 11, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jun 11, 2024 | Grounded Multimodal Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 1 |
| Understanding Visual Concepts Across Models | Jun 11, 2024 | Image Generationobject-detection | CodeCode Available | 0 |
| EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Jun 11, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 2 |
| Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Jun 11, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jun 11, 2024 | 3D Semantic SegmentationAutonomous Driving | —Unverified | 0 |
| Unsupervised Object Detection with Theoretical Guarantees | Jun 11, 2024 | DecoderObject | —Unverified | 0 |
| RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Jun 11, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection | Jun 11, 2024 | Knowledge Distillationobject-detection | —Unverified | 0 |
| Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Real-Time Automated donning and doffing detection of PPE based on Yolov4-tiny | Jun 10, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery | Jun 10, 2024 | Graph Generationobject-detection | —Unverified | 0 |
| UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Jun 10, 2024 | Objectobject-detection | CodeCode Available | 1 |
| UnSupDLA: Towards Unsupervised Document Layout Analysis | Jun 10, 2024 | DiversityDocument Layout Analysis | —Unverified | 0 |
| Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024 | Jun 10, 2024 | Language Modellingobject-detection | —Unverified | 0 |
| A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Jun 9, 2024 | Contrastive LearningDenoising | CodeCode Available | 2 |