| YOLO11 to Its Genesis: A Decadal and Comprehensive Review of The You Only Look Once (YOLO) Series | Jun 12, 2024 | Computational Efficiencyobject-detection | —Unverified | 0 |
| I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors | Jun 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| MWIRSTD: A MWIR Small Target Detection Dataset | Jun 12, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer | Jun 12, 2024 | 3D Object DetectionDecoder | CodeCode Available | 0 |
| Transformation-Dependent Adversarial Attacks | Jun 12, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing | Jun 12, 2024 | 3D Object DetectionAutonomous Navigation | CodeCode Available | 0 |
| A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Advancing Roadway Sign Detection with YOLO Models and Transfer Learning | Jun 11, 2024 | object-detectionObject Detection | —Unverified | 0 |
| A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7 | Jun 11, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Jun 11, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Jun 11, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 2 |
| Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jun 11, 2024 | Grounded Multimodal Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 1 |
| LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jun 11, 2024 | 3D Semantic SegmentationAutonomous Driving | —Unverified | 0 |
| RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks | Jun 11, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection | Jun 11, 2024 | Knowledge Distillationobject-detection | —Unverified | 0 |
| Understanding Visual Concepts Across Models | Jun 11, 2024 | Image Generationobject-detection | CodeCode Available | 0 |
| Unsupervised Object Detection with Theoretical Guarantees | Jun 11, 2024 | DecoderObject | —Unverified | 0 |
| Real-Time Automated donning and doffing detection of PPE based on Yolov4-tiny | Jun 10, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery | Jun 10, 2024 | Graph Generationobject-detection | —Unverified | 0 |
| Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024 | Jun 10, 2024 | Language Modellingobject-detection | —Unverified | 0 |
| UnSupDLA: Towards Unsupervised Document Layout Analysis | Jun 10, 2024 | DiversityDocument Layout Analysis | —Unverified | 0 |
| UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Jun 10, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Scaling Graph Convolutions for Mobile Vision | Jun 9, 2024 | Graph AttentionGraph Neural Network | CodeCode Available | 1 |
| A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Jun 9, 2024 | Contrastive LearningDenoising | CodeCode Available | 2 |
| Mamba YOLO: A Simple Baseline for Object Detection with State Space Model | Jun 9, 2024 | GPUMamba | CodeCode Available | 4 |
| SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving | Jun 9, 2024 | Autonomous DrivingMultiple Object Tracking | —Unverified | 0 |
| Utilizing Grounded SAM for self-supervised frugal camouflaged human detection | Jun 9, 2024 | Human DetectionObject | —Unverified | 0 |
| SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention | Jun 9, 2024 | Image Segmentationobject-detection | CodeCode Available | 1 |
| ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Jun 9, 2024 | Autonomous DrivingMultiple Object Tracking | —Unverified | 0 |
| Spiking Neural Networks with Consistent Mapping Relations Allow High-Accuracy Inference | Jun 8, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Select-Mosaic: Data Augmentation Method for Dense Small Object Scenes | Jun 8, 2024 | Data AugmentationDiversity | CodeCode Available | 0 |
| Real-time object detection and tracking using flash LiDAR imagery | Jun 7, 2024 | 3D Object ClassificationObject | —Unverified | 0 |
| Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment | Jun 7, 2024 | DecoderInstance Segmentation | —Unverified | 0 |
| UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Jun 7, 2024 | 3D Object DetectionManagement | —Unverified | 0 |
| IOR: Inversed Objects Replay for Incremental Object Detection | Jun 7, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection | Jun 7, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Cut-and-Paste with Precision: a Content and Perspective-aware Data Augmentation for Road Damage Detection | Jun 6, 2024 | Data AugmentationObject | —Unverified | 0 |
| DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Jun 6, 2024 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Semmeldetector: Application of Machine Learning in Commercial Bakeries | Jun 6, 2024 | Image Augmentationobject-detection | —Unverified | 0 |
| Parameter-Inverted Image Pyramid Networks | Jun 6, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Frequency-based Matcher for Long-tailed Semantic Segmentation | Jun 6, 2024 | Autonomous Drivingobject-detection | CodeCode Available | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Instance Segmentation and Teeth Classification in Panoramic X-rays | Jun 6, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of Vehicles | Jun 5, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Situation Monitor: Diversity-Driven Zero-Shot Out-of-Distribution Detection using Budding Ensemble Architecture for Object Detection | Jun 5, 2024 | Autonomous DrivingDiversity | —Unverified | 0 |
| Global Clipper: Enhancing Safety and Reliability of Transformer-based Object Detection Models | Jun 5, 2024 | Autonomous Vehiclesobject-detection | —Unverified | 0 |
| LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection | Jun 5, 2024 | Decoderobject-detection | CodeCode Available | 9 |
| Enhanced Automotive Object Detection via RGB-D Fusion in a DiffusionDet Framework | Jun 5, 2024 | Autonomous Drivingobject-detection | —Unverified | 0 |