| VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning | May 17, 2025 | 2D Object Detection, Object Counting | Code Available | 4 |
| GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* | Apr 15, 2025 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task | Apr 15, 2025 | 2D Object Detection, Object | Unverified | 0 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 |
| 2D Object Detection: A Survey | Mar 7, 2025 | 2D Object Detection, Object | Unverified | 0 |
| AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Mar 3, 2025 | 2D Object Detection, 3D Reconstruction | Code Available | 0 |
| WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation | Feb 27, 2025 | 2D Object Detection, Object Detection | Code Available | 0 |
| A Diffusion Model and Knowledge Distillation Framework for Robust Coral Detection in Complex Underwater Environments | Jan 6, 2025 | 2D Object Detection, Knowledge Distillation | Code Available | 0 |
| Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection | Dec 20, 2024 | 2D Object Detection, Image Enhancement | Code Available | 1 |
| SCoralDet: Efficient real-time underwater soft coral detection with YOLO | Dec 16, 2024 | 2D Object Detection, object-detection | Code Available | 2 |
| Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis | Dec 10, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos | Dec 1, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| WARLearn: Weather-Adaptive Representation Learning | Nov 21, 2024 | 2D Object Detection, Adversarial Robustness | Code Available | 0 |
| YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object Detection, Autonomous Driving | Code Available | 0 |
| How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? | Oct 21, 2024 | 2D Object Detection, Domain Generalization | Unverified | 0 |
| Pediatric Wrist Fracture Detection Using Feature Context Excitation Modules in X-ray Images | Oct 1, 2024 | 2D Object Detection, Fracture detection | Code Available | 0 |
| UAVDB: Trajectory-Guided Adaptable Bounding Boxes for UAV Detection | Sep 9, 2024 | 2D Object Detection, Diversity | Code Available | 1 |
| Real-Time Dynamic Scale-Aware Fusion Detection Network: Take Road Damage Detection as an example | Sep 4, 2024 | 2D Object Detection, object-detection | Code Available | 0 |
| Relaxed Rotational Equivariance via G-Biases in Vision | Aug 22, 2024 | 2D Object Detection, object-detection | Code Available | 0 |
| SBDet: A Symmetry-Breaking Object Detector via Relaxed Rotation-Equivariance | Aug 21, 2024 | 2D Object Detection, image-classification | Unverified | 0 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object Detection, Code Generation | Code Available | 1 |
| Underwater Soft Coral Detection: SCoralNet for Accurate and Efficient Annotation. | Aug 1, 2024 | 2D Object Detection | Unverified | 0 |
| RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies | Jul 20, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| TXL-PBC: a freely accessible labeled peripheral blood cell dataset | Jul 18, 2024 | 2D Object Detection | Code Available | 1 |
| PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Jul 16, 2024 | 2D Object Detection, Computational Efficiency | Unverified | 0 |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Jul 16, 2024 | 2D Object Detection, object-detection | Code Available | 3 |
| Wavelet Convolutions for Large Receptive Fields | Jul 8, 2024 | 2D Object Detection, 2D Semantic Segmentation | Code Available | 4 |
| DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images | Jun 5, 2024 | 2D Object Detection, Denoising | Code Available | 4 |
| Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Jun 4, 2024 | 2D Object Detection, 3D Instance Segmentation | Code Available | 3 |
| YOLOv10: Real-Time End-to-End Object Detection | May 23, 2024 | 2D Object Detection, Data Augmentation | Code Available | 11 |
| SARATR-X: Toward Building A Foundation Model for SAR Target Recognition | May 15, 2024 | 2D Object Detection, Earth Observation | Code Available | 3 |
| BadFusion: 2D-Oriented Backdoor Attacks against 3D Object Detection | May 6, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns | Apr 11, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers | Apr 5, 2024 | 2D Object Detection, 2D Tiny Object Detection | Code Available | 1 |
| MosquitoFusion: A Multiclass Dataset for Real-Time Detection of Mosquitoes, Swarms, and Breeding Sites Using Deep Learning | Apr 1, 2024 | 2D Object Detection | Code Available | 0 |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Mar 24, 2024 | 2D Object Detection, Computational Efficiency | Code Available | 3 |
| EffiPerception: an Efficient Framework for Various Perception Tasks | Mar 18, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection, 3D Object Detection | Code Available | 1 |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Mar 11, 2024 | 2D Object Detection, 2k | Code Available | 4 |
| Run-time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations | Mar 2, 2024 | 2D Object Detection, Object | Unverified | 0 |
| MSU-4S - The Michigan State University Four Seasons Dataset | Jan 1, 2024 | 2D Object Detection, Autonomous Driving | Unverified | 0 |
| Weakly Misalignment-free Adaptive Feature Alignment for UAVs-based Multimodal Object Detection | Jan 1, 2024 | 2D Object Detection, Object | Unverified | 0 |
| First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria | Dec 19, 2023 | 2D Object Detection, Autonomous Driving | Unverified | 0 |
| RadioGalaxyNET: Dataset and Novel Computer Vision Algorithms for the Detection of Extended Radio Galaxies and Infrared Hosts | Dec 1, 2023 | 2D Object Detection, object-detection | Code Available | 0 |
| Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection | Nov 19, 2023 | 2D Object Detection, DeepFake Detection | Code Available | 3 |
| Globular Cluster Detection in M33 Using Multiple Views Representation Learning | Nov 15, 2023 | 2D Object Detection, Deep Learning | Code Available | 0 |
| Topology-Matching Normalizing Flows for Out-of-Distribution Detection in Robot Learning | Nov 11, 2023 | 2D Object Detection, Density Estimation | Unverified | 0 |
| PrObeD: Proactive Object Detection Wrapper | Oct 28, 2023 | 2D Object Detection, Decoder | Code Available | 0 |
| YOLO-BEV: Generating Bird's-Eye View in the Same Way as 2D Object Detection | Oct 26, 2023 | 2D Object Detection, Autonomous Driving | Unverified | 0 |
| OpenAgents: An Open Platform for Language Agents in the Wild | Oct 16, 2023 | 2D Object Detection | Code Available | 4 |