| YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray Images | Mar 17, 2024 | Data AugmentationFracture detection | CodeCode Available | 1 |
| GRA: Detecting Oriented Objects through Group-wise Rotating and Attention | Mar 17, 2024 | Objectobject-detection | —Unverified | 0 |
| V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions | Mar 17, 2024 | 3D Object DetectionDomain Generalization | —Unverified | 0 |
| FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation | Mar 16, 2024 | Classificationobject-detection | —Unverified | 0 |
| HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection | Mar 16, 2024 | channel selectionobject-detection | CodeCode Available | 2 |
| Detection of Fast-Moving Objects with Neuromorphic Hardware | Mar 15, 2024 | GPUMoving Object Detection | —Unverified | 0 |
| Cannabis Seed Variant Detection using Faster R-CNN | Mar 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Mar 15, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception | Mar 15, 2024 | 3D Lane Detection3D Object Detection | —Unverified | 0 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network | Mar 15, 2024 | DescriptiveInformativeness | —Unverified | 0 |
| A Hybrid SNN-ANN Network for Event-based Object Detection with Spatial and Temporal Attention | Mar 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors | Mar 14, 2024 | BenchmarkingDomain Adaptation | CodeCode Available | 0 |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Mar 14, 2024 | Autonomous DrivingObject | CodeCode Available | 2 |
| Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring | Mar 14, 2024 | ObjectObject Counting | —Unverified | 0 |
| SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph | Mar 14, 2024 | Graph AttentionObject | —Unverified | 0 |
| D-YOLO a robust framework for object detection in adverse weather conditions | Mar 14, 2024 | Image Restorationobject-detection | —Unverified | 0 |
| Improving Distant 3D Object Detection Using 2D Box Supervision | Mar 14, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest | Mar 14, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection | Mar 14, 2024 | Knowledge DistillationNovel Object Detection | CodeCode Available | 2 |
| D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object Detection | Mar 14, 2024 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization | Mar 14, 2024 | Contrastive LearningKnowledge Distillation | —Unverified | 0 |
| CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow | Mar 13, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| FogGuard: guarding YOLO against fog using perceptual loss | Mar 13, 2024 | Autonomous DrivingDomain Adaptation | CodeCode Available | 0 |
| Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks | Mar 13, 2024 | image-classificationImage Classification | —Unverified | 0 |
| MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Mar 13, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| FieldNet: Efficient Real-Time Shadow Removal for Enhanced Vision in Field Robotics | Mar 13, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product | Mar 13, 2024 | Emotion Recognitionobject-detection | —Unverified | 0 |
| Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks | Mar 13, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions | Mar 13, 2024 | Instance SegmentationObject Detection | CodeCode Available | 3 |
| Aedes aegypti Egg Counting with Neural Networks for Object Detection | Mar 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection | Mar 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution | Mar 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection | Mar 12, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection | Mar 12, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 0 |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Mar 12, 2024 | Autonomous DrivingConformal Prediction | CodeCode Available | 1 |
| JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection | Mar 12, 2024 | Motion CompensationMoving Object Detection | —Unverified | 0 |
| A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Mar 12, 2024 | Autonomous DrivingDecoder | —Unverified | 0 |
| Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference | Mar 12, 2024 | GPUobject-detection | —Unverified | 0 |
| Inception-YOLO: Computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modules | Mar 11, 2024 | Medical Image Analysisobject-detection | —Unverified | 0 |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Mar 11, 2024 | 3D Object DetectionObject | CodeCode Available | 2 |
| Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies | Mar 11, 2024 | BenchmarkingData Augmentation | CodeCode Available | 0 |
| Genetic Learning for Designing Sim-to-Real Data Augmentations | Mar 11, 2024 | Image Augmentationobject-detection | CodeCode Available | 0 |
| Evaluating the Energy Efficiency of Few-Shot Learning for Object Detection in Industrial Settings | Mar 11, 2024 | Few-Shot Learningobject-detection | —Unverified | 0 |
| LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations | Mar 11, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers | Mar 11, 2024 | Domain Adaptationobject-detection | CodeCode Available | 0 |
| Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection | Mar 11, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 1 |
| Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head | Mar 11, 2024 | Object DetectionOpen-vocabulary object detection | CodeCode Available | 5 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Mar 11, 2024 | 2D Object Detection2k | CodeCode Available | 4 |