| PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection | Aug 11, 2024 | Few-Shot Object Detection, Object Detection | Code Available | 1 |
| Dilated Convolution with Learnable Spacings | Aug 10, 2024 | Audio Classification, Object Detection | Unverified | 0 |
| Advancing Pavement Distress Detection in Developing Countries: A Novel Deep Learning Approach with Locally-Collected Datasets | Aug 10, 2024 | Object Detection | Unverified | 0 |
| DeepInteraction++: Multi-Modality Interaction for Autonomous Driving | Aug 9, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 3 |
| A Recurrent YOLOv8-based framework for Event-Based Object Detection | Aug 9, 2024 | Autonomous Vehicles, Data Augmentation | Unverified | 0 |
| RadarPillars: Efficient Object Detection from 4D Radar Point Clouds | Aug 9, 2024 | 3D Object Detection, Object Detection | Unverified | 0 |
| Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Aug 9, 2024 | Object Detection | Code Available | 3 |
| UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios | Aug 9, 2024 | Benchmarking, Human Detection | Code Available | 1 |
| Data-Driven Pixel Control: Challenges and Prospects | Aug 8, 2024 | Object Detection | Unverified | 0 |
| SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Aug 8, 2024 | Autonomous Vehicles, Object | Code Available | 1 |
| Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Aug 8, 2024 | Object Detection | Code Available | 2 |
| Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework | Aug 8, 2024 | Depth Estimation, Object Detection | Unverified | 0 |
| SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Aug 8, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 5 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Aug 7, 2024 | Decoder, Dense Captioning | Code Available | 0 |
| Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Aug 7, 2024 | Image Classification | Unverified | 0 |
| Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian | Aug 7, 2024 | Autonomous Driving, Object Detection | Code Available | 1 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | Image Classification | Code Available | 2 |
| Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Aug 7, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Aug 7, 2024 | 3D Object Detection, Autonomous Navigation | Code Available | 2 |
| Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Aug 7, 2024 | Image Generation, Object Detection | Code Available | 0 |
| GUI Element Detection Using SOTA YOLO Deep Learning Models | Aug 7, 2024 | 2D Object Detection, Code Generation | Code Available | 1 |
| Biomedical Image Segmentation: A Systematic Literature Review of Deep Learning Based Object Detection Methods | Aug 6, 2024 | Articles, Deep Learning | Unverified | 0 |
| AI Foundation Models in Remote Sensing: A Survey | Aug 6, 2024 | Contrastive Learning, Object Detection | Unverified | 0 |
| Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection | Aug 6, 2024 | Data Augmentation, Diversity | Unverified | 0 |
| HQOD: Harmonious Quantization for Object Detection | Aug 5, 2024 | Object, Object Detection | Code Available | 0 |
| Tensorial template matching for fast cross-correlation with rotations and its application for tomography | Aug 5, 2024 | Object Detection | Unverified | 0 |
| AssemAI: Interpretable Image-Based Anomaly Detection for Manufacturing Pipelines | Aug 5, 2024 | Anomaly Detection, Object Detection | Code Available | 0 |
| KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Aug 4, 2024 | 3D Object Detection, Attribute | Code Available | 0 |
| A Survey and Evaluation of Adversarial Attacks for Object Detection | Aug 4, 2024 | Adversarial Robustness, Autonomous Vehicles | Unverified | 0 |
| CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery | Aug 4, 2024 | Lesion Detection, Medical Object Detection | Code Available | 0 |
| Do You Remember . . . the Future? Weak-to-Strong generalization in 3D Object Detection | Aug 3, 2024 | 3D Object Detection, Knowledge Distillation | Code Available | 0 |
| Supervised Image Translation from Visible to Infrared Domain for Object Detection | Aug 3, 2024 | Generative Adversarial Network, Object | Unverified | 0 |
| Domain penalisation for improved Out-of-Distribution Generalisation | Aug 3, 2024 | Object, Object Detection | Unverified | 0 |
| LAM3D: Leveraging Attention for Monocular 3D Object Detection | Aug 3, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design | Aug 2, 2024 | Model Compression, Neural Network Compression | Unverified | 0 |
| Underwater Object Detection Enhancement via Channel Stabilization | Aug 2, 2024 | Image Enhancement, Object | Code Available | 0 |
| PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | Aug 2, 2024 | 4k, 8k | Unverified | 0 |
| Effect of Fog Particle Size Distribution on 3D Object Detection Under Adverse Weather Conditions | Aug 2, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Aug 2, 2024 | Object, Object Detection | Code Available | 0 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignment, Multiple Object Tracking | Code Available | 2 |
| Complete 3d relationships extraction modality alignment network for 3d dense captioning | Aug 1, 2024 | 3D dense captioning, 3D Object Detection | Unverified | 0 |
| MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection | Aug 1, 2024 | 3D Object Detection, Computational Efficiency | Unverified | 0 |
| MUFASA: Multi-View Fusion and Adaptation Network with Spatial Awareness for Radar Object Detection | Aug 1, 2024 | Autonomous Driving, Object | Unverified | 0 |
| RoCo: Robust Collaborative Perception By Iterative Object Matching and Pose Adjustment | Aug 1, 2024 | Autonomous Driving, Object | Code Available | 1 |
| Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Aug 1, 2024 | 3D Object Detection, Decoder | Unverified | 0 |
| Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection | Aug 1, 2024 | 3D Object Detection, Object Detection | Code Available | 1 |
| DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Aug 1, 2024 | Denoising, Graph Matching | Code Available | 1 |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | Aug 1, 2024 | Data Augmentation, Diversity | Unverified | 0 |
| Dynamic Object Queries for Transformer-based Incremental Object Detection | Jul 31, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Jul 31, 2024 | Language Modelling, Object | Code Available | 1 |