| MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Mar 13, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| FieldNet: Efficient Real-Time Shadow Removal for Enhanced Vision in Field Robotics | Mar 13, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product | Mar 13, 2024 | Emotion Recognitionobject-detection | —Unverified | 0 |
| Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks | Mar 13, 2024 | object-detectionObject Detection | —Unverified | 0 |
| ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions | Mar 13, 2024 | Instance SegmentationObject Detection | CodeCode Available | 3 |
| Aedes aegypti Egg Counting with Neural Networks for Object Detection | Mar 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection | Mar 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution | Mar 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection | Mar 12, 2024 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection | Mar 12, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 0 |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Mar 12, 2024 | Autonomous DrivingConformal Prediction | CodeCode Available | 1 |
| JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection | Mar 12, 2024 | Motion CompensationMoving Object Detection | —Unverified | 0 |
| A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Mar 12, 2024 | Autonomous DrivingDecoder | —Unverified | 0 |
| Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference | Mar 12, 2024 | GPUobject-detection | —Unverified | 0 |
| Inception-YOLO: Computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modules | Mar 11, 2024 | Medical Image Analysisobject-detection | —Unverified | 0 |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Mar 11, 2024 | 3D Object DetectionObject | CodeCode Available | 2 |
| Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies | Mar 11, 2024 | BenchmarkingData Augmentation | CodeCode Available | 0 |
| Genetic Learning for Designing Sim-to-Real Data Augmentations | Mar 11, 2024 | Image Augmentationobject-detection | CodeCode Available | 0 |
| Evaluating the Energy Efficiency of Few-Shot Learning for Object Detection in Industrial Settings | Mar 11, 2024 | Few-Shot Learningobject-detection | —Unverified | 0 |
| LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations | Mar 11, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers | Mar 11, 2024 | Domain Adaptationobject-detection | CodeCode Available | 0 |
| Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection | Mar 11, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 1 |
| Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head | Mar 11, 2024 | Object DetectionOpen-vocabulary object detection | CodeCode Available | 5 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection3D Object Detection | CodeCode Available | 1 |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Mar 11, 2024 | 2D Object Detection2k | CodeCode Available | 4 |