| Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar | Oct 16, 2024 | Domain Generalizationobject-detection | —Unverified | 0 |
| Context-Infused Visual Grounding for Art | Oct 16, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond | Oct 16, 2024 | Image RestorationImage Segmentation | —Unverified | 0 |
| Mixture of Scale Experts for Alignment-free RGBT Video Object Detection and A Unified Benchmark | Oct 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Oct 16, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Oct 16, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Oct 16, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Real-time Stereo-based 3D Object Detection for Streaming Perception | Oct 16, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| MambaBEV: An efficient 3D detection model with Mamba2 | Oct 16, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| SAM-Guided Masked Token Prediction for 3D Scene Understanding | Oct 16, 2024 | 3D Object DetectionKnowledge Distillation | —Unverified | 0 |
| Fractal Calibration for long-tailed object detection | Oct 15, 2024 | Instance SegmentationLong-tailed Object Detection | CodeCode Available | 0 |
| TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Oct 15, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection | Oct 15, 2024 | Contrastive Learningobject-detection | —Unverified | 0 |
| YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection | Oct 15, 2024 | Data AugmentationDefect Detection | —Unverified | 0 |
| CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Oct 15, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 0 |
| Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training | Oct 15, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| POLO -- Point-based, multi-class animal detection | Oct 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental LearningObject | CodeCode Available | 2 |
| Multiview Scene Graph | Oct 15, 2024 | DecoderObject | CodeCode Available | 2 |
| Developing Gridded Emission Inventory from High-Resolution Satellite Object Detection for Improved Air Quality Forecasts | Oct 14, 2024 | object-detectionObject Detection | —Unverified | 0 |
| UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Oct 14, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Learning to Ground VLMs without Forgetting | Oct 14, 2024 | DecoderLanguage Modelling | —Unverified | 0 |
| ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Oct 14, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| GlobalMamba: Global Image Serialization for Vision Mamba | Oct 14, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Oct 14, 2024 | Adversarial RobustnessObject | CodeCode Available | 0 |
| V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Oct 14, 2024 | Instance SegmentationMamba | CodeCode Available | 1 |
| ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Oct 14, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 0 |
| Optimizing Waste Management with Advanced Object Detection for Garbage Classification | Oct 13, 2024 | Managementobject-detection | —Unverified | 0 |
| EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition | Oct 13, 2024 | Action Recognitionobject-detection | —Unverified | 0 |
| Distributed Intelligent Video Surveillance for Early Armed Robbery Detection based on Deep Learning | Oct 13, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Oct 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Token Pruning using a Lightweight Background Aware Vision Transformer | Oct 12, 2024 | object-detectionObject Detection | —Unverified | 0 |
| DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Oct 11, 2024 | General Knowledgeobject-detection | CodeCode Available | 1 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text RecognitionHTR | CodeCode Available | 1 |
| MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Oct 11, 2024 | Autonomous Drivingobject-detection | —Unverified | 0 |
| DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Oct 11, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection | Oct 11, 2024 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 0 |
| Boosting Open-Vocabulary Object Detection by Handling Background Samples | Oct 11, 2024 | object-detectionObject Detection | —Unverified | 0 |
| VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Oct 11, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Oct 10, 2024 | 3D Semantic SegmentationAutonomous Driving | —Unverified | 0 |
| PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Oct 10, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Oct 10, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out | Oct 10, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Self-Supervised Learning for Real-World Object Detection: a Survey | Oct 9, 2024 | Objectobject-detection | —Unverified | 0 |
| Robust infrared small target detection using self-supervised and a contrario paradigms | Oct 9, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Progressive Multi-Modal Fusion for Robust 3D Object Detection | Oct 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Rethinking the Evaluation of Visible and Infrared Image Fusion | Oct 9, 2024 | object-detectionObject Detection | CodeCode Available | 3 |
| QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation | Oct 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Oct 9, 2024 | image-classificationImage Classification | CodeCode Available | 1 |