| Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar | Oct 16, 2024 | Domain Generalizationobject-detection | —Unverified | 0 |
| Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond | Oct 16, 2024 | Image RestorationImage Segmentation | —Unverified | 0 |
| Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Oct 16, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Context-Infused Visual Grounding for Art | Oct 16, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Oct 16, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Oct 16, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| MambaBEV: An efficient 3D detection model with Mamba2 | Oct 16, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Real-time Stereo-based 3D Object Detection for Streaming Perception | Oct 16, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SAM-Guided Masked Token Prediction for 3D Scene Understanding | Oct 16, 2024 | 3D Object DetectionKnowledge Distillation | —Unverified | 0 |
| Mixture of Scale Experts for Alignment-free RGBT Video Object Detection and A Unified Benchmark | Oct 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental LearningObject | CodeCode Available | 2 |
| Multiview Scene Graph | Oct 15, 2024 | DecoderObject | CodeCode Available | 2 |
| CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Oct 15, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 0 |
| Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training | Oct 15, 2024 | Edge-computingobject-detection | —Unverified | 0 |
| POLO -- Point-based, multi-class animal detection | Oct 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection | Oct 15, 2024 | Data AugmentationDefect Detection | —Unverified | 0 |
| TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Oct 15, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Fractal Calibration for long-tailed object detection | Oct 15, 2024 | Instance SegmentationLong-tailed Object Detection | CodeCode Available | 0 |
| SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection | Oct 15, 2024 | Contrastive Learningobject-detection | —Unverified | 0 |
| Developing Gridded Emission Inventory from High-Resolution Satellite Object Detection for Improved Air Quality Forecasts | Oct 14, 2024 | object-detectionObject Detection | —Unverified | 0 |
| UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Oct 14, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Oct 14, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 0 |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Oct 14, 2024 | Adversarial RobustnessObject | CodeCode Available | 0 |
| Learning to Ground VLMs without Forgetting | Oct 14, 2024 | DecoderLanguage Modelling | —Unverified | 0 |
| ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Oct 14, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |