| OoDIS: Anomaly Instance Segmentation Benchmark | Jun 17, 2024 | Anomaly Instance Segmentation, Anomaly Segmentation | Code Available | 1 |
| YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism | Jun 17, 2024 | Computational Efficiency, Object Detection | Code Available | 1 |
| DenoiseRep: Denoising Model for Representation Learning | Jun 13, 2024 | Denoising, Fine-Grained Image Classification | Code Available | 1 |
| Towards Evaluating the Robustness of Visual State Space Models | Jun 13, 2024 | Adversarial Robustness, Object Detection | Code Available | 1 |
| MWIRSTD: A MWIR Small Target Detection Dataset | Jun 12, 2024 | Object Detection | Code Available | 1 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data Augmentation, Object | Code Available | 1 |
| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jun 11, 2024 | Grounded Multimodal Named Entity Recognition, Named Entity Recognition | Code Available | 1 |
| Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection | Jun 11, 2024 | Object Detection | Code Available | 1 |
| UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Jun 10, 2024 | Object, Object Detection | Code Available | 1 |
| Scaling Graph Convolutions for Mobile Vision | Jun 9, 2024 | Graph Attention, Graph Neural Network | Code Available | 1 |
| SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention | Jun 9, 2024 | Image Segmentation, Object Detection | Code Available | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | Object Detection | Code Available | 1 |
| Frequency-based Matcher for Long-tailed Semantic Segmentation | Jun 6, 2024 | Autonomous Driving, Object Detection | Code Available | 1 |
| Instance Segmentation and Teeth Classification in Panoramic X-rays | Jun 6, 2024 | Instance Segmentation, Object Detection | Code Available | 1 |
| Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark | Jun 3, 2024 | Object Detection | Code Available | 1 |
| Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection | Jun 3, 2024 | Object Detection | Code Available | 1 |
| RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection | May 30, 2024 | Image Captioning, Image Inpainting | Code Available | 1 |
| On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines | May 30, 2024 | Object Detection | Code Available | 1 |
| Learning Shared RGB-D Fields: Unified Self-supervised Pre-training for Label-efficient LiDAR-Camera 3D Perception | May 28, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | May 28, 2024 | Image Classification | Code Available | 1 |
| OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | May 28, 2024 | Contrastive Learning, Denoising | Code Available | 1 |
| OED: Towards One-stage End-to-End Dynamic Scene Graph Generation | May 27, 2024 | Graph Generation, Object Detection | Code Available | 1 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Rethinking Early-Fusion Strategies for Improved Multispectral Object Detection | May 25, 2024 | Knowledge Distillation, Multispectral Object Detection | Code Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPU, Defect Detection | Code Available | 1 |