| YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism | Jun 17, 2024 | Computational Efficiencyobject-detection | CodeCode Available | 1 |
| OoDIS: Anomaly Instance Segmentation Benchmark | Jun 17, 2024 | Anomaly Instance SegmentationAnomaly Segmentation | CodeCode Available | 1 |
| Towards Evaluating the Robustness of Visual State Space Models | Jun 13, 2024 | Adversarial Robustnessobject-detection | CodeCode Available | 1 |
| DenoiseRep: Denoising Model for Representation Learning | Jun 13, 2024 | DenoisingFine-Grained Image Classification | CodeCode Available | 1 |
| MWIRSTD: A MWIR Small Target Detection Dataset | Jun 12, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Jun 11, 2024 | Grounded Multimodal Named Entity Recognitionnamed-entity-recognition | CodeCode Available | 1 |
| Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Jun 10, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Scaling Graph Convolutions for Mobile Vision | Jun 9, 2024 | Graph AttentionGraph Neural Network | CodeCode Available | 1 |
| SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention | Jun 9, 2024 | Image Segmentationobject-detection | CodeCode Available | 1 |
| Instance Segmentation and Teeth Classification in Panoramic X-rays | Jun 6, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Jun 6, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Frequency-based Matcher for Long-tailed Semantic Segmentation | Jun 6, 2024 | Autonomous Drivingobject-detection | CodeCode Available | 1 |
| Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark | Jun 3, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Learning Adaptive Fusion Bank for Multi-modal Salient Object Detection | Jun 3, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection | May 30, 2024 | Image CaptioningImage Inpainting | CodeCode Available | 1 |
| On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines | May 30, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | May 28, 2024 | Contrastive LearningDenoising | CodeCode Available | 1 |
| Learning Shared RGB-D Fields: Unified Self-supervised Pre-training for Label-efficient LiDAR-Camera 3D Perception | May 28, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | May 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| OED: Towards One-stage End-to-End Dynamic Scene Graph Generation | May 27, 2024 | Graph Generationobject-detection | CodeCode Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPUDefect Detection | CodeCode Available | 1 |
| Rethinking Early-Fusion Strategies for Improved Multispectral Object Detection | May 25, 2024 | Knowledge DistillationMultispectral Object Detection | CodeCode Available | 1 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes | May 24, 2024 | 3D Object DetectionObject Detection | CodeCode Available | 1 |
| Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment | May 23, 2024 | Decision MakingDomain Generalization | CodeCode Available | 1 |
| MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos | May 23, 2024 | Motion SegmentationObject | CodeCode Available | 1 |
| Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain Adaptation | May 20, 2024 | Domain Adaptationobject-detection | CodeCode Available | 1 |
| FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention | May 19, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| InfRS: Incremental Few-Shot Object Detection in Remote Sensing Images | May 18, 2024 | Few-Shot Object DetectionIncremental Learning | CodeCode Available | 1 |
| Visible and Clear: Finding Tiny Objects in Difference Map | May 18, 2024 | Objectobject-detection | CodeCode Available | 1 |
| A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for Explainability | May 17, 2024 | AttributeDomain Adaptation | CodeCode Available | 1 |
| Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection | May 16, 2024 | Objectobject-detection | CodeCode Available | 1 |
| RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images | May 14, 2024 | 6D Pose EstimationObject | CodeCode Available | 1 |
| Quality-aware Selective Fusion Network for V-D-T Salient Object Detection | May 13, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception | May 12, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 |
| RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection | May 6, 2024 | Computational Efficiencyobject-detection | CodeCode Available | 1 |
| Towards Consistent Object Detection via LiDAR-Camera Synergy | May 2, 2024 | Objectobject-detection | CodeCode Available | 1 |
| UniFS: Universal Few-shot Instance Perception with Point Representations | Apr 30, 2024 | Few-Shot LearningFew-Shot Object Detection | CodeCode Available | 1 |
| MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection | Apr 29, 2024 | Autonomous DrivingMultispectral Object Detection | CodeCode Available | 1 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data VisualizationDecision Making | CodeCode Available | 1 |
| Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection | Apr 27, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection | Apr 25, 2024 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection | Apr 24, 2024 | 3D Object DetectionObject | CodeCode Available | 1 |
| Unified Unsupervised Salient Object Detection via Knowledge Transfer | Apr 23, 2024 | Objectobject-detection | CodeCode Available | 1 |
| The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models | Apr 18, 2024 | Instance SegmentationObject | CodeCode Available | 1 |
| Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Apr 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Camera clustering for scalable stream-based active distillation | Apr 16, 2024 | ClusteringKnowledge Distillation | CodeCode Available | 1 |