| General Object Foundation Model for Images and Videos at Scale | Dec 14, 2023 | Instance SegmentationLong-tail Video Object Segmentation | CodeCode Available | 3 |
| Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy | Dec 14, 2023 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Dec 14, 2023 | Autonomous NavigationMulti-Task Learning | CodeCode Available | 1 |
| Learned Fusion: 3D Object Detection using Calibration-Free Transformer Feature Fusion | Dec 14, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Exploration of visual prompt in Grounded pre-trained open-set detection | Dec 14, 2023 | object-detectionObject Detection | —Unverified | 0 |
| SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Dec 14, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection | Dec 13, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques | Dec 13, 2023 | Active LearningContent-Based Image Retrieval | —Unverified | 0 |
| Challenges of YOLO Series for Object Detection in Extremely Heavy Rain: CALRA Simulator based Synthetic Evaluation Dataset | Dec 13, 2023 | Autonomous Vehiclesobject-detection | —Unverified | 0 |
| An Invitation to Deep Reinforcement Learning | Dec 13, 2023 | Code GenerationDeep Reinforcement Learning | —Unverified | 0 |
| PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection | Dec 13, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning | Dec 13, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection | Dec 12, 2023 | cross-modal alignmentobject-detection | —Unverified | 0 |
| What, How, and When Should Object Detectors Update in Continually Changing Test Domains? | Dec 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| Teaching Unknown Objects by Leveraging Human Gaze and Augmented Reality in Human-Robot Interaction | Dec 12, 2023 | object-detectionObject Detection | —Unverified | 0 |
| MedYOLO: A Medical Image Object Detection Framework | Dec 12, 2023 | Computed Tomography (CT)Object | CodeCode Available | 1 |
| Lightweight high-resolution Subject Matting in the Real World | Dec 12, 2023 | Image Mattingobject-detection | —Unverified | 0 |
| IA2U: A Transfer Plugin with Multi-Prior for In-Air Model to Underwater | Dec 12, 2023 | Image Enhancementobject-detection | —Unverified | 0 |
| CholecTrack20: A Dataset for Multi-Class Multiple Tool Tracking in Laparoscopic Surgery | Dec 12, 2023 | Intracorporeal TrackingIntraoperative Tracking | CodeCode Available | 1 |
| Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects | Dec 12, 2023 | Camouflaged Object Segmentation with a Single Task-generic Promptobject-detection | CodeCode Available | 1 |
| MaxQ: Multi-Axis Query for N:M Sparsity Network | Dec 12, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection | Dec 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| Edge Wasserstein Distance Loss for Oriented Object Detection | Dec 12, 2023 | Objectobject-detection | —Unverified | 0 |
| Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Dec 12, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Dec 12, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Mixed Pseudo Labels for Semi-Supervised Object Detection | Dec 12, 2023 | Objectobject-detection | CodeCode Available | 1 |
| ADOD: Adaptive Domain-Aware Object Detection with Residual Attention for Underwater Environments | Dec 11, 2023 | domain classificationDomain Generalization | CodeCode Available | 0 |
| A Multimodal Dataset and Benchmark for Radio Galaxy and Infrared Host Detection | Dec 11, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| SqueezeSAM: User friendly mobile interactive segmentation | Dec 11, 2023 | Data AugmentationInteractive Segmentation | —Unverified | 0 |
| User Friendly and Adaptable Discriminative AI: Using the Lessons from the Success of LLMs and Image Generation Models | Dec 11, 2023 | Image Generationobject-detection | —Unverified | 0 |
| Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops | Dec 11, 2023 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| SimMining-3D: Altitude-Aware 3D Object Detection in Complex Mining Environments: A Novel Dataset and ROS-Based Automatic Annotation Pipeline | Dec 11, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection | Dec 11, 2023 | Density EstimationObject | CodeCode Available | 0 |
| Investigating YOLO Models Towards Outdoor Obstacle Detection For Visually Impaired People | Dec 10, 2023 | object-detectionObject Detection | —Unverified | 0 |
| AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One | Dec 10, 2023 | AllBenchmarking | CodeCode Available | 3 |
| Dynamic Adversarial Attacks on Autonomous Driving Systems | Dec 10, 2023 | Adversarial AttackAutonomous Driving | CodeCode Available | 0 |
| Open World Object Detection in the Era of Foundation Models | Dec 10, 2023 | Medical Image AnalysisObject | —Unverified | 0 |
| Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains | Dec 10, 2023 | Fault Detectionobject-detection | CodeCode Available | 0 |
| Immature Green Apple Detection and Sizing in Commercial Orchards using YOLOv8 and Shape Fitting Techniques | Dec 8, 2023 | Instance SegmentationManagement | —Unverified | 0 |
| Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects | Dec 8, 2023 | Image Captioningobject-detection | —Unverified | 0 |
| 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Dec 8, 2023 | 3D Object DetectionData Augmentation | CodeCode Available | 1 |
| SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles | Dec 8, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Unify Change Point Detection and Segment Classification in a Regression Task for Transportation Mode Identification | Dec 8, 2023 | Change Point DetectionClassification | CodeCode Available | 0 |
| Image and AIS Data Fusion Technique for Maritime Computer Vision Applications | Dec 7, 2023 | Managementobject-detection | CodeCode Available | 1 |
| Stable Diffusion for Data Augmentation in COCO and Weed Datasets | Dec 7, 2023 | Data AugmentationImage Generation | —Unverified | 0 |
| Forcing Generative Models to Degenerate Ones: The Power of Data Poisoning Attacks | Dec 7, 2023 | Data Poisoningobject-detection | —Unverified | 0 |
| Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? | Dec 7, 2023 | BenchmarkingDiversity | —Unverified | 0 |
| Bootstrapping Autonomous Driving Radars with Self-Supervised Learning | Dec 7, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation | Dec 7, 2023 | Contrastive LearningData Augmentation | CodeCode Available | 1 |