| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Dec 20, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Object-Aware Domain Generalization for Object Detection | Dec 19, 2023 | Autonomous DrivingContrastive Learning | CodeCode Available | 1 |
| The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark | Dec 19, 2023 | AnatomyInstance Segmentation | CodeCode Available | 1 |
| CLIM: Contrastive Language-Image Mosaic for Region Representation | Dec 18, 2023 | Objectobject-detection | CodeCode Available | 1 |
| PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection | Dec 16, 2023 | Multi-Task LearningObject | CodeCode Available | 1 |
| Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning | Dec 16, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Simple Image-level Classification Improves Open-vocabulary Object Detection | Dec 16, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection | Dec 16, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Transformers in Unsupervised Structure-from-Motion | Dec 16, 2023 | Decision Makingimage-classification | CodeCode Available | 1 |
| FoMo-Bench: a multi-modal, multi-scale and multi-task Forest Monitoring Benchmark for remote sensing foundation models | Dec 15, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Dec 14, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Dec 14, 2023 | Autonomous NavigationMulti-Task Learning | CodeCode Available | 1 |
| PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection | Dec 13, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection | Dec 13, 2023 | Objectobject-detection | CodeCode Available | 1 |
| What, How, and When Should Object Detectors Update in Continually Changing Test Domains? | Dec 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Dec 12, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects | Dec 12, 2023 | Camouflaged Object Segmentation with a Single Task-generic Promptobject-detection | CodeCode Available | 1 |
| MaxQ: Multi-Axis Query for N:M Sparsity Network | Dec 12, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| MedYOLO: A Medical Image Object Detection Framework | Dec 12, 2023 | Computed Tomography (CT)Object | CodeCode Available | 1 |
| Mixed Pseudo Labels for Semi-Supervised Object Detection | Dec 12, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Dec 12, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| CholecTrack20: A Dataset for Multi-Class Multiple Tool Tracking in Laparoscopic Surgery | Dec 12, 2023 | Intracorporeal TrackingIntraoperative Tracking | CodeCode Available | 1 |
| ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection | Dec 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Dec 8, 2023 | 3D Object DetectionData Augmentation | CodeCode Available | 1 |
| SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles | Dec 8, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |