| SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model | Jun 4, 2023 | 3D Object DetectionImage Segmentation | CodeCode Available | 2 | 5 |
| SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Nov 27, 2023 | 6D Pose Estimation using RGBInstance Segmentation | CodeCode Available | 2 | 5 |
| Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Apr 8, 2025 | Domain AdaptationDomain Generalization | CodeCode Available | 2 | 5 |
| DreamColour: Controllable Video Colour Editing without Training | Dec 6, 2024 | Instance SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images | Oct 23, 2023 | 3D ArchitectureImage Segmentation | CodeCode Available | 2 | 5 |
| SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAM | Apr 12, 2023 | Image SegmentationSegmentation | CodeCode Available | 2 | 5 |
| Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting | Sep 13, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain AdaptationDomain Generalization | CodeCode Available | 2 | 5 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 | 5 |
| SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Jul 17, 2022 | Land Cover ClassificationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Does Image Anonymization Impact Computer Vision Training? | Jun 8, 2023 | Face AnonymizationInstance Segmentation | CodeCode Available | 2 | 5 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 2 | 5 |
| Domain Adaptation with a Single Vision-Language Embedding | Oct 28, 2024 | Domain AdaptationOne-shot Unsupervised Domain Adaptation | CodeCode Available | 2 | 5 |
| DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Jun 6, 2024 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Apr 26, 2023 | DiversityDomain Generalization | CodeCode Available | 2 | 5 |
| Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification | Mar 11, 2022 | Earth ObservationLand Cover Classification | CodeCode Available | 2 | 5 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Mar 9, 2022 | Co-Salient Object Detectionobject-detection | CodeCode Available | 2 | 5 |
| A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Mar 27, 2025 | Depth EstimationPrediction | CodeCode Available | 2 | 5 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 | 5 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | DecoderInstance Segmentation | CodeCode Available | 2 | 5 |
| DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Mar 24, 2025 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 | 5 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 2 | 5 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial OptimizationDenoising | CodeCode Available | 2 | 5 |
| Digital Twin Generation from Visual Data: A Survey | Apr 17, 2025 | Semantic SegmentationSurvey | CodeCode Available | 2 | 5 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance SegmentationNeRF | CodeCode Available | 2 | 5 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | DenoisingImage Segmentation | CodeCode Available | 2 | 5 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 | 5 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data AugmentationDiversity | CodeCode Available | 2 | 5 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 | 5 |
| Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Jan 16, 2024 | Domain GeneralizationImage Generation | CodeCode Available | 2 | 5 |
| Augmented Object Intelligence with XR-Objects | Apr 20, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 | 5 |
| DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Mar 9, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Mar 28, 2024 | Fine-Grained Image ClassificationImage Classification | CodeCode Available | 2 | 5 |
| Audio-Visual Segmentation with Semantics | Jan 30, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Jan 15, 2025 | Image SegmentationReferring Expression Segmentation | CodeCode Available | 2 | 5 |
| Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization | May 16, 2022 | graph partitioningSegmentation | CodeCode Available | 2 | 5 |
| Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Apr 3, 2025 | Field Boundary DelineationInstance Segmentation | CodeCode Available | 2 | 5 |
| Deep Incubation: Training Large Models by Divide-and-Conquering | Dec 8, 2022 | Image Segmentationobject-detection | CodeCode Available | 2 | 5 |
| Deep Snake for Real-Time Instance Segmentation | Jan 6, 2020 | GPUInstance Segmentation | CodeCode Available | 2 | 5 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance SegmentationObject | CodeCode Available | 2 | 5 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| Deep Video Prior for Video Consistency and Propagation | Jan 27, 2022 | Optical Flow EstimationSemantic Segmentation | CodeCode Available | 2 | 5 |
| DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | May 7, 2025 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| DDP: Diffusion Model for Dense Visual Prediction | Mar 30, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 | 5 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 | 5 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image ClassificationInstance Segmentation | CodeCode Available | 2 | 5 |