| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Dec 20, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| CAD: Memory Efficient Convolutional Adapter for Segment Anything | Sep 24, 2024 | DecoderGPU | CodeCode Available | 1 | 5 |
| C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation | Jun 9, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 | 5 |
| 3D MRI Synthesis with Slice-Based Latent Diffusion Models: Improving Tumor Segmentation Tasks in Data-Scarce Regimes | Jun 8, 2024 | Data AugmentationImage Generation | CodeCode Available | 1 | 5 |
| Channelized Axial Attention for Semantic Segmentation -- Considering Channel Relation within Spatial Attention for Semantic Segmentation | Jan 19, 2021 | RelationSegmentation | CodeCode Available | 1 | 5 |
| Detect Any Shadow: Segment Anything for Video Shadow Detection | May 26, 2023 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Detection and Retrieval of Out-of-Distribution Objects in Semantic Segmentation | May 14, 2020 | Dimensionality ReductionImage Retrieval | CodeCode Available | 1 | 5 |
| D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation | Jan 3, 2022 | DecoderImage Segmentation | CodeCode Available | 1 | 5 |
| DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy | Jul 2, 2025 | Data AugmentationGeneralized Referring Expression Segmentation | CodeCode Available | 1 | 5 |
| BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video | Sep 25, 2022 | Long-tail Video Object SegmentationMulti-Object Tracking | CodeCode Available | 1 | 5 |
| DermSynth3D: Synthesis of in-the-wild Annotated Dermatology Images | May 22, 2023 | Semantic Segmentation | CodeCode Available | 1 | 5 |
| Building Extraction from Remote Sensing Images via an Uncertainty-Aware Network | Jul 23, 2023 | DecoderExtracting Buildings In Remote Sensing Images | CodeCode Available | 1 | 5 |
| BuildingNet: Learning to Label 3D Buildings | Oct 11, 2021 | 2k3D Building Mesh Labeling | CodeCode Available | 1 | 5 |
| DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation | Jun 1, 2023 | DecoderDomain Generalization | CodeCode Available | 1 | 5 |
| Depth Based Semantic Scene Completion with Position Importance Aware Loss | Jan 29, 2020 | 3D Semantic SegmentationPosition | CodeCode Available | 1 | 5 |
| Depth-based 6DoF Object Pose Estimation using Swin Transformer | Mar 3, 2023 | 6D Pose Estimation6D Pose Estimation using RGB | CodeCode Available | 1 | 5 |
| Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion | Jul 10, 2022 | DecoderDepth Estimation | CodeCode Available | 1 | 5 |
| BT-Unet: A self-supervised learning framework for biomedical image segmentation using Barlow Twins with U-Net models | Dec 7, 2021 | Image SegmentationSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| 3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation | Jun 1, 2020 | 3D Object Detection3D Semantic Instance Segmentation | CodeCode Available | 1 | 5 |
| Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception | May 12, 2024 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Depth-Wise Convolutions in Vision Transformers for Efficient Training on Small Datasets | Jul 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Condition-Invariant Semantic Segmentation | May 27, 2023 | Domain AdaptationSegmentation | CodeCode Available | 1 | 5 |
| Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation | Jul 21, 2023 | DecoderImage Segmentation | CodeCode Available | 1 | 5 |
| Depth-Assisted ResiDualGAN for Cross-Domain Aerial Images Semantic Segmentation | Aug 21, 2022 | Domain AdaptationSegmentation | CodeCode Available | 1 | 5 |
| Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models | May 15, 2023 | 3D Object DetectionImage Captioning | CodeCode Available | 1 | 5 |