| Audio-Visual Segmentation with Semantics | Jan 30, 2023 | Segmentation, Semantic Segmentation | Code Available | 2 |
| LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation | Mar 12, 2024 | Image Segmentation, Long-range modeling | Code Available | 2 |
| Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Dec 18, 2024 | Image Segmentation, Knowledge Distillation | Code Available | 2 |
| Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers | Mar 5, 2022 | Semantic Segmentation, Weakly supervised Semantic Segmentation | Code Available | 2 |
| Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation | Apr 15, 2022 | 3D Semantic Segmentation, Colorization | Code Available | 2 |
| Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | May 1, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Learning What Not to Segment: A New Perspective on Few-Shot Segmentation | Mar 15, 2022 | Few-Shot Semantic Segmentation, Meta-Learning | Code Available | 2 |
| Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Mar 5, 2024 | Pseudo Label, Semantic Segmentation | Code Available | 2 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive Learning, Knowledge Distillation | Code Available | 2 |
| A Unified Image-Dense Annotation Generation Model for Underwater Scenes | Mar 27, 2025 | Depth Estimation, Prediction | Code Available | 2 |
| DaCapo: a modular deep learning framework for scalable 3D image segmentation | Aug 5, 2024 | Image Segmentation, Management | Code Available | 2 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance Segmentation, Salient Object Detection | Code Available | 2 |
| A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection | Mar 9, 2022 | Co-Salient Object Detection, object-detection | Code Available | 2 |
| AllWeatherNet: Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Sep 3, 2024 | Autonomous Driving, Deep Attention | Code Available | 2 |
| LViT: Language meets Vision Transformer in Medical Image Segmentation | Jun 29, 2022 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation | Apr 30, 2024 | Attribute, Semantic Segmentation | Code Available | 2 |
| M^2SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation | Mar 20, 2023 | Computed Tomography (CT), Decoder | Code Available | 2 |
| Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Mar 24, 2022 | Image Generation, Semantic Segmentation | Code Available | 2 |
| DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Mar 9, 2025 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic Segmentation, Semi-Supervised Semantic Segmentation | Code Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image Segmentation, Segmentation | Code Available | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance Segmentation, Object | Code Available | 2 |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Mar 28, 2024 | Fine-Grained Image Classification, Image Classification | Code Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometry, Decoder | Code Available | 2 |