| ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Aug 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Aug 9, 2024 | Image to textObject | CodeCode Available | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Aug 9, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Segment anything model 2: an application to 2D and 3D medical images | Aug 1, 2024 | Computed Tomography (CT)Segmentation | CodeCode Available | 2 |
| MSA^2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation | Jul 31, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation Framework | Jul 31, 2024 | 3D Medical Imaging SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Jul 25, 2024 | 3D visual groundingImage Segmentation | CodeCode Available | 2 |
| ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation | Jul 19, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks | Jul 18, 2024 | Autonomous DrivingBEV Segmentation | CodeCode Available | 2 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | DenoisingImage Segmentation | CodeCode Available | 2 |
| Exploiting Scale-Variant Attention for Segmenting Small Medical Objects | Jul 10, 2024 | Cell SegmentationMRI segmentation | CodeCode Available | 2 |
| LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Videos | Jul 8, 2024 | SegmentationVideo Polyp Segmentation | CodeCode Available | 2 |
| Training-free CryoET Tomogram Segmentation | Jul 8, 2024 | Contrastive LearningCryogenic Electron Tomography | CodeCode Available | 2 |
| SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention | Jul 6, 2024 | Classificationobject-detection | CodeCode Available | 2 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 |
| HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation | Jul 3, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding | Jul 3, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| Centerline Boundary Dice Loss for Vascular Segmentation | Jul 1, 2024 | Segmentation | CodeCode Available | 2 |
| Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Jun 26, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Jun 24, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud | Jun 24, 2024 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 2 |
| SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation | Jun 21, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | DecoderSegmentation | CodeCode Available | 2 |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Jun 17, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance SegmentationSalient Object Detection | CodeCode Available | 2 |
| F-LMM: Grounding Frozen Large Multimodal Models | Jun 9, 2024 | General KnowledgeInstruction Following | CodeCode Available | 2 |
| U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation | Jun 5, 2024 | Image SegmentationKolmogorov-Arnold Networks | CodeCode Available | 2 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Jun 5, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| Generative Active Learning for Long-tailed Instance Segmentation | Jun 4, 2024 | Active LearningInstance Segmentation | CodeCode Available | 2 |
| FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis | Jun 3, 2024 | SegmentationTumor Segmentation | CodeCode Available | 2 |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Jun 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | EpidemiologyHoldout Set | CodeCode Available | 2 |
| Memorize What Matters: Emergent Scene Decomposition from Multitraverse | May 27, 2024 | 3D ReconstructionNeural Rendering | CodeCode Available | 2 |
| Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | May 27, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction | May 27, 2024 | MambaPrediction | CodeCode Available | 2 |
| Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | May 27, 2024 | SegmentationSemantic correspondence | CodeCode Available | 2 |
| Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning | May 20, 2024 | BenchmarkingMRI segmentation | CodeCode Available | 2 |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | May 10, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| MRSegmentator: Multi-Modality Segmentation of 40 Classes in MRI and CT | May 10, 2024 | Model OptimizationOrgan Segmentation | CodeCode Available | 2 |
| PTQ4SAM: Post-Training Quantization for Segment Anything | May 6, 2024 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| Multi-Space Alignments Towards Universal LiDAR Segmentation | May 2, 2024 | Autonomous DrivingDiversity | CodeCode Available | 2 |
| GraCo: Granularity-Controllable Interactive Segmentation | May 1, 2024 | Interactive SegmentationSegmentation | CodeCode Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentationmodel | CodeCode Available | 2 |
| A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Apr 25, 2024 | Autonomous DrivingEvolutionary Algorithms | CodeCode Available | 2 |
| Multimodal Information Interaction for Medical Image Segmentation | Apr 25, 2024 | Heart SegmentationImage Segmentation | CodeCode Available | 2 |
| Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Apr 19, 2024 | Earth ObservationSegmentation | CodeCode Available | 2 |
| Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation | Apr 13, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Apr 12, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |