| Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Dec 25, 2024 | Open Vocabulary Panoptic Segmentation, Panoptic Segmentation | Code Available | 1 |
| Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Dec 19, 2024 | Image Segmentation, Segmentation | Code Available | 1 |
| PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Dec 19, 2024 | LIDAR Semantic Segmentation, Scene Understanding | Code Available | 1 |
| ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Dec 17, 2024 | Instance Segmentation, Segmentation | Code Available | 1 |
| MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation | Dec 16, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Dec 14, 2024 | 3D Reconstruction, Segmentation | Code Available | 1 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | Decoder, GPU | Code Available | 1 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Dec 11, 2024 | Mamba, Segmentation | Code Available | 1 |
| XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational Encoder-Decoder | Dec 9, 2024 | Brain Tumor Segmentation, Decoder | Code Available | 1 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Dec 8, 2024 | GPU, Image Segmentation | Code Available | 1 |
| CLIP-TNseg: A Multi-Modal Hybrid Framework for Thyroid Nodule Segmentation in Ultrasound Images | Dec 7, 2024 | Segmentation | Code Available | 1 |
| MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities | Dec 4, 2024 | Image Generation, Image Segmentation | Code Available | 1 |
| RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation | Dec 3, 2024 | Referring Expression, Referring Expression Segmentation | Code Available | 1 |
| CellSeg1: Robust Cell Segmentation with One Training Image | Dec 2, 2024 | Cell Segmentation, Segmentation | Code Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | Object, Segmentation | Code Available | 1 |
| MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation | Dec 2, 2024 | Diagnostic, Lesion Segmentation | Code Available | 1 |
| SyncVIS: Synchronized Video Instance Segmentation | Dec 1, 2024 | Instance Segmentation, Segmentation | Code Available | 1 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classification, Image Classification | Code Available | 1 |
| cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis | Nov 26, 2024 | Brain Tumor Segmentation, Image Generation | Code Available | 1 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance Segmentation, Knowledge Distillation | Code Available | 1 |
| Deformable Mamba for Wide Field of View Segmentation | Nov 25, 2024 | Decoder, Mamba | Code Available | 1 |
| A SAM-guided and Match-based Semi-Supervised Segmentation Framework for Medical Imaging | Nov 25, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| Peritumoral Expansion Radiomics for Improved Lung Cancer Classification | Nov 24, 2024 | 3D Classification, Cancer Classification | Code Available | 1 |
| MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training | Nov 23, 2024 | Computed Tomography (CT), Image Segmentation | Code Available | 1 |
| Optimized Vessel Segmentation: A Structure-Agnostic Approach with Small Vessel Enhancement and Morphological Correction | Nov 22, 2024 | Segmentation | Code Available | 1 |
| CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Nov 21, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images | Nov 20, 2024 | Segmentation | Code Available | 1 |
| XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Nov 20, 2024 | 3D geometry, 3D Semantic Segmentation | Code Available | 1 |
| Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian Splatting | Nov 19, 2024 | Segmentation | Code Available | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Nov 18, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations | Nov 12, 2024 | Math, Retrieval | Code Available | 1 |
| ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset | Nov 7, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| HRDecoder: High-Resolution Decoder Network for Fundus Image Lesion Segmentation | Nov 6, 2024 | Decoder, GPU | Code Available | 1 |
| Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective | Nov 5, 2024 | Decoder, Segmentation | Code Available | 1 |
| SpineFM: Leveraging Foundation Models for Automatic Spine X-ray Segmentation | Nov 1, 2024 | Segmentation | Code Available | 1 |
| COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes | Oct 31, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation | Oct 31, 2024 | Image Segmentation, Mamba | Code Available | 1 |
| Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation | Oct 29, 2024 | Cross-Domain Few-Shot, Few-Shot Semantic Segmentation | Code Available | 1 |
| Topology-aware Mamba for Crack Segmentation in Structures | Oct 25, 2024 | Crack Segmentation, Decoder | Code Available | 1 |
| Beyond Point Annotation: A Weakly Supervised Network Guided by Multi-Level Labels Generated from Four-Point Annotation for Thyroid Nodule Segmentation in Ultrasound Image | Oct 25, 2024 | Segmentation | Code Available | 1 |
| Gaze-Assisted Medical Image Segmentation | Oct 23, 2024 | Diagnostic, Image Segmentation | Code Available | 1 |
| Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation | Oct 20, 2024 | Clustering, graph partitioning | Code Available | 1 |
| EViT-Unet: U-Net Like Efficient Vision Transformer for Medical Image Segmentation on Mobile and Edge Devices | Oct 19, 2024 | Decoder, Image Segmentation | Code Available | 1 |
| LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Oct 17, 2024 | cross-modal alignment, Instance Segmentation | Code Available | 1 |
| RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation | Oct 15, 2024 | Benchmarking, Interactive Segmentation | Code Available | 1 |
| PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Oct 14, 2024 | 3D Panoptic Segmentation, Panoptic Segmentation | Code Available | 1 |
| UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation | Oct 13, 2024 | All, Bilevel Optimization | Code Available | 1 |
| CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation | Oct 10, 2024 | Crack Segmentation, Denoising | Code Available | 1 |
| Iterative Optimization Annotation Pipeline and ALSS-YOLO-Seg for Efficient Banana Plantation Segmentation in UAV Imagery | Oct 9, 2024 | Segmentation | Code Available | 1 |