| Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Dec 25, 2024 | Open Vocabulary Panoptic Segmentation, Panoptic Segmentation | Code Available | 1 |
| PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation | Dec 19, 2024 | LIDAR Semantic Segmentation, Scene Understanding | Code Available | 1 |
| Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation | Dec 19, 2024 | Image Segmentation, Segmentation | Code Available | 1 |
| ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Dec 17, 2024 | Instance Segmentation, Segmentation | Code Available | 1 |
| MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation | Dec 16, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Dec 14, 2024 | 3D Reconstruction, Segmentation | Code Available | 1 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | Decoder, GPU | Code Available | 1 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Dec 11, 2024 | Mamba, Segmentation | Code Available | 1 |
| XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational Encoder-Decoder | Dec 9, 2024 | Brain Tumor Segmentation, Decoder | Code Available | 1 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Dec 8, 2024 | GPU, Image Segmentation | Code Available | 1 |
| CLIP-TNseg: A Multi-Modal Hybrid Framework for Thyroid Nodule Segmentation in Ultrasound Images | Dec 7, 2024 | Segmentation | Code Available | 1 |
| MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities | Dec 4, 2024 | Image Generation, Image Segmentation | Code Available | 1 |
| RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation | Dec 3, 2024 | Referring Expression, Referring Expression Segmentation | Code Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | Object, Segmentation | Code Available | 1 |
| MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation | Dec 2, 2024 | Diagnostic, Lesion Segmentation | Code Available | 1 |
| CellSeg1: Robust Cell Segmentation with One Training Image | Dec 2, 2024 | Cell Segmentation, Segmentation | Code Available | 1 |
| SyncVIS: Synchronized Video Instance Segmentation | Dec 1, 2024 | Instance Segmentation, Segmentation | Code Available | 1 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classification, Image Classification | Code Available | 1 |
| cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis | Nov 26, 2024 | Brain Tumor Segmentation, Image Generation | Code Available | 1 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance Segmentation, Knowledge Distillation | Code Available | 1 |
| Deformable Mamba for Wide Field of View Segmentation | Nov 25, 2024 | Decoder, Mamba | Code Available | 1 |
| A SAM-guided and Match-based Semi-Supervised Segmentation Framework for Medical Imaging | Nov 25, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| Peritumoral Expansion Radiomics for Improved Lung Cancer Classification | Nov 24, 2024 | 3D Classification, Cancer Classification | Code Available | 1 |
| MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training | Nov 23, 2024 | Computed Tomography (CT), Image Segmentation | Code Available | 1 |