| PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Jan 15, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Jan 11, 2024 | 3D Part SegmentationForeground Segmentation | CodeCode Available | 2 |
| Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation | Jan 9, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Jan 9, 2024 | Cell SegmentationImage Segmentation | CodeCode Available | 2 |
| ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation | Jan 4, 2024 | Decoderparameter-efficient fine-tuning | CodeCode Available | 2 |
| ODIN: A Single Model for 2D and 3D Segmentation | Jan 4, 2024 | 3D Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 |
| MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation | Jan 1, 2024 | SegmentationVideo Segmentation | CodeCode Available | 2 |
| BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation | Jan 1, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| From SAM to CAMs: Exploring Segment Anything Model for Weakly Supervised Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation | Jan 1, 2024 | Action SegmentationSegmentation | CodeCode Available | 2 |
| Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| MRFS: Mutually Reinforcing Image Fusion and Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts | Jan 1, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Unsupervised Universal Image Segmentation | Dec 28, 2023 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts | Dec 28, 2023 | AllAnatomy | CodeCode Available | 2 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Dec 25, 2023 | Image SegmentationObject | CodeCode Available | 2 |
| Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation | Dec 23, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Dec 19, 2023 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 |
| MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention | Dec 14, 2023 | Image SegmentationLesion Segmentation | CodeCode Available | 2 |
| Polyper: Boundary Sensitive Polyp Segmentation | Dec 14, 2023 | Segmentation | CodeCode Available | 2 |
| ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Dec 12, 2023 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Dec 7, 2023 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model OptimizationNovel Concepts | CodeCode Available | 2 |
| Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline | Dec 5, 2023 | Crowd Countingobject-detection | CodeCode Available | 2 |
| PixelLM: Pixel Reasoning with Large Multimodal Model | Dec 4, 2023 | Decodermodel | CodeCode Available | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | AllObject | CodeCode Available | 2 |
| Universal Segmentation at Arbitrary Granularity with Language Instruction | Dec 4, 2023 | Referring Expression SegmentationSegmentation | CodeCode Available | 2 |
| 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation | Dec 1, 2023 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Nov 27, 2023 | 6D Pose Estimation using RGBInstance Segmentation | CodeCode Available | 2 |
| Adapter is All You Need for Tuning Visual Tasks | Nov 25, 2023 | Allimage-classification | CodeCode Available | 2 |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| SegVol: Universal and Interactive Volumetric Medical Image Segmentation | Nov 22, 2023 | Computed Tomography (CT)Image Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Camouflaged Object Segmentation | Nov 19, 2023 | Camouflaged Object SegmentationImage Segmentation | CodeCode Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 |
| Medical Image Segmentation with Domain Adaptation: A Survey | Nov 3, 2023 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images | Oct 23, 2023 | 3D ArchitectureImage Segmentation | CodeCode Available | 2 |
| Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union Learning | Oct 22, 2023 | Action RecognitionAction Segmentation | CodeCode Available | 2 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| You Only Look at Once for Real-time and Generic Multi-Task | Oct 2, 2023 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance | Sep 29, 2023 | Few-Shot LearningHeart Segmentation | CodeCode Available | 2 |
| PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation | Sep 19, 2023 | 3D ReconstructionSegmentation | CodeCode Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 |
| Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting | Sep 13, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase | Sep 11, 2023 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation | Sep 7, 2023 | Organ SegmentationSegmentation | CodeCode Available | 2 |
| RevColV2: Exploring Disentangled Representations in Masked Image Modeling | Sep 2, 2023 | Decoderimage-classification | CodeCode Available | 2 |
| FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRI | Aug 24, 2023 | GPUSegmentation | CodeCode Available | 2 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression | Aug 21, 2023 | Decoderregression | CodeCode Available | 2 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 |