| ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation | Feb 3, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition | Jan 31, 2024 | Novel View SynthesisSegmentation | CodeCode Available | 2 |
| MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jan 30, 2024 | Autonomous DrivingObject | CodeCode Available | 2 |
| MouSi: Poly-Visual-Expert Vision-Language Models | Jan 30, 2024 | Image SegmentationImage-text matching | CodeCode Available | 2 |
| SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks | Jan 28, 2024 | 2D Semantic SegmentationDecoder | CodeCode Available | 2 |
| Vivim: a Video Vision Mamba for Medical Video Segmentation | Jan 25, 2024 | Lesion SegmentationMamba | CodeCode Available | 2 |
| Rethinking Patch Dependence for Masked Autoencoders | Jan 25, 2024 | DecoderInstance Segmentation | CodeCode Available | 2 |
| Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Jan 24, 2024 | Image SegmentationIn-Context Learning | CodeCode Available | 2 |
| Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Jan 23, 2024 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation | Jan 23, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Exploring Color Invariance through Image-Level Ensemble Learning | Jan 19, 2024 | Data AugmentationEnsemble Learning | CodeCode Available | 2 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPUImage Classification | CodeCode Available | 2 |
| Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Jan 16, 2024 | Domain GeneralizationImage Generation | CodeCode Available | 2 |
| OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts | Jan 16, 2024 | Amodal Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| UV-SAM: Adapting Segment Anything Model for Urban Village Identification | Jan 16, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation | Jan 15, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery | Jan 12, 2024 | Object RecognitionRoad Segmentation | CodeCode Available | 2 |
| Seg-metrics: a Python package to compute segmentation metrics | Jan 12, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Jan 11, 2024 | 3D Part SegmentationForeground Segmentation | CodeCode Available | 2 |
| Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation | Jan 9, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Jan 9, 2024 | Cell SegmentationImage Segmentation | CodeCode Available | 2 |
| ODIN: A Single Model for 2D and 3D Segmentation | Jan 4, 2024 | 3D Instance Segmentation3D Semantic Segmentation | CodeCode Available | 2 |
| ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation | Jan 4, 2024 | Decoderparameter-efficient fine-tuning | CodeCode Available | 2 |
| From SAM to CAMs: Exploring Segment Anything Model for Weakly Supervised Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| MRFS: Mutually Reinforcing Image Fusion and Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| LiSA: LiDAR Localization with Semantic Awareness | Jan 1, 2024 | Knowledge DistillationSemantic Segmentation | CodeCode Available | 2 |
| Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jan 1, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation | Jan 1, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens | Jan 1, 2024 | Semantic Segmentation | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts | Jan 1, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation | Dec 28, 2023 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Learning Vision from Models Rivals Learning Vision from Data | Dec 28, 2023 | Contrastive LearningImage Captioning | CodeCode Available | 2 |
| Unsupervised Universal Image Segmentation | Dec 28, 2023 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Dec 25, 2023 | Image SegmentationObject | CodeCode Available | 2 |
| Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation | Dec 23, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Dec 19, 2023 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Dec 19, 2023 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention | Dec 14, 2023 | Image SegmentationLesion Segmentation | CodeCode Available | 2 |
| ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Dec 12, 2023 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems | Dec 11, 2023 | Image GenerationSemantic Segmentation | CodeCode Available | 2 |
| Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Dec 7, 2023 | Domain Generalization | CodeCode Available | 2 |
| Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Dec 6, 2023 | 3DGS3D scene Editing | CodeCode Available | 2 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model OptimizationNovel Concepts | CodeCode Available | 2 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Dec 4, 2023 | 3D Human Pose EstimationAction Recognition | CodeCode Available | 2 |
| TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Nov 28, 2023 | ClassificationDomain Generalization | CodeCode Available | 2 |
| SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Nov 27, 2023 | 6D Pose Estimation using RGBInstance Segmentation | CodeCode Available | 2 |
| Adapter is All You Need for Tuning Visual Tasks | Nov 25, 2023 | Allimage-classification | CodeCode Available | 2 |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |