| MRFS: Mutually Reinforcing Image Fusion and Segmentation | Jan 1, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| LiSA: LiDAR Localization with Semantic Awareness | Jan 1, 2024 | Knowledge Distillation, Semantic Segmentation | Code Available | 2 |
| Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Jan 1, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation | Jan 1, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens | Jan 1, 2024 | Semantic Segmentation | Code Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Jan 1, 2024 | Image Segmentation, Interactive Segmentation | Code Available | 2 |
| SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation | Dec 28, 2023 | Real-Time Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| Learning Vision from Models Rivals Learning Vision from Data | Dec 28, 2023 | Contrastive Learning, Image Captioning | Code Available | 2 |
| Unsupervised Universal Image Segmentation | Dec 28, 2023 | Image Segmentation, Instance Segmentation | Code Available | 2 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Dec 25, 2023 | Image Segmentation, Object | Code Available | 2 |
| Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation | Dec 23, 2023 | Decoder, Image Segmentation | Code Available | 2 |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Dec 19, 2023 | Denoising, Dichotomous Image Segmentation | Code Available | 2 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Dec 19, 2023 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 2 |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dec 14, 2023 | Computational Efficiency, image-classification | Code Available | 2 |
| MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention | Dec 14, 2023 | Image Segmentation, Lesion Segmentation | Code Available | 2 |
| ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Dec 12, 2023 | Image Segmentation, Interactive Segmentation | Code Available | 2 |
| ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems | Dec 11, 2023 | Image Generation, Semantic Segmentation | Code Available | 2 |
| Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Dec 7, 2023 | Domain Generalization | Code Available | 2 |
| Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Dec 6, 2023 | 3DGS, 3D scene Editing | Code Available | 2 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model Optimization, Novel Concepts | Code Available | 2 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Dec 4, 2023 | 3D Human Pose Estimation, Action Recognition | Code Available | 2 |
| TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Nov 28, 2023 | Classification, Domain Generalization | Code Available | 2 |
| SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation | Nov 27, 2023 | 6D Pose Estimation using RGB, Instance Segmentation | Code Available | 2 |
| Adapter is All You Need for Tuning Visual Tasks | Nov 25, 2023 | All, image-classification | Code Available | 2 |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation, 3D Object Detection | Code Available | 2 |