| DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive LearningImage Segmentation | CodeCode Available | 1 |
| ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation | Dec 17, 2024 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation | Dec 16, 2024 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation | Dec 16, 2024 | DiversitySemantic Segmentation | CodeCode Available | 1 |
| MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation | Dec 15, 2024 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 1 |
| RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone | Dec 14, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Dec 14, 2024 | 3D ReconstructionSegmentation | CodeCode Available | 1 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | DecoderGPU | CodeCode Available | 1 |
| Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation | Dec 9, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 1 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Dec 8, 2024 | GPUImage Segmentation | CodeCode Available | 1 |
| RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts | Dec 7, 2024 | Change DetectionImage Comprehension | CodeCode Available | 1 |
| MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities | Dec 4, 2024 | Image GenerationImage Segmentation | CodeCode Available | 1 |
| Active Negative Loss: A Robust Framework for Learning with Noisy Labels | Dec 3, 2024 | Image SegmentationLearning with noisy labels | CodeCode Available | 1 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Dec 2, 2024 | Self-Supervised LearningSemantic Segmentation | CodeCode Available | 1 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | ObjectSegmentation | CodeCode Available | 1 |
| SyncVIS: Synchronized Video Instance Segmentation | Dec 1, 2024 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| TAROT: Targeted Data Selection via Optimal Transport | Nov 30, 2024 | motion predictionSemantic Segmentation | CodeCode Available | 1 |
| Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Nov 29, 2024 | 3D geometry3DGS | CodeCode Available | 1 |
| A SAM-guided and Match-based Semi-Supervised Segmentation Framework for Medical Imaging | Nov 25, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Deformable Mamba for Wide Field of View Segmentation | Nov 25, 2024 | DecoderMamba | CodeCode Available | 1 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Nov 25, 2024 | Instance SegmentationKnowledge Distillation | CodeCode Available | 1 |