| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | Segmentation, Semantic Segmentation | Code Available | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance Segmentation, Object | Code Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic Segmentation, Semi-Supervised Semantic Segmentation | Code Available | 2 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image Segmentation, Segmentation | Code Available | 2 |
| AllWeatherNet: Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Sep 3, 2024 | Autonomous Driving, Deep Attention | Code Available | 2 |
| FEC: Fast Euclidean Clustering for Point Cloud Segmentation | Aug 16, 2022 | Clustering, Instance Segmentation | Code Available | 2 |
| Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Sep 24, 2024 | Diversity, Instance Segmentation | Code Available | 2 |
| Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Mar 5, 2025 | Object, Referring Video Object Segmentation | Code Available | 2 |
| RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds | Nov 25, 2019 | 3D Semantic Segmentation, LIDAR Semantic Segmentation | Code Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentation, model | Code Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometry, Decoder | Code Available | 2 |
| FreeSOLO: Learning to Segment Objects without Annotations | Feb 24, 2022 | Instance Segmentation, object-detection | Code Available | 2 |
| Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Jun 24, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | Decoder, Segmentation | Code Available | 2 |
| ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding | Oct 17, 2024 | 3D Semantic Segmentation, Image Generation | Code Available | 2 |
| Full Page Handwriting Recognition via Image to Sequence Extraction | Mar 11, 2021 | Handwriting Recognition, Handwritten Text Recognition | Code Available | 2 |
| Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Apr 19, 2024 | Earth Observation, Segmentation | Code Available | 2 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial Optimization, Denoising | Code Available | 2 |
| Generative Active Learning for Long-tailed Instance Segmentation | Jun 4, 2024 | Active Learning, Instance Segmentation | Code Available | 2 |
| Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes | Aug 30, 2024 | Deep Learning, Image Segmentation | Code Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question Answering, Image Captioning | Code Available | 2 |
| Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Jan 15, 2025 | Image Segmentation, Referring Expression Segmentation | Code Available | 2 |
| Global Context Vision Transformers | Jun 20, 2022 | image-classification, Image Classification | Code Available | 2 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Mar 28, 2024 | Fine-Grained Image Classification, Image Classification | Code Available | 2 |