| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | ObjectSegmentation | CodeCode Available | 2 |
| FEC: Fast Euclidean Clustering for Point Cloud Segmentation | Aug 16, 2022 | ClusteringInstance Segmentation | CodeCode Available | 2 |
| Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation | Sep 24, 2024 | DiversityInstance Segmentation | CodeCode Available | 2 |
| Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Mar 5, 2025 | ObjectReferring Video Object Segmentation | CodeCode Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 |
| Focal Modulation Networks | Mar 22, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignmentMultiple Object Tracking | CodeCode Available | 2 |
| FreeSOLO: Learning to Segment Objects without Annotations | Feb 24, 2022 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds | Nov 25, 2019 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Jun 24, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Full Page Handwriting Recognition via Image to Sequence Extraction | Mar 11, 2021 | Handwriting RecognitionHandwritten Text Recognition | CodeCode Available | 2 |
| Fully Convolutional Instance-aware Semantic Segmentation | Nov 23, 2016 | General ClassificationInstance Segmentation | CodeCode Available | 2 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentationmodel | CodeCode Available | 2 |
| Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Apr 19, 2024 | Earth ObservationSegmentation | CodeCode Available | 2 |
| Domain Adaptation with a Single Vision-Language Embedding | Oct 28, 2024 | Domain AdaptationOne-shot Unsupervised Domain Adaptation | CodeCode Available | 2 |
| Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation | Apr 21, 2024 | Semantic SegmentationVideo Object Segmentation | CodeCode Available | 2 |
| Generative Semantic Segmentation | Mar 20, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPUObject | CodeCode Available | 2 |
| Global Context for Convolutional Pose Machines | Jun 10, 2019 | Pose EstimationSemantic Segmentation | CodeCode Available | 2 |
| GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Oct 5, 2022 | Out-of-Distribution DetectionSegmentation | CodeCode Available | 2 |
| Golden Cudgel Network for Real-Time Semantic Segmentation | Mar 5, 2025 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| GroupMamba: Efficient Group-Based Visual State Space Model | Jul 18, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Feb 22, 2022 | Object DetectionScene Understanding | CodeCode Available | 2 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language ModelingLarge Language Model | CodeCode Available | 2 |
| Hierarchical Multi-Scale Attention for Semantic Segmentation | May 21, 2020 | Panoptic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Digital Twin Generation from Visual Data: A Survey | Apr 17, 2025 | Semantic SegmentationSurvey | CodeCode Available | 2 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Oct 14, 2024 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance SegmentationInteractive Segmentation | CodeCode Available | 2 |
| A Simple Image Segmentation Framework via In-Context Examples | Oct 7, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | DiagnosticDiversity | CodeCode Available | 2 |
| iFormer: Integrating ConvNet and Transformer for Mobile Application | Jan 26, 2025 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| Diffusion models as plug-and-play priors | Jun 17, 2022 | Combinatorial OptimizationDenoising | CodeCode Available | 2 |
| You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction | May 30, 2022 | Exposure CorrectionImage Enhancement | CodeCode Available | 2 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | DecoderInstance Segmentation | CodeCode Available | 2 |
| DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Mar 9, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Asymmetric Non-local Neural Networks for Semantic Segmentation | Aug 21, 2019 | GPUSegmentation | CodeCode Available | 2 |
| A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | May 24, 2023 | Dense Pixel Correspondence EstimationRepresentation Learning | CodeCode Available | 2 |
| In Defense of Online Models for Video Instance Segmentation | Jul 21, 2022 | Contrastive LearningInstance Segmentation | CodeCode Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions | Sep 3, 2024 | Autonomous DrivingDeep Attention | CodeCode Available | 2 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | DenoisingImage Segmentation | CodeCode Available | 2 |
| DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation | Mar 24, 2025 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary DetectionHuman Parsing | CodeCode Available | 2 |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Jul 10, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance SegmentationObject | CodeCode Available | 2 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image SegmentationSegmentation | CodeCode Available | 2 |