| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Mamba-R: Vision Mamba ALSO Needs Registers | May 23, 2024 | MambaSemantic Segmentation | CodeCode Available | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization | May 16, 2022 | graph partitioningSegmentation | CodeCode Available | 2 |
| DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Mar 9, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | May 1, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Masked Generative Distillation | May 3, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Mask-Free Video Instance Segmentation | Mar 28, 2023 | Instance SegmentationOptical Flow Estimation | CodeCode Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 |
| Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Jul 16, 2024 | Human Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Apr 2, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation | Aug 31, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Sep 28, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| Medical Image Segmentation with Domain Adaptation: A Survey | Nov 3, 2023 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Apr 14, 2022 | Knowledge DistillationSegmentation | CodeCode Available | 2 |
| MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Mar 17, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds | Jul 10, 2022 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Aug 14, 2024 | Anomaly DetectionBoundary Detection | CodeCode Available | 2 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Jan 16, 2024 | Domain GeneralizationImage Generation | CodeCode Available | 2 |
| Binary Neural Networks: A Survey | Mar 31, 2020 | Binarizationimage-classification | CodeCode Available | 2 |
| MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning | May 14, 2025 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 2 |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Dec 2, 2022 | Domain Adaptationimage-classification | CodeCode Available | 2 |
| MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training | Aug 3, 2022 | Instance SegmentationSegmentation | CodeCode Available | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | May 4, 2024 | Earth Observationimage-classification | CodeCode Available | 2 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks | Apr 18, 2019 | 3D Semantic Segmentation4D Spatio Temporal Semantic Segmentation | CodeCode Available | 2 |
| BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation | Apr 5, 2020 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 2 |
| MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Feb 3, 2023 | ObjectSegmentation | CodeCode Available | 2 |
| BlenderProc | Oct 25, 2019 | 3D Object RecognitionDepth Image Estimation | CodeCode Available | 2 |
| Moving Object Segmentation in Point Cloud Data using Hidden Markov Models | Oct 24, 2024 | Semantic Segmentation | CodeCode Available | 2 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Aug 25, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images | Mar 21, 2025 | Image SegmentationMamba | CodeCode Available | 2 |
| Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Oct 29, 2024 | Few-shot 3D Point Cloud Semantic SegmentationPoint Cloud Segmentation | CodeCode Available | 2 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance SegmentationNeRF | CodeCode Available | 2 |
| Multi-Task Learning as Multi-Objective Optimization | Oct 10, 2018 | Depth EstimationGeneral Classification | CodeCode Available | 2 |
| Neighborhood Attention Transformer | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Neural 3D Scene Reconstruction with the Manhattan-world Assumption | May 5, 2022 | 2D Semantic Segmentation3D Reconstruction | CodeCode Available | 2 |
| nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Feb 5, 2024 | 3D Medical Imaging SegmentationImage Segmentation | CodeCode Available | 2 |
| nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance | Sep 29, 2023 | Few-Shot LearningHeart Segmentation | CodeCode Available | 2 |
| DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Aug 11, 2023 | Dataset GenerationDecoder | CodeCode Available | 2 |
| Ambiguous Medical Image Segmentation using Diffusion Models | Apr 10, 2023 | DiagnosticDiversity | CodeCode Available | 2 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object SegmentationDecoder | CodeCode Available | 2 |
| An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Apr 18, 2024 | Contrastive LearningCPU | CodeCode Available | 2 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Co-Occurrent Features in Semantic Segmentation | Jun 1, 2019 | SegmentationSemantic Segmentation | CodeCode Available | 2 |