| Panoptic Feature Pyramid Networks | Jan 8, 2019 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 4 |
| U-Net: Convolutional Networks for Biomedical Image Segmentation | May 18, 2015 | Cell SegmentationCell Tracking | CodeCode Available | 3 |
| Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Apr 5, 2024 | DecoderMamba | CodeCode Available | 3 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Context Encoding for Semantic Segmentation | Mar 23, 2018 | image-classificationImage Classification | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Apr 26, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 2 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Context-Aware Interaction Network for RGB-T Semantic Segmentation | Jan 3, 2024 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 1 |
| Deep High-Resolution Representation Learning for Visual Recognition | Aug 20, 2019 | Dichotomous Image SegmentationFace Alignment | CodeCode Available | 1 |
| Accurate RGB-D Salient Object Detection via Collaborative Learning | Jul 23, 2020 | Objectobject-detection | CodeCode Available | 1 |
| A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection | Jul 14, 2020 | Decoderobject-detection | CodeCode Available | 1 |
| Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection | Aug 1, 2020 | Saliency DetectionThermal Image Segmentation | CodeCode Available | 1 |
| Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation | Jul 17, 2020 | Object DetectionSegmentation | CodeCode Available | 1 |
| BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation | Aug 2, 2018 | Dichotomous Image SegmentationReal-Time Semantic Segmentation | CodeCode Available | 1 |
| Calibrated RGB-D Salient Object Detection | Jun 19, 2021 | Objectobject-detection | CodeCode Available | 1 |
| CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images | Feb 22, 2023 | Knowledge DistillationScene Understanding | CodeCode Available | 1 |
| Complementary Random Masking for RGB-Thermal Semantic Segmentation | Mar 30, 2023 | Scene UnderstandingSemantic Segmentation | CodeCode Available | 1 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoderobject-detection | CodeCode Available | 1 |
| Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis | Nov 13, 2020 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing | Aug 15, 2023 | Scene ParsingSemantic Segmentation | CodeCode Available | 1 |
| Enhanced Boundary Learning for Glass-like Object Segmentation | Mar 29, 2021 | DecoderObject | CodeCode Available | 1 |
| Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks | Mar 28, 2023 | Crowd Countingobject-detection | CodeCode Available | 1 |
| FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation | Oct 18, 2021 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 1 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | BenchmarkingScene Segmentation | CodeCode Available | 1 |
| Fully Convolutional Networks for Semantic Segmentation | Nov 14, 2014 | Crack SegmentationMultispectral Object Detection | CodeCode Available | 1 |
| Glass Segmentation with RGB-Thermal Image Pairs | Apr 12, 2022 | SegmentationThermal Image Segmentation | CodeCode Available | 1 |
| Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection | Jul 13, 2020 | object-detectionRGB-D Salient Object Detection | CodeCode Available | 1 |
| LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation | Jun 14, 2017 | GPUScene Understanding | CodeCode Available | 1 |
| Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation | Aug 4, 2023 | Autonomous DrivingSegmentation | CodeCode Available | 1 |
| MMSFormer: Multimodal Transformer for Material and Semantic Segmentation | Sep 7, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation | Aug 8, 2023 | Infrared And Visible Image FusionSegmentation | CodeCode Available | 1 |
| Pyramid Scene Parsing Network | Dec 4, 2016 | Dichotomous Image SegmentationImage Classification | CodeCode Available | 1 |
| Rethinking Atrous Convolution for Semantic Image Segmentation | Jun 17, 2017 | 2D Semantic SegmentationDichotomous Image Segmentation | CodeCode Available | 1 |
| RGB-D Saliency Detection via Cascaded Mutual Information Minimization | Sep 15, 2021 | Saliency DetectionThermal Image Segmentation | CodeCode Available | 1 |
| RGB-D Salient Object Detection via 3D Convolutional Neural Networks | Jan 25, 2021 | Decoderobject-detection | CodeCode Available | 1 |
| RGB-T Semantic Segmentation with Location, Activation, and Sharpening | Oct 26, 2022 | DecoderScene Understanding | CodeCode Available | 1 |
| SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | May 31, 2021 | 2D Semantic SegmentationC++ code | CodeCode Available | 1 |
| Segmenter: Transformer for Semantic Segmentation | May 12, 2021 | Decoderimage-classification | CodeCode Available | 1 |
| SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation | Nov 2, 2015 | Crowd CountingDecoder | CodeCode Available | 1 |
| Select, Supplement and Focus for RGB-D Saliency Detection | Jun 1, 2020 | RGB-D Salient Object DetectionSaliency Detection | CodeCode Available | 1 |
| Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial Images | Sep 21, 2022 | Contrastive LearningImage Augmentation | CodeCode Available | 1 |
| ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation | Aug 24, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Specificity-preserving RGB-D Saliency Detection | Aug 18, 2021 | Decoderobject-detection | CodeCode Available | 1 |
| UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders | Apr 13, 2020 | RGB-D Salient Object DetectionSaliency Detection | CodeCode Available | 1 |
| UNet++: A Nested U-Net Architecture for Medical Image Segmentation | Jul 18, 2018 | Camouflaged Object SegmentationImage Segmentation | CodeCode Available | 1 |
| Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance | Mar 4, 2025 | DecoderSemantic Segmentation | CodeCode Available | 1 |
| Visual Saliency Transformer | Apr 25, 2021 | Boundary DetectionDecoder | CodeCode Available | 1 |
| We Learn Better Road Pothole Detection: from Attention Aggregation to Adversarial Domain Adaptation | Aug 16, 2020 | Domain AdaptationSegmentation | CodeCode Available | 1 |