| Panoptic Feature Pyramid Networks | Jan 8, 2019 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 4 |
| Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Apr 5, 2024 | DecoderMamba | CodeCode Available | 3 |
| U-Net: Convolutional Networks for Biomedical Image Segmentation | May 18, 2015 | Cell SegmentationCell Tracking | CodeCode Available | 3 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Apr 26, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 2 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| Context Encoding for Semantic Segmentation | Mar 23, 2018 | image-classificationImage Classification | CodeCode Available | 2 |
| Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance | Mar 4, 2025 | DecoderSemantic Segmentation | CodeCode Available | 1 |
| CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes | Jul 1, 2024 | Autonomous VehiclesImage Segmentation | CodeCode Available | 1 |
| Context-Aware Interaction Network for RGB-T Semantic Segmentation | Jan 3, 2024 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 1 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoderobject-detection | CodeCode Available | 1 |
| MMSFormer: Multimodal Transformer for Material and Semantic Segmentation | Sep 7, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing | Aug 15, 2023 | Scene ParsingSemantic Segmentation | CodeCode Available | 1 |
| PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation | Aug 8, 2023 | Infrared And Visible Image FusionSegmentation | CodeCode Available | 1 |
| Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation | Aug 4, 2023 | Autonomous DrivingSegmentation | CodeCode Available | 1 |
| Complementary Random Masking for RGB-Thermal Semantic Segmentation | Mar 30, 2023 | Scene UnderstandingSemantic Segmentation | CodeCode Available | 1 |
| Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks | Mar 28, 2023 | Crowd Countingobject-detection | CodeCode Available | 1 |
| CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images | Feb 22, 2023 | Knowledge DistillationScene Understanding | CodeCode Available | 1 |
| RGB-T Semantic Segmentation with Location, Activation, and Sharpening | Oct 26, 2022 | DecoderScene Understanding | CodeCode Available | 1 |
| Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial Images | Sep 21, 2022 | Contrastive LearningImage Augmentation | CodeCode Available | 1 |
| Glass Segmentation with RGB-Thermal Image Pairs | Apr 12, 2022 | SegmentationThermal Image Segmentation | CodeCode Available | 1 |
| Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing | Dec 9, 2021 | Scene ParsingThermal Image Segmentation | CodeCode Available | 1 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | BenchmarkingScene Segmentation | CodeCode Available | 1 |