| Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation | May 11, 2025 | Autonomous DrivingDomain Adaptation | —Unverified | 0 |
| Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance | Mar 4, 2025 | DecoderSemantic Segmentation | CodeCode Available | 1 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion | Jul 31, 2024 | Scene ParsingSemantic Segmentation | —Unverified | 0 |
| CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes | Jul 1, 2024 | Autonomous VehiclesImage Segmentation | CodeCode Available | 1 |
| UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Apr 26, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 2 |
| Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Apr 5, 2024 | DecoderMamba | CodeCode Available | 3 |
| HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion | Apr 4, 2024 | Scene ParsingSemantic Segmentation | CodeCode Available | 0 |
| Context-Aware Interaction Network for RGB-T Semantic Segmentation | Jan 3, 2024 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 1 |
| IGFNet: Illumination-Guided Fusion Network for Semantic Scene Understanding using RGB-Thermal Images | Dec 4, 2023 | Autonomous DrivingScene Understanding | CodeCode Available | 0 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoderobject-detection | CodeCode Available | 1 |
| InfraParis: A multi-modal and multi-task autonomous driving dataset | Sep 27, 2023 | Autonomous DrivingMonocular Depth Estimation | CodeCode Available | 0 |
| CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing | Sep 14, 2023 | Scene ParsingThermal Image Segmentation | —Unverified | 0 |
| MMSFormer: Multimodal Transformer for Material and Semantic Segmentation | Sep 7, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation | Aug 24, 2023 | RelationSegmentation | —Unverified | 0 |
| EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing | Aug 15, 2023 | Scene ParsingSemantic Segmentation | CodeCode Available | 1 |
| PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation | Aug 8, 2023 | Infrared And Visible Image FusionSegmentation | CodeCode Available | 1 |
| Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation | Aug 4, 2023 | Autonomous DrivingSegmentation | CodeCode Available | 1 |
| Variational Probabilistic Fusion Network for RGB-T Semantic Segmentation | Jul 17, 2023 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation | Jun 17, 2023 | Autonomous DrivingSaliency Detection | —Unverified | 0 |
| Complementary Random Masking for RGB-Thermal Semantic Segmentation | Mar 30, 2023 | Scene UnderstandingSemantic Segmentation | CodeCode Available | 1 |
| Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks | Mar 28, 2023 | Crowd Countingobject-detection | CodeCode Available | 1 |
| SpiderMesh: Spatial-aware Demand-guided Recursive Meshing for RGB-T Semantic Segmentation | Mar 15, 2023 | Data AugmentationSegmentation | CodeCode Available | 0 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal Images | Feb 22, 2023 | Knowledge DistillationScene Understanding | CodeCode Available | 1 |
| GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing | Oct 31, 2022 | DecoderScene Parsing | —Unverified | 0 |
| RGB-T Semantic Segmentation with Location, Activation, and Sharpening | Oct 26, 2022 | DecoderScene Understanding | CodeCode Available | 1 |
| Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial Images | Sep 21, 2022 | Contrastive LearningImage Augmentation | CodeCode Available | 1 |
| DooDLeNet: Double DeepLab Enhanced Feature Fusion for Thermal-color Semantic Segmentation | Apr 21, 2022 | DecoderSegmentation | —Unverified | 0 |
| Glass Segmentation with RGB-Thermal Image Pairs | Apr 12, 2022 | SegmentationThermal Image Segmentation | CodeCode Available | 1 |
| MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding | Apr 5, 2022 | Autonomous VehiclesScene Understanding | —Unverified | 0 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction | Dec 27, 2021 | object-detectionObject Detection | —Unverified | 0 |
| Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing | Dec 9, 2021 | Scene ParsingThermal Image Segmentation | CodeCode Available | 1 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | BenchmarkingScene Segmentation | CodeCode Available | 1 |
| FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation | Oct 18, 2021 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 1 |
| RGB-D Saliency Detection via Cascaded Mutual Information Minimization | Sep 15, 2021 | Saliency DetectionThermal Image Segmentation | CodeCode Available | 1 |
| RGB-D Salient Object Detection with Ubiquitous Target Awareness | Sep 8, 2021 | Objectobject-detection | —Unverified | 0 |
| ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation | Aug 24, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Specificity-preserving RGB-D Saliency Detection | Aug 18, 2021 | Decoderobject-detection | CodeCode Available | 1 |
| Calibrated RGB-D Salient Object Detection | Jun 19, 2021 | Objectobject-detection | CodeCode Available | 1 |
| ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation | Jun 19, 2021 | Image-to-Image TranslationSegmentation | —Unverified | 0 |
| SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | May 31, 2021 | 2D Semantic SegmentationC++ code | CodeCode Available | 1 |
| Segmenter: Transformer for Semantic Segmentation | May 12, 2021 | Decoderimage-classification | CodeCode Available | 1 |
| Visual Saliency Transformer | Apr 25, 2021 | Boundary DetectionDecoder | CodeCode Available | 1 |
| Enhanced Boundary Learning for Glass-like Object Segmentation | Mar 29, 2021 | DecoderObject | CodeCode Available | 1 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Mar 25, 2021 | image-classificationImage Classification | CodeCode Available | 2 |
| RGB-D Salient Object Detection via 3D Convolutional Neural Networks | Jan 25, 2021 | Decoderobject-detection | CodeCode Available | 1 |
| Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis | Nov 13, 2020 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| We Learn Better Road Pothole Detection: from Attention Aggregation to Adversarial Domain Adaptation | Aug 16, 2020 | Domain AdaptationSegmentation | CodeCode Available | 1 |