| Fully Aligned Network for Referring Image Segmentation | Sep 29, 2024 | cross-modal alignmentDecoder | —Unverified | 0 |
| Fusing Cross-modal and Uni-modal Representations: A Kronecker Product Approach | Jun 10, 2025 | cross-modal alignment | —Unverified | 0 |
| GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding | Sep 6, 2024 | cross-modal alignmentLanguage Modelling | —Unverified | 0 |
| GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations | Mar 26, 2025 | cross-modal alignmentEmotion Classification | —Unverified | 0 |
| Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation | Jan 1, 2025 | Classificationcross-modal alignment | —Unverified | 0 |
| Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations | Jun 10, 2025 | cross-modal alignmentNavigate | —Unverified | 0 |
| GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning | Dec 10, 2024 | cross-modal alignmentVideo Understanding | —Unverified | 0 |
| Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Mar 10, 2025 | 3D Object Detectioncross-modal alignment | —Unverified | 0 |
| Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching | Jun 5, 2024 | cross-modal alignmentImage-text matching | —Unverified | 0 |
| HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training | Dec 30, 2022 | cross-modal alignmentTGIF-Action | —Unverified | 0 |