| Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Mar 10, 2025 | 3D Object Detectioncross-modal alignment | —Unverified | 0 | 0 |
| Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching | Jun 5, 2024 | cross-modal alignmentImage-text matching | —Unverified | 0 | 0 |
| HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training | Dec 30, 2022 | cross-modal alignmentTGIF-Action | —Unverified | 0 | 0 |
| How do Cross-View and Cross-Modal Alignment Affect Representations in Contrastive Learning? | Nov 23, 2022 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| Improving Cross-modal Alignment for Text-Guided Image Inpainting | Jan 26, 2023 | cross-modal alignmentImage Inpainting | —Unverified | 0 | 0 |
| Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image Captioning | Dec 14, 2023 | cross-modal alignmentDecoder | —Unverified | 0 | 0 |
| Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun 12, 2025 | cross-modal alignmentImage to text | —Unverified | 0 | 0 |
| Improving speech translation by fusing speech and text | May 23, 2023 | cross-modal alignmentMachine Translation | —Unverified | 0 | 0 |
| InfoMAE: Pair-Efficient Cross-Modal Alignment for Multimodal Time-Series Sensing Signals | Apr 13, 2025 | cross-modal alignmentSelf-Supervised Learning | —Unverified | 0 | 0 |
| Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model | Jan 21, 2025 | cross-modal alignmentGraph Embedding | —Unverified | 0 | 0 |
| Intriguing Properties of Large Language and Vision Models | Oct 7, 2024 | cross-modal alignmentLarge Language Model | —Unverified | 0 | 0 |
| JPG - Jointly Learn to Align: Automated Disease Prediction and Radiology Report Generation | Oct 1, 2022 | cross-modal alignmentDisease Prediction | —Unverified | 0 | 0 |
| KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation | Jan 16, 2022 | cross-modal alignmentKnowledge Distillation | —Unverified | 0 | 0 |
| LangBridge: Interpreting Image as a Combination of Language Embeddings | Mar 25, 2025 | cross-modal alignment | —Unverified | 0 | 0 |
| Linguistic Query-Guided Mask Generation for Referring Image Segmentation | Jan 16, 2023 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| Learning Better Visual Representations for Weakly-Supervised Object Detection Using Natural Language Supervision | Sep 29, 2021 | cross-modal alignmentobject-detection | —Unverified | 0 | 0 |
| Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision | Oct 24, 2022 | cross-modal alignmentCross-Modal Retrieval | —Unverified | 0 | 0 |
| Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images | Aug 9, 2021 | cross-modal alignmentCross-Modal Retrieval | —Unverified | 0 | 0 |
| Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm | Jun 3, 2020 | cross-modal alignmentGeneral Classification | —Unverified | 0 | 0 |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | Sep 22, 2024 | Contrastive Learningcross-modal alignment | —Unverified | 0 | 0 |
| Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding | Oct 17, 2024 | cross-modal alignmentSentence | —Unverified | 0 | 0 |
| Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval | Apr 2, 2025 | cross-modal alignmentRetrieval | —Unverified | 0 | 0 |
| Leveraging Pre-Trained Models for Multimodal Class-Incremental Learning under Adaptive Fusion | Feb 7, 2025 | class-incremental learningClass Incremental Learning | —Unverified | 0 | 0 |
| LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? | Mar 10, 2025 | cross-modal alignment | —Unverified | 0 | 0 |
| Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization | Sep 12, 2024 | cross-modal alignment | —Unverified | 0 | 0 |