SOTAVerified

cross-modal alignment

Papers

Showing 221–230 of 342 papers

| Title | Status | Hype |
|---|---|---|
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | | 0 |
| OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities | | 0 |
| CAST: Cross-modal Alignment Similarity Test for Vision Language Models | Code | 0 |
| KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph | Code | 0 |
| NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training | | 0 |
| Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization | | 0 |
| GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding | | 0 |
| Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR | | 0 |
| Focus on Focus: Focus-oriented Representation Learning and Multi-view Cross-modal Alignment for Glioma Grading | Code | 0 |
| Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval | | 0 |
Page 23 of 35

No leaderboard results yet.