SOTAVerified

cross-modal alignment

Papers

Showing 131-140 of 342 papers

Title | Status | Hype
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment | - | 0
SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality | Code | 1
Revisiting Misalignment in Multispectral Pedestrian Detection: A Language-Driven Approach for Cross-modal Alignment Fusion | - | 0
Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge | - | 0
CTPD: Cross-Modal Temporal Pattern Discovery for Enhanced Multimodal Electronic Health Records Analysis | - | 0
Towards Cross-Modal Text-Molecule Retrieval with Better Modality Alignment | Code | 0
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrieval | - | 0
AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition | Code | 1
Modeling the Human Visual System: Comparative Insights from Response-Optimized and Task-Optimized Vision Models, Language Models, and different Readout Mechanisms | - | 0
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding | - | 0

No leaderboard results yet.