SOTAVerified

cross-modal alignment

Papers

Showing 111120 of 342 papers

TitleStatusHype
TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation0
DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning0
HyperPath: Knowledge-Guided Hyperbolic Semantic Hierarchy Modeling for WSI AnalysisCode0
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction0
TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models0
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration0
OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive AlignmentCode0
Fusing Cross-modal and Uni-modal Representations: A Kronecker Product Approach0
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations0
WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction0
Show:102550
← PrevPage 12 of 35Next →

No leaderboard results yet.