SOTAVerified

cross-modal alignment

Papers

Showing 141150 of 342 papers

TitleStatusHype
LESS: Label-Efficient and Single-Stage Referring 3D SegmentationCode1
OMCAT: Omni Context Aware Transformer0
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal PerspectiveCode0
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration RateCode2
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment0
Intriguing Properties of Large Language and Vision Models0
TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation0
Boosting Masked ECG-Text Auto-Encoders as Discriminative LearnersCode1
Melody-Guided Music GenerationCode2
Fully Aligned Network for Referring Image Segmentation0
Show:102550
← PrevPage 15 of 35Next →

No leaderboard results yet.