SOTAVerified

cross-modal alignment

Papers

Showing 111-120 of 342 papers

Title | Status | Hype
Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment | - | 0
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning | Code | 1
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement | Code | 1
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation | - | 0
Audio-Visual Semantic Graph Network for Audio-Visual Event Localization | - | 0
ChartAdapter: Large Vision-Language Model for Chart Summarization | - | 0
Enhancing Visual Representation for Text-based Person Searching | Code | 0
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment | - | 0
Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data | - | 0
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding | Code | 1

No leaderboard results yet.