SOTAVerified

cross-modal alignment

Papers

Showing 251260 of 342 papers

TitleStatusHype
Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image GenerationCode0
Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person RetrievalCode1
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation0
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models0
Global and Local Semantic Completion Learning for Vision-Language Pre-trainingCode1
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation LearningCode1
SOC: Semantic-Assisted Object Cluster for Referring Video Object SegmentationCode1
Improving speech translation by fusing speech and text0
Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment0
Multi-task Paired Masking with Alignment Modeling for Medical Vision-Language Pre-training0
Show:102550
← PrevPage 26 of 35Next →

No leaderboard results yet.