SOTAVerified

cross-modal alignment

Papers

Showing 301310 of 342 papers

TitleStatusHype
Cross-Modal Alignment Learning of Vision-Language Conceptual Systems0
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location CuesCode0
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation LearningCode1
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix0
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connectionsCode1
Reinforced Cross-modal Alignment for Radiology Report GenerationCode0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D DetectorsCode1
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal GroundingCode1
Vision-Language Pre-Training with Triple Contrastive LearningCode2
Show:102550
← PrevPage 31 of 35Next →

No leaderboard results yet.