SOTAVerified

cross-modal alignment

Papers

Showing 91–100 of 342 papers

Title | Status | Hype
CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation | Code | 1
CLIP-Driven Fine-grained Text-Image Person Re-identification | Code | 1
Low-resource Neural Machine Translation with Cross-modal Alignment | Code | 1
Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning | Code | 1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment | Code | 1
Fine-Grained Semantically Aligned Vision-Language Pre-Training | Code | 1
BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning | Code | 1
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections | Code | 1
DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors | Code | 1
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding | Code | 1

No leaderboard results yet.