SOTAVerified

cross-modal alignment

Papers

Showing 241250 of 342 papers

TitleStatusHype
Cross-modal Alignment with Optimal Transport for CTC-based ASR0
Sound Source Localization is All about Cross-Modal Alignment0
Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action RecognitionCode1
Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation0
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images0
Position-Enhanced Visual Instruction Tuning for Multimodal Large Language ModelsCode1
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language NavigationCode1
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment0
Language-Guided Diffusion Model for Visual GroundingCode0
AerialVLN: Vision-and-Language Navigation for UAVsCode2
Show:102550
← PrevPage 25 of 35Next →

No leaderboard results yet.