SOTAVerified

cross-modal alignment

Papers

Showing 221230 of 342 papers

TitleStatusHype
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment RetrievalCode1
Mask Grounding for Referring Image SegmentationCode1
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge BaseCode0
Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image Captioning0
ViLA: Efficient Video-Language Alignment for Video Question AnsweringCode1
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection0
Navigating Open Set Scenarios for Skeleton-based Action RecognitionCode1
Progressive Multi-Modality Learning for Inverse Protein FoldingCode1
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features0
DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation0
Show:102550
← PrevPage 23 of 35Next →

No leaderboard results yet.