SOTAVerified

cross-modal alignment

Papers

Showing 71-80 of 342 papers

Title | Status | Hype
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval | Code | 1
Mask Grounding for Referring Image Segmentation | Code | 1
ViLA: Efficient Video-Language Alignment for Video Question Answering | Code | 1
Navigating Open Set Scenarios for Skeleton-based Action Recognition | Code | 1
Progressive Multi-Modality Learning for Inverse Protein Folding | Code | 1
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks | Code | 1
VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models | Code | 1
Multi-Semantic Fusion Model for Generalized Zero-Shot Skeleton-Based Action Recognition | Code | 1
Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models | Code | 1
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation | Code | 1
Page 8 of 35

No leaderboard results yet.