SOTAVerified

cross-modal alignment

Papers

Showing 291300 of 342 papers

TitleStatusHype
Learning Better Visual Representations for Weakly-Supervised Object Detection Using Natural Language Supervision0
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision0
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images0
Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm0
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment0
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding0
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval0
Leveraging Pre-Trained Models for Multimodal Class-Incremental Learning under Adaptive Fusion0
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?0
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization0
Show:102550
← PrevPage 30 of 35Next →

No leaderboard results yet.