SOTAVerified

cross-modal alignment

Papers

Showing 231240 of 342 papers

TitleStatusHype
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos0
Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval0
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models0
Revisiting Misalignment in Multispectral Pedestrian Detection: A Language-Driven Approach for Cross-modal Alignment Fusion0
Scene-Intuitive Agent for Remote Embodied Visual Grounding0
SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity0
See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity0
Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection0
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training0
Semantic-Space-Intervened Diffusive Alignment for Visual Classification0
Show:102550
← PrevPage 24 of 35Next →

No leaderboard results yet.