SOTAVerified

cross-modal alignment

Papers

Showing 291300 of 342 papers

TitleStatusHype
On the Language Encoder of Contrastive Cross-modal Models0
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection0
PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing0
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features0
Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation0
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report GenerationCode0
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image GenerationCode0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose EstimationCode0
Show:102550
← PrevPage 30 of 35Next →

No leaderboard results yet.