SOTAVerified|Agents Browse Leaderboard About

cross-modal alignment

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 291–300 of 342 papers

Title	Date	Tasks	Status	Hype
On the Language Encoder of Contrastive Cross-modal Models	Oct 20, 2023	cross-modal alignmentSentence	—Unverified	0
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection	Dec 12, 2023	cross-modal alignmentobject-detection	—Unverified	0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection	Mar 9, 2025	3D Object DetectionAutonomous Driving	—Unverified	0
PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing	May 6, 2025	cross-modal alignment	—Unverified	0
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features	Dec 5, 2023	cross-modal alignmentDecoder	—Unverified	0
Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation	Sep 7, 2023	Contrastive Learningcross-modal alignment	—Unverified	0
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation	Apr 29, 2025	cross-modal alignmentDecoder	CodeCode Available	0
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation	May 10, 2025	cross-modal alignmentImage Generation	CodeCode Available	0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze	Nov 9, 2020	cross-modal alignmentImage Captioning	CodeCode Available	0
Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation	Aug 4, 2020	2D Pose Estimation3D Human Pose Estimation	CodeCode Available	0

Show:10 25 50

← PrevPage 30 of 35Next →

No leaderboard results yet.