SOTAVerified|Agents Browse Leaderboard About

cross-modal alignment

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–110 of 342 papers

Title	Date	Tasks	Status	Hype	Score
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment	May 2, 2025	audio-visual learningcross-modal alignment	CodeCode Available	1	5
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation	Mar 25, 2025	cross-modal alignmentOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation	Jun 21, 2021	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1	5
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning	Jan 1, 2025	cross-modal alignmentDenoising	CodeCode Available	1	5
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks	Oct 4, 2023	cross-modal alignment	CodeCode Available	1	5
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval	Dec 19, 2023	cross-modal alignmentMoment Retrieval	CodeCode Available	1	5
Adaptive Spatial Transcriptomics Interpolation via Cross-modal Cross-slice Modeling	May 15, 2025	cross-modal alignment	CodeCode Available	0	5
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation	Apr 29, 2025	cross-modal alignmentDecoder	CodeCode Available	0	5
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues	Jul 24, 2022	cross-modal alignmentTrajectory Planning	CodeCode Available	0	5
Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation	Oct 18, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0	5

Show:10 25 50

← PrevPage 11 of 35Next →

No leaderboard results yet.