SOTAVerified

cross-modal alignment

Papers

Showing 121130 of 342 papers

TitleStatusHype
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs0
4D-ACFNet: A 4D Attention Mechanism-Based Prognostic Framework for Colorectal Cancer Liver Metastasis Integrating Multimodal Spatiotemporal Features0
Enhancing Modality Representation and Alignment for Multimodal Cold-start Active Learning0
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs0
Enhancing Vision-Language Compositional Understanding with Multimodal Synthetic Data0
Does Vision Accelerate Hierarchical Generalization in Neural Language Learners?0
CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling0
Disentangled Noisy Correspondence Learning0
Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment0
Show:102550
← PrevPage 13 of 35Next →

No leaderboard results yet.