SOTAVerified

cross-modal alignment

Papers

Showing 111-120 of 342 papers

| Title | Status | Hype |
|---|---|---|
| EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment | | 0 |
| EA-VTR: Event-Aware Video-Text Retrieval | | 0 |
| CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | | 0 |
| Dynamic Cross-Modal Alignment for Robust Semantic Location Prediction | | 0 |
| DUNIA: Pixel-Sized Embeddings via Cross-Modal Alignment for Earth Observation Applications | | 0 |
| Technical Approach for the EMI Challenge in the 8th Affective Behavior Analysis in-the-Wild Competition | | 0 |
| Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | | 0 |
| ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs | | 0 |
| 4D-ACFNet: A 4D Attention Mechanism-Based Prognostic Framework for Colorectal Cancer Liver Metastasis Integrating Multimodal Spatiotemporal Features | | 0 |
| Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs | | 0 |
Page 12 of 35

No leaderboard results yet.