SOTAVerified

cross-modal alignment

Papers

Showing 121130 of 342 papers

TitleStatusHype
CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological MeasurementCode0
Listen Then See: Video Alignment with Speaker AttentionCode0
LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal FusionCode0
DAC: 2D-3D Retrieval with Noisy Labels via Divide-and-Conquer Alignment and CorrectionCode0
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and TagsCode0
A coupled autoencoder approach for multi-modal analysis of cell typesCode0
Language-based Image Colorization: A Benchmark and BeyondCode0
Language-Guided Diffusion Model for Visual GroundingCode0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image MaskingCode0
KALE: An Artwork Image Captioning System Augmented with Heterogeneous GraphCode0
Show:102550
← PrevPage 13 of 35Next →

No leaderboard results yet.