SOTAVerified

cross-modal alignment

Papers

Showing 271–280 of 342 papers

Title | Status | Hype
TOT: Topology-Aware Optimal Transport For Multimodal Hate Detection | - | 0
End-to-end Semantic Object Detection with Cross-Modal Alignment | - | 0
Does Vision Accelerate Hierarchical Generalization in Neural Language Learners? | - | 0
Improving Cross-modal Alignment for Text-Guided Image Inpainting | - | 0
Linguistic Query-Guided Mask Generation for Referring Image Segmentation | - | 0
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training | - | 0
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | Code | 2
SimVTP: Simple Video Text Pre-training with Masked Autoencoders | Code | 0
Asymmetric Cross-Scale Alignment for Text-Based Person Search | Code | 0
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning | Code | 1

No leaderboard results yet.