MXM-CLR: A Unified Framework for Contrastive Learning of Multifold Cross-Modal Representations Mar 20, 2023 Contrastive Learning Cross-Modal Retrieval
Deep Supervised Cross-Modal Retrieval Jun 1, 2019 Cross-Modal Retrieval Retrieval
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation Jul 1, 2021 Audio to Text Retrieval Cross-Modal Retrieval
Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval Sep 30, 2024 Cross-Modal Retrieval Large Language Model
Multilingual Vision-Language Pre-training for the Remote Sensing Domain Oct 30, 2024 Cross-Modal Retrieval image-classification
Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations Jun 1, 2019 Contrastive Learning Cross-Modal Retrieval
Deep Sketched Output Kernel Regression for Structured Prediction Jun 13, 2024 Cross-Modal Retrieval Prediction
MuLan: A Joint Embedding of Music Audio and Natural Language Aug 26, 2022 Cross-Modal Retrieval Music Tagging
MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval May 4, 2018 Cross-Modal Retrieval Retrieval
Modality-specific Cross-modal Similarity Measurement with Recurrent Attention Network Aug 16, 2017 Cross-Modal Retrieval Retrieval
ContextRefine-CLIP for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2025 Jun 12, 2025 Cross-Modal Retrieval Ensemble Learning
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning Dec 31, 2020 Contrastive Learning Cross-Modal Retrieval
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map Jul 17, 2024 Cross-Modal Retrieval Dimensionality Reduction
Deep Reversible Consistency Learning for Cross-modal Retrieval Jan 10, 2025 Cross-Modal Retrieval Representation Learning
Picture It In Your Mind: Generating High Level Visual Representations From Textual Descriptions Jun 23, 2016 Cross-Modal Information Retrieval Cross-Modal Retrieval
Alternative Telescopic Displacement: An Efficient Multimodal Alignment Method Jun 29, 2023 Arrhythmia Detection Cross-Modal Retrieval
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval Oct 1, 2019 Cross-Modal Retrieval Retrieval
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Mar 29, 2023 Cross-Modal Retrieval Decoder
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning Aug 1, 2020 Cross-Modal Retrieval Representation Learning
Learning Visual Actions Using Multiple Verb-Only Labels Jul 25, 2019 Action Recognition Cross-Modal Retrieval
Learning TFIDF Enhanced Joint Embedding for Recipe-Image Cross-Modal Retrieval Service Aug 2, 2021 Cross-Modal Retrieval Retrieval
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task Oct 8, 2019 Cross-Modal Retrieval Image to text
Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering Oct 22, 2021 Cross-Modal Retrieval Feature Engineering
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models May 8, 2025 Active Learning cross-modal alignment
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images May 3, 2019 Cross-Modal Retrieval Nutrition
Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search Sep 28, 2023 cross-modal alignment Cross-Modal Retrieval
PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Mar 20, 2025 Contrastive Learning Cross-Modal Retrieval
Learnable PINs: Cross-Modal Embeddings for Person Identity May 2, 2018 Cross-Modal Retrieval Retrieval
Language-Agnostic Visual-Semantic Embeddings Oct 1, 2019 Cross-Modal Retrieval Retrieval
Deep Cross-Modal Projection Learning for Image-Text Matching Sep 1, 2018 Cross-Modal Retrieval Image-text matching
Adversarial Modality Alignment Network for Cross-Modal Molecule Retrieval Mar 8, 2023 Contrastive Learning Cross-Modal Retrieval
Deep Cross-Modal Hashing Feb 15, 2016 Cross-Modal Retrieval Retrieval
InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution Oct 20, 2023 Cross-Modal Retrieval Retrieval
Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning Jun 26, 2024 Contrastive Learning Cross-Modal Retrieval
Improving Medical Multi-modal Contrastive Learning with Expert Annotations Mar 15, 2024 Contrastive Learning Cross-Modal Retrieval
Impression-CLIP: Contrastive Shape-Impression Embedding for Fonts Feb 26, 2024 Cross-Modal Retrieval Retrieval
Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Sep 21, 2023 Cross-Modal Retrieval Image Captioning
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Aug 22, 2022 All Cross-Modal Retrieval
Harmonized Multimodal Learning with Gaussian Process Latent Variable Models Aug 14, 2019 Cross-Modal Retrieval Retrieval
Deep Class-guided Hashing for Multi-label Cross-modal Retrieval Oct 20, 2024 Cross-Modal Retrieval Deep Hashing
Finding beans in burgers: Deep semantic-visual embedding with localization Apr 5, 2018 Cross-Modal Retrieval Image Captioning
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval Jul 29, 2022 Cross-Modal Retrieval Image-text matching
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval May 15, 2021 Cross-Modal Retrieval Quantization
Deep Binary Reconstruction for Cross-modal Hashing Aug 17, 2017 Cross-Modal Retrieval Retrieval
Context-Aware Embeddings for Automatic Art Analysis Apr 10, 2019 Art Analysis Cross-Modal Retrieval
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval Apr 6, 2023 Cross-Modal Retrieval Image-text Retrieval
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models Apr 21, 2023 Cross-Modal Retrieval Image-text matching
Exploring modality-agnostic representations for music classification Jun 2, 2021 Classification Cross-Modal Retrieval
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks Jan 12, 2023 Cross-Modal Retrieval Open-Ended Question Answering