Enhancing medical vision-language contrastive learning via inter-matching relation modelling Jan 19, 2024 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Everything is a Video: Unifying Modalities through Next-Frame Prediction Nov 15, 2024 Caption Generation Cross-Modal Retrieval
— Unverified 0Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey Dec 3, 2024 Cross-Modal Retrieval Natural Language Understanding
— Unverified 0Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation Jun 26, 2022 Cross-Modal Retrieval Representation Learning
— Unverified 0Exploring Optimal Transport-Based Multi-Grained Alignments for Text-Molecule Retrieval Nov 4, 2024 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Aug 28, 2023 Cross-Modal Retrieval Image Retrieval
— Unverified 0FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Oct 26, 2022 Cross-Modal Retrieval Decoder
— Unverified 0FedNano: Toward Lightweight Federated Tuning for Pretrained Multimodal Large Language Models Jun 12, 2025 Cross-Modal Retrieval Federated Learning
— Unverified 0FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity Nov 23, 2024 Attribute Cross-Modal Retrieval
— Unverified 0Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings Aug 9, 2019 Cross-Modal Retrieval POS
— Unverified 0Fine-Grained Instance-Level Sketch-Based Video Retrieval Feb 21, 2020 Cross-Modal Retrieval Image Retrieval
— Unverified 0Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval Jan 1, 2024 Cross-Modal Retrieval Retrieval
— Unverified 0FineLIP: Extending CLIP's Reach via Fine-Grained Alignment with Longer Text Inputs Apr 2, 2025 cross-modal alignment Cross-Modal Retrieval
— Unverified 0FLEX-CLIP: Feature-Level GEneration Network Enhanced CLIP for X-shot Cross-modal Retrieval Nov 26, 2024 Cross-Modal Retrieval Retrieval
— Unverified 0FOLIAGE: Towards Physical Intelligence World Models Via Unbounded Surface Evolution May 29, 2025 counterfactual Cross-Modal Retrieval
— Unverified 0Fusing Physics-Driven Strategies and Cross-Modal Adversarial Learning: Toward Multi-Domain Applications Nov 30, 2024 Cross-Modal Retrieval
— Unverified 0Fusion-supervised Deep Cross-modal Hashing Apr 25, 2019 Cross-Modal Retrieval Deep Hashing
— Unverified 0Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval May 31, 2016 Cross-Modal Retrieval Image Retrieval
— Unverified 0Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval Jul 1, 2017 Cross-Modal Retrieval Retrieval
— Unverified 0Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond Feb 16, 2024 Cross-Modal Retrieval Retrieval
— Unverified 0GleanVec: Accelerating vector search with minimalist nonlinear dimensionality reduction Oct 14, 2024 Cross-Modal Retrieval Dimensionality Reduction
— Unverified 0Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval May 14, 2024 Cross-Modal Retrieval Cross-Modal Retrieval on RSITMD
— Unverified 0GMM-Based Comprehensive Feature Extraction and Relative Distance Preservation For Few-Shot Cross-Modal Retrieval May 19, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach Aug 14, 2024 Cross-Modal Retrieval Language Modeling
— Unverified 0Graph Pattern Loss based Diversified Attention Network for Cross-Modal Retrieval Jun 25, 2021 Cross-Modal Retrieval Retrieval
— Unverified 0HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval Nov 26, 2017 Cross-Modal Retrieval Retrieval
— Unverified 0Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples Mar 30, 2023 Cross-Modal Retrieval Retrieval
— Unverified 0HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval May 24, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks Jan 1, 2023 Cross-Modal Retrieval Image Captioning
— Unverified 0Improved Text-Image Matching by Mitigating Visual Semantic Hubs May 22, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation Nov 23, 2024 Cross-Modal Retrieval Image to text
— Unverified 0Improving Sound Source Localization with Joint Slot Attention on Image and Audio Apr 21, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models Jan 1, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Inflate and Shrink:Enriching and Reducing Interactions for Fast Text-Image Retrieval Nov 1, 2021 Cross-Modal Retrieval Image Retrieval
— Unverified 0Information-Theoretic Hashing for Zero-Shot Cross-Modal Retrieval Sep 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Ink Marker Segmentation in Histopathology Images Using Deep Learning Oct 29, 2020 Cross-Modal Retrieval Deep Learning
— Unverified 0Instance-Variant Loss with Gaussian RBF Kernel for 3D Cross-modal Retriveal May 7, 2023 Cross-Modal Retrieval Retrieval
— Unverified 0Integrating Information Theory and Adversarial Learning for Cross-modal Retrieval Apr 11, 2021 Cross-Modal Retrieval Retrieval
— Unverified 0Joint Wasserstein Autoencoders for Aligning Multimodal Embeddings Sep 14, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0Label Prediction Framework for Semi-Supervised Cross-Modal Retrieval May 27, 2019 Cross-Modal Retrieval Prediction
— Unverified 0Large Language Models for Captioning and Retrieving Remote Sensing Images Feb 9, 2024 Cross-Modal Retrieval Decoder
— Unverified 0Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Oct 24, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification Jan 1, 2023 Cross-Modal Retrieval Person Re-Identification
— Unverified 0Learning Discriminative Hashing Codes for Cross-Modal Retrieval based on Multi-view Features Aug 13, 2018 Cross-Modal Retrieval Information Retrieval
— Unverified 0Learning Disentangled Latent Factors from Paired Data in Cross-Modal Retrieval: An Implicit Identifiable VAE Approach Dec 1, 2020 Cross-Modal Retrieval Decoder
— Unverified 0Learning Embodied Semantics via Music and Dance Semiotic Correlations Mar 25, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images Aug 9, 2021 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Learning Program Representations for Food Images and Cooking Recipes Mar 30, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Learning Semantic Concepts and Order for Image and Sentence Matching Dec 6, 2017 Cross-Modal Retrieval Sentence
— Unverified 0Learning Similarity Preserving Binary Codes for Recommender Systems Apr 18, 2022 Binarization Cross-Modal Retrieval
— Unverified 0