FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Oct 26, 2022 Cross-Modal Retrieval Decoder
— Unverified 0Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Oct 24, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Dissecting Deep Metric Learning Losses for Image-Text Retrieval Oct 21, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 0PoseScript: Linking 3D Human Poses and Natural Language Oct 21, 2022 Cross-Modal Retrieval Image Captioning
Code Code Available 2Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Oct 19, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Cross-modal Search Method of Technology Video based on Adversarial Learning and Feature Fusion Oct 11, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Deep Evidential Learning with Noisy Correspondence for Cross-Modal Retrieval Oct 10, 2022 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 1ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training Sep 30, 2022 Computational Efficiency Contrastive Learning
Code Code Available 0Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Sep 27, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Information-Theoretic Hashing for Zero-Shot Cross-Modal Retrieval Sep 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval Sep 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0Learning to Evaluate Performance of Multi-modal Semantic Localization Sep 14, 2022 Cross-Modal Retrieval Referring Expression
Code Code Available 1A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language Sep 12, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1A Channel Mix Method for Fine-Grained Cross-Modal Retrieval Aug 26, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 0Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning Aug 26, 2022 Cross-Modal Retrieval Machine Translation
Code Code Available 1MuLan: A Joint Embedding of Music Audio and Natural Language Aug 26, 2022 Cross-Modal Retrieval Music Tagging
Code Code Available 0Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Aug 22, 2022 All Cross-Modal Retrieval
Code Code Available 0See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity Aug 7, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification Aug 4, 2022 Cross-Modal Retrieval Person Re-Identification
Code Code Available 1ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval Jul 29, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 0Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval Jul 29, 2022 Cross-Modal Retrieval Data Augmentation
— Unverified 0Adaptive Asymmetric Label-guided Hashing for Multimedia Search Jul 26, 2022 Cross-Modal Retrieval Quantization
— Unverified 0Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Integrating multi-label contrastive learning with dual adversarial graph neural networks for cross-modal retrieval Jul 5, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval Jul 2, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation Jun 26, 2022 Cross-Modal Retrieval Representation Learning
— Unverified 0Emphasizing Complementary Samples for Non-literal Cross-modal Retrieval Jun 25, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Comprehending and Ordering Semantics for Image Captioning Jun 14, 2022 Cross-Modal Retrieval Image Captioning
Code Code Available 2HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval May 24, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Deep Supervised Information Bottleneck Hashing for Cross-modal Retrieval based Computer-aided Diagnosis May 6, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 2Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Apr 20, 2022 Cross-Modal Retrieval Image Retrieval
— Unverified 0Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval Apr 20, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 1Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing Apr 19, 2022 Binarization Cross-Modal Retrieval
— Unverified 0Learning Similarity Preserving Binary Codes for Recommender Systems Apr 18, 2022 Binarization Cross-Modal Retrieval
— Unverified 0COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval Apr 15, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval Mar 31, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Learning Program Representations for Food Images and Cooking Recipes Mar 30, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Cross-Media Scientific Research Achievements Retrieval Based on Deep Language Model Mar 29, 2022 Cross-Modal Retrieval Language Modeling
— Unverified 0On Metric Learning for Audio-Text Cross-Modal Retrieval Mar 29, 2022 AudioCaps Cross-Modal Retrieval
Code Code Available 1LILE: Look In-Depth before Looking Elsewhere -- A Dual Attention Network using Transformers for Cross-Modal Information Retrieval in Histopathology Archives Mar 2, 2022 Cross-Modal Information Retrieval Cross-Modal Retrieval
— Unverified 0Vision-Language Pre-Training with Triple Contrastive Learning Feb 21, 2022 Contrastive Learning cross-modal alignment
Code Code Available 2Efficient Cross-Modal Retrieval via Deep Binary Hashing and Quantization Feb 15, 2022 Cross-Modal Retrieval Deep Hashing
Code Code Available 0IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages Jan 27, 2022 Cross-Modal Retrieval Few-Shot Learning
Code Code Available 1Discriminative Supervised Subspace Learning for Cross-modal Retrieval Jan 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote Sensing Jan 20, 2022 Binarization Cross-Modal Retrieval
— Unverified 0A Text-Image Pair Is not Enough: Language-Vision Relation Inference with Auxiliary Modality Translation Jan 16, 2022 Cross-Modal Retrieval image-classification
— Unverified 0A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval Jan 8, 2022 Cross-Modal Retrieval Information Retrieval
Code Code Available 1