Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval Apr 20, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 1On Metric Learning for Audio-Text Cross-Modal Retrieval Mar 29, 2022 AudioCaps Cross-Modal Retrieval
Code Code Available 1IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages Jan 27, 2022 Cross-Modal Retrieval Few-Shot Learning
Code Code Available 1A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval Jan 8, 2022 Cross-Modal Retrieval Information Retrieval
Code Code Available 1Cross Modal Retrieval with Querybank Normalisation Dec 23, 2021 Cross-Modal Retrieval Metric Learning
Code Code Available 1Fusion and Orthogonal Projection for Improved Face-Voice Association Dec 20, 2021 Cross-Modal Retrieval Triplet
Code Code Available 1Learning with Noisy Correspondence for Cross-modal Matching Dec 1, 2021 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 1Emotion Embedding Spaces for Matching Music to Stories Nov 26, 2021 Cross-Modal Retrieval Metric Learning
Code Code Available 1Florence: A New Foundation Model for Computer Vision Nov 22, 2021 Action Classification Action Recognition
Code Code Available 1Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts Nov 16, 2021 Cross-Modal Retrieval Image Captioning
Code Code Available 1The Curious Layperson: Fine-Grained Image Recognition without Expert Labels Nov 5, 2021 Cross-Modal Retrieval Fine-Grained Image Recognition
Code Code Available 1An Empirical Study of Training End-to-End Vision-and-Language Transformers Nov 3, 2021 Cross-Modal Retrieval Decoder
Code Code Available 1Text2Mol: Cross-Modal Molecule Retrieval with Natural Language Queries Nov 1, 2021 Cross-Modal Retrieval Natural Language Queries
Code Code Available 1BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval Oct 29, 2021 Cross-Modal Retrieval Relation
Code Code Available 1Wav2CLIP: Learning Robust Audio Representations From CLIP Oct 21, 2021 Cross-Modal Retrieval Image Generation
Code Code Available 1Text-Based Person Search with Limited Data Oct 20, 2021 Benchmarking Contrastive Learning
Code Code Available 1X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics Aug 18, 2021 Cross-Modal Retrieval Decoder
Code Code Available 1Adaptive label-aware graph convolutional networks for cross-modal retrieval Aug 6, 2021 Cross-Modal Retrieval Representation Learning
Code Code Available 1Self-supervised Audiovisual Representation Learning for Remote Sensing Data Aug 2, 2021 Cross-Modal Retrieval Representation Learning
Code Code Available 1Align before Fuse: Vision and Language Representation Learning with Momentum Distillation Jul 16, 2021 Cross-Modal Retrieval Grounded language learning
Code Code Available 1Dynamic Modality Interaction Modeling for Image-Text Retrieval Jul 11, 2021 cross-modal alignment Cross-Modal Retrieval
Code Code Available 1FedCMR: Federated Cross-Modal Retrieval Jul 1, 2021 Cross-Modal Retrieval Federated Learning
Code Code Available 1Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval Jun 22, 2021 Cross-Modal Retrieval Diversity
Code Code Available 1Learning Cross-Modal Retrieval With Noisy Labels Jun 19, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 1Learning Relation Alignment for Calibrated Cross-modal Retrieval May 28, 2021 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Dual adversarial graph neural networks for multi-label cross-modal retrieval May 18, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 1MusCaps: Generating Captions for Music Audio Apr 24, 2021 Audio captioning Classification
Code Code Available 1More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval Mar 25, 2021 All Cross-Modal Retrieval
Code Code Available 1Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning Mar 24, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 1Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval Mar 22, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 1Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query Mar 2, 2021 Cross-Modal Retrieval Image Retrieval
Code Code Available 1ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision Feb 5, 2021 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Probabilistic Embeddings for Cross-Modal Retrieval Jan 13, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 1Similarity Reasoning and Filtration for Image-Text Matching Jan 5, 2021 Cross-Modal Retrieval Image Retrieval
Code Code Available 1VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words Jan 1, 2021 CPU Cross-Modal Information Retrieval
Code Code Available 1StacMR: Scene-Text Aware Cross-Modal Retrieval Dec 8, 2020 Cross-Modal Retrieval Information Retrieval
Code Code Available 1CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching Dec 1, 2020 Computer Security Cross-Modal Retrieval
Code Code Available 1COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning Nov 1, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 1Multimodal Metric Learning for Tag-based Music Retrieval Oct 30, 2020 Cross-Modal Retrieval Metric Learning
Code Code Available 1Learning Dual Semantic Relations with Graph Attention for Image-Text Matching Oct 22, 2020 Cross-Modal Retrieval Graph Attention
Code Code Available 1Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders Aug 12, 2020 Cross-Modal Information Retrieval Cross-Modal Retrieval
Code Code Available 1Rescaling Egocentric Vision Jun 23, 2020 Action Anticipation Action Detection
Code Code Available 1Neural Methods for Point-wise Dependency Estimation Jun 9, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 1Sketch Less for More: On-the-Fly Fine-Grained Sketch-Based Image Retrieval Jun 1, 2020 Cross-Modal Retrieval Image Retrieval
Code Code Available 1FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval May 20, 2020 Cross-Modal Retrieval Retrieval
Code Code Available 1COBRA: Contrastive Bi-Modal Representation Algorithm May 7, 2020 Cross-Modal Retrieval Image Captioning
Code Code Available 1Graph Structured Network for Image-Text Matching Apr 1, 2020 Attribute Cross-Modal Retrieval
Code Code Available 1IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval Mar 8, 2020 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning Mar 1, 2020 Cross-Modal Retrieval Retrieval
Code Code Available 1