Where Does the Performance Improvement Come From? -- A Reproducibility Concern about Image-Text Retrieval Mar 8, 2022 Image-text Retrieval Information Retrieval
Code Code Available 1An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing Feb 26, 2022 Image-text Retrieval Meta-Learning
Code Code Available 0Vision-Language Pre-Training with Triple Contrastive Learning Feb 21, 2022 Contrastive Learning cross-modal alignment
Code Code Available 2CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval Feb 15, 2022 Image-text Retrieval Representation Learning
— Unverified 0Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark Feb 14, 2022 Benchmarking Contrastive Learning
Code Code Available 0DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
Code Code Available 3BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation Jan 28, 2022 Image Captioning Image-text matching
Code Code Available 5Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding Jan 16, 2022 Retrieval Text Retrieval
— Unverified 0Bridging Video-text Retrieval with Multiple Choice Questions Jan 13, 2022 Action Recognition Linear evaluation
Code Code Available 1Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval Dec 17, 2021 Image-text Retrieval Retrieval
— Unverified 0Cross-modal Contrastive Learning for Speech Translation Dec 17, 2021 Contrastive Learning Retrieval
— Unverified 0Audio Retrieval with Natural Language Queries: A Benchmark Study Dec 17, 2021 AudioCaps Audio captioning
Code Code Available 1CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision Dec 14, 2021 Contrastive Learning Representation Learning
Code Code Available 1Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation Dec 10, 2021 Image-text matching Image-text Retrieval
— Unverified 0Densifying Sparse Representations for Passage Retrieval by Representational Slicing Dec 9, 2021 Passage Retrieval Retrieval
Code Code Available 1Video-Text Pre-training with Learned Regions Dec 2, 2021 Representation Learning Retrieval
Code Code Available 1UFO: A UniFied TransfOrmer for Vision-Language Representation Learning Nov 19, 2021 Image Captioning Image-text matching
— Unverified 0Constructing Phrase-level Semantic Labels to Form Multi-GrainedSupervision for Image-Text Retrieval Nov 16, 2021 Form Image-text Retrieval
— Unverified 0ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities Nov 16, 2021 Articles Face Recognition
Code Code Available 0SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval Nov 10, 2021 Contrastive Learning Cross-Modal Retrieval
— Unverified 0CLIP2TV: Align, Match and Distill for Video-Text Retrieval Nov 10, 2021 Representation Learning Retrieval
— Unverified 0FILIP: Fine-grained Interactive Language-Image Pre-Training Nov 9, 2021 image-classification Image Classification
Code Code Available 1Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval Nov 5, 2021 Image-text Retrieval Retrieval
Code Code Available 0VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts Nov 3, 2021 Image Retrieval Image-text Retrieval
Code Code Available 1Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder Nov 1, 2021 Decoder Language Modeling
Code Code Available 1Deep Keyphrase Completion Oct 29, 2021 Decoder Keyphrase Extraction
— Unverified 0Dense Hierarchical Retrieval for Open-Domain Question Answering Oct 28, 2021 Open-Domain Question Answering Question Answering
Code Code Available 1Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations Oct 14, 2021 Representation Learning Retrieval
— Unverified 0ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation Oct 11, 2021 Moment Retrieval Retrieval
— Unverified 0Adversarial Retriever-Ranker for dense text retrieval Oct 7, 2021 Natural Questions Retrieval
— Unverified 0A Proposed Conceptual Framework for a Representational Approach to Information Retrieval Oct 4, 2021 Information Retrieval Retrieval
— Unverified 0CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Sep 30, 2021 Contrastive Learning Retrieval
— Unverified 0Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representation Sep 29, 2021 Representation Learning Retrieval
— Unverified 0Learning Context-Adapted Video-Text Retrieval by Attending to User Comments Sep 29, 2021 Retrieval Text Retrieval
— Unverified 0Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval Sep 12, 2021 Form Image-text Retrieval
— Unverified 0EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling Sep 10, 2021 Cross-Modal Retrieval Language Modeling
— Unverified 0Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss Sep 9, 2021 Mixture-of-Experts Retrieval
Code Code Available 1Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model Sep 1, 2021 Retrieval Text Retrieval
— Unverified 0In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval Aug 1, 2021 Document Ranking Knowledge Distillation
— Unverified 0Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes Aug 1, 2021 Benchmarking Binary Classification
— Unverified 0HANet: Hierarchical Alignment Networks for Video-Text Retrieval Jul 26, 2021 Retrieval Text Matching
Code Code Available 1Multi-stage Pre-training over Simplified Multimodal Pre-training Models Jul 22, 2021 Image-text Retrieval Retrieval
Code Code Available 0WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset Jul 20, 2021 Articles Conditional Text Generation
Code Code Available 0More Robust Dense Retrieval with Contrastive Dual Learning Jul 16, 2021 Contrastive Learning Information Retrieval
Code Code Available 1Align before Fuse: Vision and Language Representation Learning with Momentum Distillation Jul 16, 2021 Cross-Modal Retrieval Grounded language learning
Code Code Available 1Dynamic Modality Interaction Modeling for Image-Text Retrieval Jul 11, 2021 cross-modal alignment Cross-Modal Retrieval
Code Code Available 1Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training Jun 25, 2021 Image-text Retrieval Question Answering
— Unverified 0CLIP2Video: Mastering Video-Text Retrieval via Image CLIP Jun 21, 2021 Language Modeling Language Modelling
Code Code Available 1CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback Jun 19, 2021 Image Retrieval Image-text Retrieval
Code Code Available 1Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval Jun 19, 2021 Inductive Bias Retrieval
— Unverified 0