Robust Cross-Modal Representation Learning with Progressive Self-Distillation | Apr 10, 2022 | Contrastive Learning, Image Captioning | Unverified
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | Apr 1, 2022 | Diversity, Image Captioning | Code Available
Image-text Retrieval: A Survey on Recent Research and Development | Mar 28, 2022 | Image-text Retrieval, Retrieval | Unverified
Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Mar 27, 2022 | Image-text Retrieval, Question Answering | Code Available
Audio-text Retrieval in Context | Mar 25, 2022 | AudioCaps, Retrieval | Unverified
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding | Mar 11, 2022 | Retrieval, Text Retrieval | Unverified
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval | Mar 10, 2022 | Image-text Retrieval, Retrieval | Unverified
An Uncommon Task: Participatory Design in Legal AI | Mar 8, 2022 | Retrieval, Text Retrieval | Unverified
An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing | Feb 26, 2022 | Image-text Retrieval, Meta-Learning | Code Available
CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval | Feb 15, 2022 | Image-text Retrieval, Representation Learning | Unverified
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark | Feb 14, 2022 | Benchmarking, Contrastive Learning | Code Available
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding | Jan 16, 2022 | Retrieval, Text Retrieval | Unverified
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval | Dec 17, 2021 | Image-text Retrieval, Retrieval | Unverified
Cross-modal Contrastive Learning for Speech Translation | Dec 17, 2021 | Contrastive Learning, Retrieval | Unverified
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation | Dec 10, 2021 | Image-text matching, Image-text Retrieval | Unverified
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning | Nov 19, 2021 | Image Captioning, Image-text matching | Unverified
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval | Nov 16, 2021 | Form, Image-text Retrieval | Unverified
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities | Nov 16, 2021 | Articles, Face Recognition | Code Available
CLIP2TV: Align, Match and Distill for Video-Text Retrieval | Nov 10, 2021 | Representation Learning, Retrieval | Unverified
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | Nov 10, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval | Nov 5, 2021 | Image-text Retrieval, Retrieval | Code Available
Deep Keyphrase Completion | Oct 29, 2021 | Decoder, Keyphrase Extraction | Unverified
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations | Oct 14, 2021 | Representation Learning, Retrieval | Unverified
ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation | Oct 11, 2021 | Moment Retrieval, Retrieval | Unverified
Adversarial Retriever-Ranker for dense text retrieval | Oct 7, 2021 | Natural Questions, Retrieval | Unverified
A Proposed Conceptual Framework for a Representational Approach to Information Retrieval | Oct 4, 2021 | Information Retrieval, Retrieval | Unverified
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations | Sep 30, 2021 | Contrastive Learning, Retrieval | Unverified
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representation | Sep 29, 2021 | Representation Learning, Retrieval | Unverified
Learning Context-Adapted Video-Text Retrieval by Attending to User Comments | Sep 29, 2021 | Retrieval, Text Retrieval | Unverified
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval | Sep 12, 2021 | Form, Image-text Retrieval | Unverified
EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling | Sep 10, 2021 | Cross-Modal Retrieval, Language Modeling | Unverified
Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model | Sep 1, 2021 | Retrieval, Text Retrieval | Unverified
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes | Aug 1, 2021 | Benchmarking, Binary Classification | Unverified
In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval | Aug 1, 2021 | Document Ranking, Knowledge Distillation | Unverified
Multi-stage Pre-training over Simplified Multimodal Pre-training Models | Jul 22, 2021 | Image-text Retrieval, Retrieval | Code Available
WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset | Jul 20, 2021 | Articles, Conditional Text Generation | Code Available
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training | Jun 25, 2021 | Image-text Retrieval, Question Answering | Unverified
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval | Jun 19, 2021 | Inductive Bias, Retrieval | Unverified
Can BERT Dig It? -- Named Entity Recognition for Information Retrieval in the Archaeology Domain | Jun 14, 2021 | Information Retrieval, named-entity-recognition | Unverified
Are we there yet? Exploring clinical domain knowledge of BERT models | Jun 1, 2021 | Language Modelling, Open-Domain Question Answering | Unverified
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | May 16, 2021 | Graph Generation, Image Captioning | Unverified
Playing Lottery Tickets with Vision and Language | Apr 23, 2021 | Image-text Retrieval, Question Answering | Unverified
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval | Apr 15, 2021 | Binarization, Information Retrieval | Unverified
Continual learning in cross-modal retrieval | Apr 14, 2021 | Continual Learning, cross-modal alignment | Unverified
Spotify at TREC 2020: Genre-Aware Abstractive Podcast Summarization | Apr 7, 2021 | Retrieval, Text Retrieval | Unverified
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training | Apr 1, 2021 | Image-text matching, Image-text Retrieval | Unverified
TREC 2020 Podcasts Track Overview | Mar 29, 2021 | Information Retrieval, Retrieval | Unverified
Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval | Mar 29, 2021 | Retrieval, Text Retrieval | Unverified
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval | Mar 28, 2021 | Retrieval, Text Retrieval | Unverified
Rudder: A Cross Lingual Video and Text Retrieval Dataset | Mar 9, 2021 | Natural Language Queries, Retrieval | Code Available