Design of the topology for contrastive visual-textual alignment Sep 5, 2022 Contrastive Learning Image-to-Text Retrieval
Code Code Available 0Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval Sep 1, 2022 Image Retrieval Open-Domain Question Answering
Code Code Available 1Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment Aug 29, 2022 cross-modal alignment Image-text Retrieval
Code Code Available 1Contrastive Audio-Language Learning for Music Aug 25, 2022 Audio to Text Retrieval Descriptive
Code Code Available 1Revising Image-Text Retrieval via Multi-Modal Entailment Aug 22, 2022 Image-text Retrieval Natural Language Inference
— Unverified 0CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval Aug 21, 2022 Clustering Contrastive Learning
— Unverified 0VLMAE: Vision-Language Masked Autoencoder Aug 19, 2022 Image-text Retrieval Language Modeling
— Unverified 0On the Value of Behavioral Representations for Dense Retrieval Aug 11, 2022 Retrieval Text Retrieval
— Unverified 0Boosting Video-Text Retrieval with Explicit High-Level Semantics Aug 8, 2022 Retrieval Text Retrieval
— Unverified 0Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval Jul 29, 2022 Cross-Modal Retrieval Data Augmentation
— Unverified 0X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval Jul 15, 2022 Contrastive Learning Retrieval
Code Code Available 1Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers Jul 14, 2022 Retrieval Text Retrieval
Code Code Available 4LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval Jul 11, 2022 Representation Learning Retrieval
— Unverified 0Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0GazBy: Gaze-Based BERT Model to Incorporate Human Attention in Neural Information Retrieval Jul 4, 2022 Information Retrieval Retrieval
— Unverified 0Dynamic Contrastive Distillation for Image-Text Retrieval Jul 4, 2022 Contrastive Learning GPU
— Unverified 0A Dense Representation Framework for Lexical and Semantic Matching Jun 20, 2022 Retrieval Semantic Text Matching
Code Code Available 1Towards Robust Ranker for Text Retrieval Jun 16, 2022 Passage Retrieval Reranking
— Unverified 0MixGen: A New Multi-Modal Data Augmentation Jun 16, 2022 Data Augmentation Image-text Retrieval
Code Code Available 1Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone Jun 15, 2022 Described Object Detection Image Captioning
Code Code Available 1Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs Jun 9, 2022 Image Captioning Image Classification
Code Code Available 2Egocentric Video-Language Pretraining Jun 3, 2022 Action Recognition Contrastive Learning
Code Code Available 2VL-BEiT: Generative Vision-Language Pretraining Jun 2, 2022 image-classification Image Classification
— Unverified 0Cross-lingual and Multilingual CLIP Jun 1, 2022 Contrastive Learning Image-text Retrieval
Code Code Available 2Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training Jun 1, 2022 Contrastive Learning Cross-Lingual Transfer
Code Code Available 1Generalizing Multimodal Pre-training into Multilingual via Language Acquisition May 29, 2022 Language Acquisition Retrieval
— Unverified 0Fast and Light-Weight Answer Text Retrieval in Dialogue Systems May 27, 2022 Re-Ranking Retrieval
Code Code Available 1Prompt-based Learning for Unpaired Image Captioning May 26, 2022 Image Captioning Image-text Retrieval
— Unverified 0mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections May 24, 2022 Computational Efficiency cross-modal alignment
Code Code Available 1HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval May 24, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking May 21, 2022 Passage Ranking Passage Re-Ranking
Code Code Available 1CCMB: A Large-scale Chinese Cross-modal Benchmark May 8, 2022 image-classification Image Classification
Code Code Available 1Cross-modal Contrastive Learning for Speech Translation May 5, 2022 Contrastive Learning Retrieval
Code Code Available 1Scene-Text Aware Image and Text Retrieval with Dual-Encoder May 1, 2022 Retrieval Text Retrieval
— Unverified 0TRAttack”:" Text Rewriting Attack Against Text Retrieval May 1, 2022 Retrieval Text Retrieval
— Unverified 0Generative Multi-hop Retrieval Apr 27, 2022 Decoder GPU
Code Code Available 1MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval Apr 26, 2022 Action Recognition Retrieval
Code Code Available 1Progressive Learning for Image Retrieval with Hybrid-Modality Queries Apr 24, 2022 Image Retrieval Image-text Retrieval
— Unverified 0MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration Apr 17, 2022 Navigate Retrieval
Code Code Available 1COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval Apr 15, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Robust Cross-Modal Representation Learning with Progressive Self-Distillation Apr 10, 2022 Contrastive Learning Image Captioning
— Unverified 0Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 0On Metric Learning for Audio-Text Cross-Modal Retrieval Mar 29, 2022 AudioCaps Cross-Modal Retrieval
Code Code Available 1Image-text Retrieval: A Survey on Recent Research and Development Mar 28, 2022 Image-text Retrieval Retrieval
— Unverified 0Single-Stream Multi-Level Alignment for Vision-Language Pretraining Mar 27, 2022 Image-text Retrieval Question Answering
Code Code Available 0Audio-text Retrieval in Context Mar 25, 2022 AudioCaps Retrieval
— Unverified 0Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding Mar 11, 2022 Retrieval Text Retrieval
— Unverified 0LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval Mar 11, 2022 Contrastive Learning Re-Ranking
Code Code Available 1LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval Mar 10, 2022 Image-text Retrieval Retrieval
— Unverified 0An Uncommon Task: Participatory Design in Legal AI Mar 8, 2022 Retrieval Text Retrieval
— Unverified 0