GLEN: Generative Retrieval via Lexical Index Learning | Nov 6, 2023 | Learning-To-Rank, Retrieval
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | Oct 27, 2023 | Cross-Modal Retrieval, Image-text Retrieval
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin | Oct 21, 2023 | Language Modelling, Retrieval
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter | Oct 19, 2023 | Contrastive Learning, IUPAC Name Prediction
Extending Multi-modal Contrastive Representations | Oct 13, 2023 | 3D Object Classification, Representation Learning
Fine-Tuning LLaMA for Multi-Stage Text Retrieval | Oct 12, 2023 | Passage Retrieval, Retrieval
ESA: External Space Attention Aggregation for Image-Text Retrieval | Oct 10, 2023 | Image-text Retrieval, Retrieval
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data | Oct 8, 2023 | Action Recognition, Continual Learning
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval | Sep 29, 2023 | Cross-Modal Retrieval, Image-text matching
Unified Coarse-to-Fine Alignment for Video-Text Retrieval | Sep 18, 2023 | Retrieval, Text Retrieval
LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models | Sep 2, 2023 | Blocking, Language Modelling
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory | Aug 28, 2023 | Question Answering, Retrieval
Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment | Aug 27, 2023 | Contrastive Learning, Image-text Retrieval
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval | Aug 24, 2023 | Cross-Modal Retrieval, Image-text matching
Multi-event Video-Text Retrieval | Aug 22, 2023 | Language Modelling, Retrieval
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Aug 16, 2023 | Action Classification, Image-text Retrieval
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Aug 15, 2023 | Decoder, Object
Vision-Language Dataset Distillation | Aug 15, 2023 | Dataset Distillation, image-classification
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Aug 14, 2023 | Contrastive Learning, Generative Adversarial Network
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models | Jul 26, 2023 | Image-text Retrieval, Retrieval
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Jul 24, 2023 | Contrastive Learning, Image to text
mCLIP: Multilingual CLIP via Cross-lingual Transfer | Jul 10, 2023 | Contrastive Learning, Cross-Lingual Transfer
Learning to Rank in Generative Retrieval | Jun 27, 2023 | Learning-To-Rank, Passage Ranking
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Jun 15, 2023 | Contrastive Learning, image-classification
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training | Jun 15, 2023 | Image-text Retrieval, Representation Learning
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Jun 14, 2023 | image-classification, Image Classification
Global and Local Semantic Completion Learning for Vision-Language Pre-training | Jun 12, 2023 | cross-modal alignment, Image-text Retrieval
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark | Jun 10, 2023 | Image-text Retrieval, Medical Report Generation
Revisiting the Role of Language Priors in Vision-Language Models | Jun 2, 2023 | Image-text matching, Image-text Retrieval
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image Captioning, Image Classification
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | May 28, 2023 | Attribute, Image Captioning
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers | May 27, 2023 | Image Captioning, Image Retrieval
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | May 23, 2023 | Contrastive Learning, Image-text Retrieval
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner | May 19, 2023 | Dense Captioning, Image Captioning
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers | May 11, 2023 | Contrastive Learning, Image-text Retrieval
Cross-Modal Retrieval for Motion and Text via DopTriple Loss | May 7, 2023 | Cross-Modal Retrieval, Retrieval
Understanding Differential Search Index for Text Retrieval | May 3, 2023 | Information Retrieval, Retrieval
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping | Apr 26, 2023 | Decoder, Image Captioning
Learnable Pillar-based Re-ranking for Image-Text Retrieval | Apr 25, 2023 | Image-text Retrieval, Re-Ranking
Rethinking Benchmarks for Cross-modal Image-text Retrieval | Apr 21, 2023 | Cross-Modal Retrieval, Image-text Retrieval
Image-text Retrieval via Preserving Main Semantics of Vision | Apr 20, 2023 | Cross-Modal Retrieval, Image-text Retrieval
SViTT: Temporal Learning of Sparse Video-Text Transformers | Apr 18, 2023 | Question Answering, Retrieval
Hyperbolic Image-Text Representations | Apr 18, 2023 | image-classification, Image Classification
Equivariant Similarity for Vision-Language Foundation Models | Mar 25, 2023 | Image-text Retrieval, Retrieval
Cross-Modal Retrieval with Partially Mismatched Pairs | Feb 22, 2023 | Contrastive Learning, Cross-Modal Retrieval
Multimodal Federated Learning via Contrastive Representation Ensemble | Feb 17, 2023 | Federated Learning, Image-text Retrieval
UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling | Feb 13, 2023 | Image-text Retrieval, Retrieval
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval | Feb 6, 2023 | Image-text Retrieval, Retrieval
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers | Jan 31, 2023 | Image Captioning, Image Classification
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring | Jan 26, 2023 | Representation Learning, Retrieval