PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval Models | Dec 5, 2023 | Retrieval Text Retrieval | Code | Code Available | 0
LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | Dec 1, 2023 | image-classification Image Classification | — | Unverified | 0
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding | Nov 30, 2023 | Attribute Compositional Zero-Shot Learning | Code | Code Available | 1
MLLMs-Augmented Visual-Language Representation Learning | Nov 30, 2023 | Image-text Retrieval Representation Learning | Code | Code Available | 1
RETSim: Resilient and Efficient Text Similarity | Nov 28, 2023 | Adversarial Text Clustering | Code | Code Available | 4
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers | Nov 27, 2023 | Caption Generation Image-text Retrieval | — | Unverified | 0
Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated Images | Nov 23, 2023 | Cross-Modal Retrieval Image Retrieval | Code | Code Available | 0
Towards Robust Text Retrieval with Progressive Learning | Nov 20, 2023 | Machine Reading Comprehension Question Answering | Code | Code Available | 0
Text Retrieval with Multi-Stage Re-Ranking Models | Nov 14, 2023 | Language Modeling Language Modelling | Code | Code Available | 0
Noisy Pair Corrector for Dense Retrieval | Nov 7, 2023 | Code Search Retrieval | — | Unverified | 0
GLEN: Generative Retrieval via Lexical Index Learning | Nov 6, 2023 | Learning-To-Rank Retrieval | Code | Code Available | 1
A New Fine-grained Alignment Method for Image-text Matching | Nov 3, 2023 | Image-text matching Image-text Retrieval | — | Unverified | 0
FLAP: Fast Language-Audio Pre-training | Nov 2, 2023 | AudioCaps Contrastive Learning | — | Unverified | 0
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval | Oct 30, 2023 | cross-modal alignment Image-text Retrieval | — | Unverified | 0
Harvest Video Foundation Models via Efficient Post-Pretraining | Oct 30, 2023 | Question Answering Text Retrieval | Code | Code Available | 0
End-to-End Autoregressive Retrieval via Bootstrapping for Smart Reply Systems | Oct 29, 2023 | Diversity Retrieval | — | Unverified | 0
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | Oct 27, 2023 | Cross-Modal Retrieval Image-text Retrieval | Code | Code Available | 1
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin | Oct 21, 2023 | Language Modelling Retrieval | Code | Code Available | 1
SILC: Improving Vision Language Pretraining with Self-Distillation | Oct 20, 2023 | Classification Contrastive Learning | — | Unverified | 0
Frozen Transformers in Language Models Are Effective Visual Encoder Layers | Oct 19, 2023 | Action Recognition Image-text Retrieval | Code | Code Available | 2
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter | Oct 19, 2023 | Contrastive Learning IUPAC Name Prediction | Code | Code Available | 1
Extending Multi-modal Contrastive Representations | Oct 13, 2023 | 3D Object Classification Representation Learning | Code | Code Available | 1
Fine-Tuning LLaMA for Multi-Stage Text Retrieval | Oct 12, 2023 | Passage Retrieval Retrieval | Code | Code Available | 1
On Using GUI Interaction Data to Improve Text Retrieval-based Bug Localization | Oct 12, 2023 | Information Retrieval Retrieval | Code | Code Available | 0
Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | Oct 12, 2023 | Image Captioning Image-text Retrieval | — | Unverified | 0
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval | Oct 12, 2023 | Cross-Modal Retrieval Image-text Retrieval | — | Unverified | 0
VeCLIP: Improving CLIP Training via Visual-enriched Captions | Oct 11, 2023 | Image-text Retrieval Retrieval | Code | Code Available | 2
ESA: External Space Attention Aggregation for Image-Text Retrieval | Oct 10, 2023 | Image-text Retrieval Retrieval | Code | Code Available | 1
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data | Oct 8, 2023 | Action Recognition Continual Learning | Code | Code Available | 1
Policy-Gradient Training of Language Models for Ranking | Oct 6, 2023 | Decision Making Domain Generalization | — | Unverified | 0
Constructing Image-Text Pair Dataset from Books | Oct 3, 2023 | Image-text Retrieval Optical Character Recognition (OCR) | — | Unverified | 0
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Oct 3, 2023 | Audio Classification Contrastive Learning | Code | Code Available | 4
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval | Sep 29, 2023 | Cross-Modal Retrieval Image-text matching | Code | Code Available | 1
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval | Sep 21, 2023 | Domain Adaptation Retrieval | — | Unverified | 0
Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis | Sep 21, 2023 | Cross-Modal Retrieval Image Captioning | Code | Code Available | 0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval | Sep 21, 2023 | Domain Adaptation Retrieval | — | Unverified | 0
Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval | Sep 19, 2023 | Question Answering Retrieval | Code | Code Available | 0
Unified Coarse-to-Fine Alignment for Video-Text Retrieval | Sep 18, 2023 | Retrieval Text Retrieval | Code | Code Available | 1
Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking | Sep 15, 2023 | Image-text matching Re-Ranking | — | Unverified | 0
Dual Relation Alignment for Composed Image Retrieval | Sep 5, 2023 | Image Retrieval Image-text Retrieval | — | Unverified | 0
MultiWay-Adapater: Adapting large-scale multi-modal models for scalable image-text retrieval | Sep 4, 2023 | Image-text Retrieval Retrieval | Code | Code Available | 0
LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models | Sep 2, 2023 | Blocking Language Modelling | Code | Code Available | 1
Contrastive Feature Masking Open-Vocabulary Vision Transformer | Sep 2, 2023 | Contrastive Learning Image-text Retrieval | — | Unverified | 0
Killing two birds with one stone: Can an audio captioning system also be used for audio-text retrieval? | Aug 29, 2023 | AudioCaps Audio captioning | — | Unverified | 0
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory | Aug 28, 2023 | Question Answering Retrieval | Code | Code Available | 1
Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment | Aug 27, 2023 | Contrastive Learning Image-text Retrieval | Code | Code Available | 1
DLIP: Distilling Language-Image Pre-training | Aug 24, 2023 | Image Captioning Image-text Retrieval | — | Unverified | 0
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval | Aug 24, 2023 | Cross-Modal Retrieval Image-text matching | Code | Code Available | 1
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE | Aug 23, 2023 | Image-text matching Image-text Retrieval | — | Unverified | 0
Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track | Aug 23, 2023 | Document Ranking Language Modeling | — | Unverified | 0