Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 671 papers

Title	Date	Tasks	Status	Hype
Multi-event Video-Text Retrieval	Aug 22, 2023	Language ModellingRetrieval	CodeCode Available	1
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption	Aug 16, 2023	Action ClassificationImage-text Retrieval	CodeCode Available	1
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model	Aug 15, 2023	DecoderObject	CodeCode Available	1
Vision-Language Dataset Distillation	Aug 15, 2023	Dataset Distillationimage-classification	CodeCode Available	1
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning	Aug 14, 2023	Contrastive LearningGenerative Adversarial Network	CodeCode Available	1
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks	Aug 13, 2023	Contrastive Learningimage-classification	—Unverified	0
Embedding-based Retrieval with LLM for Effective Agriculture Information Extracting from Unstructured Data	Aug 6, 2023	Language ModelingLanguage Modelling	—Unverified	0
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World	Aug 3, 2023	AllQuestion Answering	CodeCode Available	2
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection	Jul 31, 2023	Adversarial AttackInformation Retrieval	—Unverified	0
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models	Jul 26, 2023	Image-text RetrievalRetrieval	CodeCode Available	1
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports	Jul 24, 2023	Contrastive LearningImage to text	CodeCode Available	1
Towards a Visual-Language Foundation Model for Computational Pathology	Jul 24, 2023	Contrastive Learningimage-classification	—Unverified	0
Extracting Molecular Properties from Natural Language with Multimodal Contrastive Learning	Jul 22, 2023	Contrastive LearningProperty Prediction	—Unverified	0
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP	Jul 18, 2023	AttributeImage-text Retrieval	—Unverified	0
mCLIP: Multilingual CLIP via Cross-lingual Transfer	Jul 10, 2023	Contrastive LearningCross-Lingual Transfer	CodeCode Available	1
Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages	Jun 29, 2023	Image-text RetrievalMachine Translation	CodeCode Available	0
Learning to Rank in Generative Retrieval	Jun 27, 2023	Learning-To-RankPassage Ranking	CodeCode Available	1
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input	Jun 25, 2023	DiversityImage-text Retrieval	—Unverified	0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter	Jun 22, 2023	Question AnsweringRetrieval	CodeCode Available	0
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing	Jun 20, 2023	Cross-Modal RetrievalImage Retrieval	CodeCode Available	2
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian	Jun 20, 2023	Cross-Lingual TransferRetrieval	CodeCode Available	0
Align, Adapt and Inject: Sound-guided Unified Image Generation	Jun 20, 2023	Image GenerationRetrieval	—Unverified	0
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing	Jun 19, 2023	ClassificationCross-Modal Retrieval	CodeCode Available	2
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding	Jun 15, 2023	Contrastive Learningimage-classification	CodeCode Available	1
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training	Jun 15, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations	Jun 14, 2023	image-classificationImage Classification	CodeCode Available	1
h2oGPT: Democratizing Large Language Models	Jun 13, 2023	ChatbotFairness	CodeCode Available	6
Global and Local Semantic Completion Learning for Vision-Language Pre-training	Jun 12, 2023	cross-modal alignmentImage-text Retrieval	CodeCode Available	1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark	Jun 10, 2023	Image-text RetrievalMedical Report Generation	CodeCode Available	1
Revisiting the Role of Language Priors in Vision-Language Models	Jun 2, 2023	Image-text matchingImage-text Retrieval	CodeCode Available	1
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models	May 29, 2023	Image CaptioningImage Classification	CodeCode Available	1
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions	May 28, 2023	AttributeImage Captioning	CodeCode Available	1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers	May 27, 2023	Image CaptioningImage Retrieval	CodeCode Available	1
Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval	May 26, 2023	Image-text RetrievalRetrieval	CodeCode Available	0
Enhancing the Ranking Context of Dense Retrieval Methods through Reciprocal Nearest Neighbors	May 25, 2023	Contrastive LearningReranking	CodeCode Available	0
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts	May 24, 2023	Dialogue State TrackingImage Retrieval	CodeCode Available	0
S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions	May 23, 2023	Contrastive LearningImage-text Retrieval	CodeCode Available	1
When the Music Stops: Tip-of-the-Tongue Retrieval for Music	May 23, 2023	BenchmarkingLanguage Modeling	CodeCode Available	0
i-Code Studio: A Configurable and Composable Framework for Integrative AI	May 23, 2023	Question AnsweringRetrieval	—Unverified	0
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending	May 22, 2023	Question AnsweringRetrieval	—Unverified	0
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner	May 19, 2023	Dense CaptioningImage Captioning	CodeCode Available	1
TOME: A Two-stage Approach for Model-based Retrieval	May 18, 2023	Natural QuestionsRetrieval	—Unverified	0
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities	May 18, 2023	1 Image, 2*2 StitchiAction Classification	CodeCode Available	3
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval	May 13, 2023	RetrievalText Retrieval	—Unverified	0
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers	May 11, 2023	Contrastive LearningImage-text Retrieval	CodeCode Available	1
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception	May 10, 2023	Classificationimage-classification	—Unverified	0
Cross-Modal Retrieval for Motion and Text via DopTriple Loss	May 7, 2023	Cross-Modal RetrievalRetrieval	CodeCode Available	1
Understanding Differential Search Index for Text Retrieval	May 3, 2023	Information RetrievalRetrieval	CodeCode Available	1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping	Apr 26, 2023	DecoderImage Captioning	CodeCode Available	1
Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining	Apr 25, 2023	ArticlesImage-text Retrieval	—Unverified	0

Show:10 25 50

← PrevPage 7 of 14Next →

No leaderboard results yet.