Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 2239 papers

Title	Date	Tasks	Status	Hype
MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval	Jul 17, 2025	Image RetrievalRe-Ranking	—Unverified	0
FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval	Jul 17, 2025	Image Retrieval	—Unverified	0
RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features	Jul 11, 2025	Contrastive LearningImage Retrieval	CodeCode Available	1
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning	Jul 9, 2025	BenchmarkingImage Retrieval	CodeCode Available	0
MS-DPPs: Multi-Source Determinantal Point Processes for Contextual Diversity Refinement of Composite Attributes in Text to Image Retrieval	Jul 9, 2025	DiversityImage Retrieval	CodeCode Available	0
Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval	Jul 8, 2025	Image RetrievalLarge Language Model	—Unverified	0
Llama Nemoretriever Colembed: Top-Performing Text-Image Retrieval Model	Jul 7, 2025	Image RetrievalLanguage Modeling	—Unverified	0
An analysis of vision-language models for fabric retrieval	Jul 7, 2025	AttributeCross-Modal Retrieval	—Unverified	0
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval	Jun 28, 2025	Cross-Modal RetrievalImage Captioning	—Unverified	0
On the Burstiness of Faces in Set	Jun 25, 2025	Face RecognitionImage Retrieval	—Unverified	0
Referring Expression Instance Retrieval and A Strong End-to-End Baseline	Jun 23, 2025	Image RetrievalReferring Expression	—Unverified	0
Class Agnostic Instance-level Descriptor for Visual Instance Search	Jun 20, 2025	Content-Based Image RetrievalImage Retrieval	—Unverified	0
Fine-grained Image Retrieval via Dual-Vision Adaptation	Jun 19, 2025	Image RetrievalKnowledge Distillation	—Unverified	0
Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval	Jun 16, 2025	Contrastive LearningDomain Adaptation	—Unverified	0
A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation	Jun 16, 2025	Content-Based Image RetrievalDescriptive	—Unverified	0
Improving Personalized Search with Regularized Low-Rank Parameter Updates	Jun 11, 2025	General KnowledgeImage Retrieval	CodeCode Available	0
Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints	Jun 11, 2025	Image RetrievalVisual Localization	—Unverified	0
Hidden Bias in the Machine: Stereotypes in Text-to-Image Models	Jun 9, 2025	FairnessImage Retrieval	—Unverified	0
Quantization-based Bounds on the Wasserstein Metric	Jun 1, 2025	Computational EfficiencyDomain Adaptation	—Unverified	0
SORCE: Small Object Retrieval in Complex Environments	May 30, 2025	BenchmarkingImage Retrieval	CodeCode Available	0
Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch	May 29, 2025	Image RetrievalKnowledge Distillation	—Unverified	0
Fast Feature Matching of UAV Images via Matrix Band Reduction-based GPU Data Schedule	May 28, 2025	CPUGPU	—Unverified	0
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval	May 27, 2025	Image RetrievalRetrieval	CodeCode Available	1
Can Visual Encoder Learn to See Arrows?	May 26, 2025	Contrastive LearningImage Retrieval	—Unverified	0
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval	May 26, 2025	Image RetrievalLarge Language Model	—Unverified	0
Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	May 26, 2025	Contrastive LearningImage Retrieval	—Unverified	0
Visualized Text-to-Image Retrieval	May 26, 2025	Image RetrievalQuestion Answering	CodeCode Available	1
One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP	May 26, 2025	AllImage Retrieval	CodeCode Available	1
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP	May 24, 2025	Image CaptioningImage Generation	—Unverified	0
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval	May 23, 2025	Image RetrievalRetrieval	—Unverified	0
DART^3: Leveraging Distance for Test Time Adaptation in Person Re-Identification	May 23, 2025	Domain AdaptationImage Retrieval	—Unverified	0
Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval	May 21, 2025	AttributeImage Retrieval	—Unverified	0
SCENIR: Visual Semantic Clarity through Unsupervised Scene Graph Retrieval	May 21, 2025	counterfactualGraph Generation	CodeCode Available	0
IA-T2I: Internet-Augmented Text-to-Image Generation	May 21, 2025	Image GenerationImage Retrieval	—Unverified	0
Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models	May 20, 2025	Anomaly DetectionDescriptive	—Unverified	0
Non-planar Object Detection and Identification by Features Matching and Triangulation Growth	May 19, 2025	Image RetrievalIndustrial Robots	—Unverified	0
Improved Bag-of-Words Image Retrieval with Geometric Constraints for Ground Texture Localization	May 16, 2025	Image RetrievalLoop Closure Detection	—Unverified	0
Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing	May 16, 2025	Image Retrieval	—Unverified	0
Seeing the Abstract: Translating the Abstract Language for Vision Language Models	May 6, 2025	Image RetrievalRetrieval	CodeCode Available	0
OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery	May 4, 2025	Content-Based Image RetrievalImage Retrieval	CodeCode Available	0
Geolocating Earth Imagery from ISS: Integrating Machine Learning with Astronaut Photography for Enhanced Geographic Mapping	Apr 29, 2025	Deep LearningEarth Observation	CodeCode Available	0
From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval	Apr 25, 2025	Image RetrievalRetrieval	—Unverified	0
CLIPSE -- a minimalistic CLIP-based image search engine for research	Apr 24, 2025	Image Retrieval	CodeCode Available	0
A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling	Apr 19, 2025	DiversityImage Retrieval	—Unverified	0
SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs	Apr 17, 2025	Cross-Modal RetrievalImage Retrieval	—Unverified	0
Generalized Visual Relation Detection with Diffusion Models	Apr 16, 2025	Graph GenerationHuman-Object Interaction Detection	—Unverified	0
TMCIR: Token Merge Benefits Composed Image Retrieval	Apr 15, 2025	Contrastive Learningcross-modal alignment	—Unverified	0
Visual Re-Ranking with Non-Visual Side Information	Apr 15, 2025	Graph Neural NetworkImage Retrieval	CodeCode Available	0
Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition	Apr 14, 2025	Computational EfficiencyImage Retrieval	CodeCode Available	1
FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations	Apr 11, 2025	image-classificationImage Classification	—Unverified	0

Show:10 25 50

← PrevPage 1 of 45Next →

All datasets ROxford (Hard)ROxford (Medium)RParis (Hard)RParis (Medium)CREPE (Compositional REPresentation Evaluation)Fashion IQ Flickr30K 1K test CIRR SOP Flickr30k-CN Oxf5k Flickr30k

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SuperGlobal	mAP	80.2	—	Unverified
2	AMES	mAP	80	—	Unverified
3	Hypergraph propagation+community selection	mAP	73	—	Unverified
4	Token	mAP	66.57	—	Unverified
5	DELG+ α QE reranking+ RRT reranking	mAP	64	—	Unverified
6	FIRe	mAP	61.2	—	Unverified
7	HOW	mAP	56.9	—	Unverified
8	ResNet101+ArcFace GLDv2-train-clean	mAP	51.6	—	Unverified
9	DELF–HQE+SP	mAP	50.3	—	Unverified
10	HesAff–rSIFT–HQE+SP	mAP	49.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	90.7	—	Unverified
2	Hypergraph propagation+Community selection	mAP	88.4	—	Unverified
3	Token	mAP	82.28	—	Unverified
4	FIRe	mAP	81.8	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	80.4	—	Unverified
6	HOW	mAP	79.4	—	Unverified
7	ResNet101+ArcFace GLDv2-train-clean	mAP	74.2	—	Unverified
8	DELF–HQE+SP	mAP	73.4	—	Unverified
9	HesAff–rSIFT–HQE+SP	mAP	71.3	—	Unverified
10	DELF–ASMK*+SP	mAP	67.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	89.7	—	Unverified
2	SuperGlobal	mAP	86.7	—	Unverified
3	Hypergraph propagation	mAP	83.3	—	Unverified
4	Token	mAP	78.56	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	77.7	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	70.3	—	Unverified
7	FIRe	mAP	70	—	Unverified
8	DELF–HQE+SP	mAP	69.3	—	Unverified
9	HOW	mAP	62.4	—	Unverified
10	R–R-MAC	mAP	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	94.9	—	Unverified
2	Hypergraph propagation	mAP	92.6	—	Unverified
3	Token	mAP	89.34	—	Unverified
4	DELG+ α QE reranking + RRT reranking	mAP	88.5	—	Unverified
5	FIRe	mAP	85.3	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	84.9	—	Unverified
7	DELF–HQE+SP	mAP	84	—	Unverified
8	HOW	mAP	81.6	—	Unverified
9	R–R-MAC	mAP	78.9	—	Unverified
10	R–GeM	mAP	77.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Swin-T (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.5	—	Unverified
2	RN-50 (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.4	—	Unverified
3	MosaiCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	41.5	—	Unverified
4	RN-50 (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	41.4	—	Unverified
5	MosaiCLIP (CC-FT)	Recall@1 (HN-Atom, UC)	40.9	—	Unverified
6	Swin-T (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	39.6	—	Unverified
7	CLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39.5	—	Unverified
8	ViT-L-14 (LAION400M)	Recall@1 (HN-Atom + HN-Comp, SC)	39.44	—	Unverified
9	NegCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39	—	Unverified
10	CLIP-FT (YFCC-FT)	Recall@1 (HN-Atom, UC)	38.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DQU-CIR	(Recall@10+Recall@50)/2	71.77	—	Unverified
2	TMCIR	(Recall@10+Recall@50)/2	66.56	—	Unverified
3	SPN4CIR (SPRC)	(Recall@10+Recall@50)/2	66.41	—	Unverified
4	SPRC	(Recall@10+Recall@50)/2	64.85	—	Unverified
5	Candidate Set Re-ranking	(Recall@10+Recall@50)/2	62.15	—	Unverified
6	RUTIR (BLIP B/16)	(Recall@10+Recall@50)/2	61.32	—	Unverified
7	CASE	(Recall@10+Recall@50)/2	59.73	—	Unverified
8	CaLa	(Recall@10+Recall@50)/2	57.96	—	Unverified
9	BLIP4CIR+Bi	(Recall@10+Recall@50)/2	55.4	—	Unverified
10	CLIP4Cir (v3)	(Recall@10+Recall@50)/2	55.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-VLM (base)	R@1	86.9	—	Unverified
2	RCAR	R@1	62.6	—	Unverified
3	SGRAF	R@1	58.5	—	Unverified
4	LGSGM	R@1	57.4	—	Unverified
5	VisualSparta	R@1	57.4	—	Unverified
6	TERAN MrSw	R@1	56.5	—	Unverified
7	TERAN Symm.	R@1	55.7	—	Unverified
8	VSRN	R@1	54.7	—	Unverified
9	CAMP	R@1	51.5	—	Unverified
10	SCAN i-t	R@1	44	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMCIR	(Recall@5+Recall_subset@1)/2	83.46	—	Unverified
2	SPN4CIR (SPRC)	(Recall@5+Recall_subset@1)/2	82.69	—	Unverified
3	SPRC2	(Recall@5+Recall_subset@1)/2	82.66	—	Unverified
4	SPRC	(Recall@5+Recall_subset@1)/2	81.39	—	Unverified
5	Candidate Set Re-ranking	(Recall@5+Recall_subset@1)/2	80.9	—	Unverified
6	CaLa	(Recall@5+Recall_subset@1)/2	78.74	—	Unverified
7	CASE (Pre-trained on LaSCo.Ca)	(Recall@5+Recall_subset@1)/2	78.25	—	Unverified
8	CASE	(Recall@5+Recall_subset@1)/2	77.5	—	Unverified
9	VISTA (base)	(Recall@5+Recall_subset@1)/2	75.9	—	Unverified
10	MMRet-MLLM	(Recall@5+Recall_subset@1)/2	75.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unicom+ViT-L@336px	R@1	91.2	—	Unverified
2	ROADMAP (DeiT-B)	R@1	86	—	Unverified
3	CGD (SG/GS)	R@1	84.2	—	Unverified
4	ROADMAP (ResNet-50)	R@1	83.1	—	Unverified
5	ProxyNCA++	R@1	81.4	—	Unverified
6	PNP Loss	R@1	81.1	—	Unverified
7	Cross-Batch Memory	R@1	80.6	—	Unverified
8	Smooth-AP	R@1	80.1	—	Unverified
9	NormSoftmax2048 (ResNet-50)	R@1	79.5	—	Unverified
10	EPSHN512	R@1	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	InternVL-G-FT	R@1	85.9	—	Unverified
2	InternVL-C-FT	R@1	85.2	—	Unverified
3	CN-CLIP (ViT-L/14@336px)	R@1	84.4	—	Unverified
4	R2D2 (ViT-L/14)	R@1	84.4	—	Unverified
5	CN-CLIP (ViT-H/14)	R@1	83.8	—	Unverified
6	CN-CLIP (ViT-L/14)	R@1	82.7	—	Unverified
7	CN-CLIP (ViT-B/16)	R@1	79.1	—	Unverified
8	R2D2 (ViT-B)	R@1	78.3	—	Unverified
9	Wukong (ViT-L/14)	R@1	77.4	—	Unverified
10	Wukong (ViT-B/32)	R@1	67.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Offline Diffusion	MAP	96.2	—	Unverified
2	CNN+IME layer	MAP	92	—	Unverified
3	DELF+FT+ATT+DIR+QE	MAP	90	—	Unverified
4	DIR+QE*	MAP	89	—	Unverified