Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2151–2200 of 2239 papers

Title	Date	Tasks	Status
Error-Corrected Margin-Based Deep Cross-Modal Hashing for Facial Image Retrieval	Apr 3, 2020	AttributeDecoder	—Unverified
The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System	Apr 1, 2017	Caption GenerationImage Retrieval	—Unverified
They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias	Jun 17, 2024	Allcounterfactual	—Unverified
Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval	May 28, 2017	Cross-Modal RetrievalImage Retrieval	—Unverified
Thin-Slicing for Pose: Learning to Understand Pose Without Explicit Pose Estimation	Jun 1, 2016	Action RecognitionImage Retrieval	—Unverified
Evaluating Contrastive Models for Instance-based Image Retrieval	Apr 30, 2021	Image RetrievalRetrieval	—Unverified
Cross-Modal Retrieval with Implicit Concept Association	Apr 12, 2018	Cross-Modal RetrievalImage Retrieval	—Unverified
A Hybrid Approach for Improved Content-based Image Retrieval using Segmentation	Feb 11, 2015	Content-Based Image RetrievalImage Retrieval	—Unverified
Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences	May 1, 2018	Coreference ResolutionImage Retrieval	—Unverified
Event Recognition in Videos by Learning from Heterogeneous Web Sources	Jun 1, 2013	Domain AdaptationImage Retrieval	—Unverified
Evidential Transformers for Improved Image Retrieval	Sep 2, 2024	Content-Based Image RetrievalImage Retrieval	—Unverified
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval	Nov 22, 2024	Image RetrievalReranking	—Unverified
ExchNet: A Unified Hashing Network for Large-Scale Fine-Grained Image Retrieval	Aug 4, 2020	Image RetrievalRetrieval	—Unverified
Exemplar SVMs as Visual Feature Encoders	Jun 1, 2015	image-classificationImage Classification	—Unverified
Experiments of Distance Measurements in a Foliage Plant Retrieval System	Nov 20, 2013	Image RetrievalRetrieval	—Unverified
Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning	Nov 13, 2021	Contrastive LearningImage Retrieval	—Unverified
Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Nov 4, 2024	Active Learningimage-classification	—Unverified
Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation	Jul 23, 2017	Image RetrievalRetrieval	—Unverified
Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval	Oct 9, 2024	Image RetrievalRetrieval	—Unverified
Exploiting Image Generality for Lexical Entailment Detection	Jul 1, 2015	Image RetrievalLexical Entailment	—Unverified
Exploiting Latent Codes: Interactive Fashion Product Generation, Similar Image Retrieval, and Cross-Category Recommendation using Variational Autoencoders	Sep 2, 2020	Image RetrievalRecommendation Systems	—Unverified
Exploiting Local Features from Deep Networks for Image Retrieval	Apr 20, 2015	ClassificationGeneral Classification	—Unverified
Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR	Mar 24, 2023	Image RetrievalKnowledge Distillation	—Unverified
Exploiting weakly-labeled Web images to improve object classification: a domain adaptation approach	Dec 1, 2010	Domain AdaptationGeneral Classification	—Unverified
This is what a pandemic looks like: Visual framing of COVID-19 on search engines	Sep 22, 2022	Image Retrieval	—Unverified
Cross-Modality Perturbation Synergy Attack for Person Re-identification	Jan 18, 2024	Image RetrievalPerson Re-Identification	—Unverified
Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval	Apr 25, 2019	Content-Based Image RetrievalImage Retrieval	—Unverified
Exploring Content Based Image Retrieval for Highly Imbalanced Melanoma Data using Style Transfer, Semantic Image Segmentation and Ensemble Learning	Oct 12, 2021	Content-Based Image RetrievalEnsemble Learning	—Unverified
Exploring EEG for Object Detection and Retrieval	Apr 9, 2015	Content-Based Image RetrievalEEG	—Unverified
Exploring Implicit Image Statistics for Visual Representativeness Modeling	Jun 1, 2013	Image Retrieval	—Unverified
Cross-modal Image Retrieval with Deep Mutual Information Maximization	Mar 10, 2021	Cross-Modal RetrievalImage Retrieval	—Unverified
Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval	Apr 12, 2022	Image RetrievalRetrieval	—Unverified
Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval	Aug 15, 2024	cross-modal alignmentDenoising	—Unverified
Cross-Modal Coordination Across a Diverse Set of Input Modalities	Jan 29, 2024	Cross-Modal RetrievalImage Retrieval	—Unverified
Three Things to Know about Deep Metric Learning	Dec 17, 2024	GPUImage Retrieval	—Unverified
Three Tiers Neighborhood Graph and Multi-graph Fusion Ranking for Multi-feature Image Retrieval: A Manifold Aspect	Sep 24, 2016	Image RetrievalRe-Ranking	—Unverified
Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics	Aug 28, 2023	Cross-Modal RetrievalImage Retrieval	—Unverified
Face Image Retrieval With Attribute Manipulation	Jan 1, 2021	AttributeFace Image Retrieval	—Unverified
Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval	Jul 1, 2024	cross-modal alignmentImage Retrieval	—Unverified
FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning	Oct 26, 2022	Cross-Modal RetrievalDecoder	—Unverified
FAemb: A Function Approximation-Based Embedding Method for Image Retrieval	Jun 1, 2015	Image RetrievalRetrieval	—Unverified
FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization	Oct 26, 2022	AttributeFairness	—Unverified
FaIRCoP: Facial Image Retrieval using Contrastive Personalization	May 28, 2022	Contrastive LearningFace Image Retrieval	—Unverified
Tiny Descriptors for Image Retrieval with Unsupervised Triplet Hashing	Nov 10, 2015	image-classificationImage Classification	—Unverified
FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion	Jun 12, 2024	Image Retrieval	—Unverified
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset	May 25, 2022	Image CaptioningImage Retrieval	—Unverified
Aggregating Local Deep Features for Image Retrieval	Dec 1, 2015	image-classificationImage Classification	—Unverified
FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval	Jul 17, 2025	Image Retrieval	—Unverified
TMCIR: Token Merge Benefits Composed Image Retrieval	Apr 15, 2025	Contrastive Learningcross-modal alignment	—Unverified
Fashion Image Retrieval with Multi-Granular Alignment	Feb 16, 2023	Image RetrievalMetric Learning	—Unverified

Show:10 25 50

← PrevPage 44 of 45Next →

All datasets ROxford (Hard)ROxford (Medium)RParis (Hard)RParis (Medium)CREPE (Compositional REPresentation Evaluation)Fashion IQ Flickr30K 1K test CIRR SOP Flickr30k-CN Oxf5k Flickr30k

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SuperGlobal	mAP	80.2	—	Unverified
2	AMES	mAP	80	—	Unverified
3	Hypergraph propagation+community selection	mAP	73	—	Unverified
4	Token	mAP	66.57	—	Unverified
5	DELG+ α QE reranking+ RRT reranking	mAP	64	—	Unverified
6	FIRe	mAP	61.2	—	Unverified
7	HOW	mAP	56.9	—	Unverified
8	ResNet101+ArcFace GLDv2-train-clean	mAP	51.6	—	Unverified
9	DELF–HQE+SP	mAP	50.3	—	Unverified
10	HesAff–rSIFT–HQE+SP	mAP	49.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	90.7	—	Unverified
2	Hypergraph propagation+Community selection	mAP	88.4	—	Unverified
3	Token	mAP	82.28	—	Unverified
4	FIRe	mAP	81.8	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	80.4	—	Unverified
6	HOW	mAP	79.4	—	Unverified
7	ResNet101+ArcFace GLDv2-train-clean	mAP	74.2	—	Unverified
8	DELF–HQE+SP	mAP	73.4	—	Unverified
9	HesAff–rSIFT–HQE+SP	mAP	71.3	—	Unverified
10	DELF–ASMK*+SP	mAP	67.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	89.7	—	Unverified
2	SuperGlobal	mAP	86.7	—	Unverified
3	Hypergraph propagation	mAP	83.3	—	Unverified
4	Token	mAP	78.56	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	77.7	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	70.3	—	Unverified
7	FIRe	mAP	70	—	Unverified
8	DELF–HQE+SP	mAP	69.3	—	Unverified
9	HOW	mAP	62.4	—	Unverified
10	R–R-MAC	mAP	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	94.9	—	Unverified
2	Hypergraph propagation	mAP	92.6	—	Unverified
3	Token	mAP	89.34	—	Unverified
4	DELG+ α QE reranking + RRT reranking	mAP	88.5	—	Unverified
5	FIRe	mAP	85.3	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	84.9	—	Unverified
7	DELF–HQE+SP	mAP	84	—	Unverified
8	HOW	mAP	81.6	—	Unverified
9	R–R-MAC	mAP	78.9	—	Unverified
10	R–GeM	mAP	77.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Swin-T (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.5	—	Unverified
2	RN-50 (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.4	—	Unverified
3	MosaiCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	41.5	—	Unverified
4	RN-50 (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	41.4	—	Unverified
5	MosaiCLIP (CC-FT)	Recall@1 (HN-Atom, UC)	40.9	—	Unverified
6	Swin-T (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	39.6	—	Unverified
7	CLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39.5	—	Unverified
8	ViT-L-14 (LAION400M)	Recall@1 (HN-Atom + HN-Comp, SC)	39.44	—	Unverified
9	NegCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39	—	Unverified
10	CLIP-FT (YFCC-FT)	Recall@1 (HN-Atom, UC)	38.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DQU-CIR	(Recall@10+Recall@50)/2	71.77	—	Unverified
2	TMCIR	(Recall@10+Recall@50)/2	66.56	—	Unverified
3	SPN4CIR (SPRC)	(Recall@10+Recall@50)/2	66.41	—	Unverified
4	SPRC	(Recall@10+Recall@50)/2	64.85	—	Unverified
5	Candidate Set Re-ranking	(Recall@10+Recall@50)/2	62.15	—	Unverified
6	RUTIR (BLIP B/16)	(Recall@10+Recall@50)/2	61.32	—	Unverified
7	CASE	(Recall@10+Recall@50)/2	59.73	—	Unverified
8	CaLa	(Recall@10+Recall@50)/2	57.96	—	Unverified
9	BLIP4CIR+Bi	(Recall@10+Recall@50)/2	55.4	—	Unverified
10	CLIP4Cir (v3)	(Recall@10+Recall@50)/2	55.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-VLM (base)	R@1	86.9	—	Unverified
2	RCAR	R@1	62.6	—	Unverified
3	SGRAF	R@1	58.5	—	Unverified
4	LGSGM	R@1	57.4	—	Unverified
5	VisualSparta	R@1	57.4	—	Unverified
6	TERAN MrSw	R@1	56.5	—	Unverified
7	TERAN Symm.	R@1	55.7	—	Unverified
8	VSRN	R@1	54.7	—	Unverified
9	CAMP	R@1	51.5	—	Unverified
10	SCAN i-t	R@1	44	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMCIR	(Recall@5+Recall_subset@1)/2	83.46	—	Unverified
2	SPN4CIR (SPRC)	(Recall@5+Recall_subset@1)/2	82.69	—	Unverified
3	SPRC2	(Recall@5+Recall_subset@1)/2	82.66	—	Unverified
4	SPRC	(Recall@5+Recall_subset@1)/2	81.39	—	Unverified
5	Candidate Set Re-ranking	(Recall@5+Recall_subset@1)/2	80.9	—	Unverified
6	CaLa	(Recall@5+Recall_subset@1)/2	78.74	—	Unverified
7	CASE (Pre-trained on LaSCo.Ca)	(Recall@5+Recall_subset@1)/2	78.25	—	Unverified
8	CASE	(Recall@5+Recall_subset@1)/2	77.5	—	Unverified
9	VISTA (base)	(Recall@5+Recall_subset@1)/2	75.9	—	Unverified
10	MMRet-MLLM	(Recall@5+Recall_subset@1)/2	75.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unicom+ViT-L@336px	R@1	91.2	—	Unverified
2	ROADMAP (DeiT-B)	R@1	86	—	Unverified
3	CGD (SG/GS)	R@1	84.2	—	Unverified
4	ROADMAP (ResNet-50)	R@1	83.1	—	Unverified
5	ProxyNCA++	R@1	81.4	—	Unverified
6	PNP Loss	R@1	81.1	—	Unverified
7	Cross-Batch Memory	R@1	80.6	—	Unverified
8	Smooth-AP	R@1	80.1	—	Unverified
9	NormSoftmax2048 (ResNet-50)	R@1	79.5	—	Unverified
10	EPSHN512	R@1	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	InternVL-G-FT	R@1	85.9	—	Unverified
2	InternVL-C-FT	R@1	85.2	—	Unverified
3	CN-CLIP (ViT-L/14@336px)	R@1	84.4	—	Unverified
4	R2D2 (ViT-L/14)	R@1	84.4	—	Unverified
5	CN-CLIP (ViT-H/14)	R@1	83.8	—	Unverified
6	CN-CLIP (ViT-L/14)	R@1	82.7	—	Unverified
7	CN-CLIP (ViT-B/16)	R@1	79.1	—	Unverified
8	R2D2 (ViT-B)	R@1	78.3	—	Unverified
9	Wukong (ViT-L/14)	R@1	77.4	—	Unverified
10	Wukong (ViT-B/32)	R@1	67.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Offline Diffusion	MAP	96.2	—	Unverified
2	CNN+IME layer	MAP	92	—	Unverified
3	DELF+FT+ATT+DIR+QE	MAP	90	—	Unverified
4	DIR+QE*	MAP	89	—	Unverified