Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 2239 papers

Title	Date	Tasks	Status	Hype	Score
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network	Jan 17, 2020	Fine-Grained Image ClassificationFine-Grained Visual Recognition	CodeCode Available	1	5
Learning Instance-level Spatial-Temporal Patterns for Person Re-identification	Jul 31, 2021	Image RetrievalPerson Re-Identification	CodeCode Available	1	5
Learning Super-Features for Image Retrieval	Jan 31, 2022	Image RetrievalRetrieval	CodeCode Available	1	5
comp-syn: Perceptually Grounded Word Embeddings with Color	Oct 8, 2020	Image RetrievalWord Embeddings	CodeCode Available	1	5
Soft Contrastive Learning for Visual Localization	Dec 1, 2020	Contrastive LearningImage Retrieval	CodeCode Available	1	5
Hard negative examples are hard, but useful	Jul 24, 2020	Image RetrievalMetric Learning	CodeCode Available	1	5
Hierarchical Attention Fusion for Geo-Localization	Feb 18, 2021	geo-localizationImage Retrieval	CodeCode Available	1	5
SpaGBOL: Spatial-Graph-Based Orientated Localisation	Sep 23, 2024	Camera LocalizationCross-View Geo-Localisation	CodeCode Available	1	5
SuperLoss: A Generic Loss for Robust Curriculum Learning	Dec 1, 2020	image-classificationImage Classification	CodeCode Available	1	5
Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization	Oct 4, 2022	Contrastive LearningImage Retrieval	CodeCode Available	1	5
Modality-Agnostic Attention Fusion for visual search with text feedback	Jun 30, 2020	Image RetrievalRetrieval	CodeCode Available	1	5
MosAIc: Finding Artistic Connections across Culture with Conditional Image Retrieval	Jul 14, 2020	Cultural Vocal Bursts Intensity PredictionImage Retrieval	CodeCode Available	1	5
Conditioned and Composed Image Retrieval Combining and Partially Fine-Tuning CLIP-Based Features	Jun 19, 2022	Composed Image Retrieval (CoIR)Content-Based Image Retrieval	CodeCode Available	1	5
HERS: Homomorphically Encrypted Representation Search	Mar 27, 2020	Image Retrieval	CodeCode Available	1	5
Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching	Mar 26, 2022	geo-localizationImage Retrieval	CodeCode Available	1	5
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval	Jan 14, 2024	Contrastive LearningImage Retrieval	CodeCode Available	1	5
Hierarchy-of-Visual-Words: a Learning-based Approach for Trademark Image Retrieval	Aug 7, 2019	Image RetrievalRetrieval	CodeCode Available	1	5
Improving the HardNet Descriptor	Jul 19, 2020	Dimensionality ReductionImage Retrieval	CodeCode Available	1	5
Towards Content-based Pixel Retrieval in Revisited Oxford and Paris	Sep 11, 2023	Image RetrievalInstance Segmentation	CodeCode Available	1	5
Histopathology Slide Indexing and Search: Are We There Yet?	Jun 29, 2023	DiagnosticImage Retrieval	CodeCode Available	1	5
Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts	Feb 17, 2023	Image RetrievalImage-text Classification	CodeCode Available	1	5
Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval	Jan 24, 2022	Image Retrievalregression	CodeCode Available	1	5
ILIAS: Instance-Level Image retrieval At Scale	Feb 17, 2025	BenchmarkingImage Retrieval	CodeCode Available	1	5
HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval	May 13, 2024	Deep HashingImage Retrieval	CodeCode Available	1	5
HyP^2 Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval	Aug 14, 2022	Deep HashingImage Retrieval	CodeCode Available	1	5
Hyperbolic Image Embeddings	Apr 3, 2019	Few-Shot LearningGeneral Classification	CodeCode Available	1	5
Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval	Apr 29, 2022	Image RetrievalPrivacy Preserving	CodeCode Available	1	5
Transform-Invariant Convolutional Neural Networks for Image Classification and Search	Nov 28, 2019	ClassificationGeneral Classification	CodeCode Available	1	5
AugNet: End-to-End Unsupervised Visual Representation Learning with Image Augmentation	Jun 11, 2021	ClusteringImage Augmentation	CodeCode Available	1	5
Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval	Mar 25, 2020	DiversityImage Retrieval	CodeCode Available	1	5
Image Generation Diversity Issues and How to Tame Them	Nov 25, 2024	DiversityImage Generation	CodeCode Available	1	5
Image Retrieval from Contextual Descriptions	Mar 29, 2022	Image RetrievalRetrieval	CodeCode Available	1	5
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval	Sep 1, 2022	Image RetrievalOpen-Domain Question Answering	CodeCode Available	1	5
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval	May 27, 2025	Image RetrievalRetrieval	CodeCode Available	1	5
Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models	Aug 9, 2021	Composed Image Retrieval (CoIR)Image Retrieval	CodeCode Available	1	5
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning	Mar 13, 2025	Image RetrievalRetrieval	CodeCode Available	1	5
Shuffle and Learn: Minimizing Mutual Information for Unsupervised Hashing	Nov 20, 2020	Image RetrievalPerson Re-Identification	CodeCode Available	1	5
Contextually Affinitive Neighborhood Refinery for Deep Clustering	Dec 12, 2023	ClusteringDeep Clustering	CodeCode Available	1	5
Contextual Similarity Aggregation with Self-attention for Visual Re-ranking	Oct 26, 2021	Content-Based Image RetrievalData Augmentation	CodeCode Available	1	5
IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents	Dec 10, 2024	Cross-Modal RetrievalImage Classification	CodeCode Available	1	5
Improving Point Cloud Based Place Recognition with Ranking-based Loss and Large Batch Training	Mar 2, 2022	3D Place RecognitionImage Retrieval	CodeCode Available	1	5
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers	Jan 31, 2023	Image CaptioningImage Classification	CodeCode Available	1	5
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives	Apr 17, 2024	Contrastive LearningImage Retrieval	CodeCode Available	1	5
Contrastive Language-Image Pre-training for the Italian Language	Aug 19, 2021	Image RetrievalMulti-label zero-shot learning	CodeCode Available	1	5
Improving Deep Metric Learning by Divide and Conquer	Sep 9, 2021	Image RetrievalMetric Learning	CodeCode Available	1	5
Contrastive Quantization with Code Memory for Unsupervised Image Retrieval	Sep 11, 2021	Contrastive LearningDeep Hashing	CodeCode Available	1	5
Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback	Jun 8, 2021	AttributeImage Retrieval	CodeCode Available	1	5
Instance-level Image Retrieval using Reranking Transformers	Mar 22, 2021	Image RetrievalReranking	CodeCode Available	1	5
Vision-Language Models in Remote Sensing: Current Progress and Future Trends	May 9, 2023	Image CaptioningImage Generation	CodeCode Available	1	5
Zero-Shot Day-Night Domain Adaptation with a Physics Prior	Aug 11, 2021	Domain AdaptationImage Retrieval	CodeCode Available	1	5

Show:10 25 50

← PrevPage 9 of 45Next →

All datasets ROxford (Hard)ROxford (Medium)RParis (Hard)RParis (Medium)CREPE (Compositional REPresentation Evaluation)Fashion IQ Flickr30K 1K test CIRR SOP Flickr30k-CN Oxf5k Flickr30k

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SuperGlobal	mAP	80.2	—	Unverified
2	AMES	mAP	80	—	Unverified
3	Hypergraph propagation+community selection	mAP	73	—	Unverified
4	Token	mAP	66.57	—	Unverified
5	DELG+ α QE reranking+ RRT reranking	mAP	64	—	Unverified
6	FIRe	mAP	61.2	—	Unverified
7	HOW	mAP	56.9	—	Unverified
8	ResNet101+ArcFace GLDv2-train-clean	mAP	51.6	—	Unverified
9	DELF–HQE+SP	mAP	50.3	—	Unverified
10	HesAff–rSIFT–HQE+SP	mAP	49.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	90.7	—	Unverified
2	Hypergraph propagation+Community selection	mAP	88.4	—	Unverified
3	Token	mAP	82.28	—	Unverified
4	FIRe	mAP	81.8	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	80.4	—	Unverified
6	HOW	mAP	79.4	—	Unverified
7	ResNet101+ArcFace GLDv2-train-clean	mAP	74.2	—	Unverified
8	DELF–HQE+SP	mAP	73.4	—	Unverified
9	HesAff–rSIFT–HQE+SP	mAP	71.3	—	Unverified
10	DELF–ASMK*+SP	mAP	67.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	89.7	—	Unverified
2	SuperGlobal	mAP	86.7	—	Unverified
3	Hypergraph propagation	mAP	83.3	—	Unverified
4	Token	mAP	78.56	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	77.7	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	70.3	—	Unverified
7	FIRe	mAP	70	—	Unverified
8	DELF–HQE+SP	mAP	69.3	—	Unverified
9	HOW	mAP	62.4	—	Unverified
10	R–R-MAC	mAP	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	94.9	—	Unverified
2	Hypergraph propagation	mAP	92.6	—	Unverified
3	Token	mAP	89.34	—	Unverified
4	DELG+ α QE reranking + RRT reranking	mAP	88.5	—	Unverified
5	FIRe	mAP	85.3	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	84.9	—	Unverified
7	DELF–HQE+SP	mAP	84	—	Unverified
8	HOW	mAP	81.6	—	Unverified
9	R–R-MAC	mAP	78.9	—	Unverified
10	R–GeM	mAP	77.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Swin-T (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.5	—	Unverified
2	RN-50 (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.4	—	Unverified
3	MosaiCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	41.5	—	Unverified
4	RN-50 (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	41.4	—	Unverified
5	MosaiCLIP (CC-FT)	Recall@1 (HN-Atom, UC)	40.9	—	Unverified
6	Swin-T (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	39.6	—	Unverified
7	CLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39.5	—	Unverified
8	ViT-L-14 (LAION400M)	Recall@1 (HN-Atom + HN-Comp, SC)	39.44	—	Unverified
9	NegCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39	—	Unverified
10	CLIP-FT (YFCC-FT)	Recall@1 (HN-Atom, UC)	38.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DQU-CIR	(Recall@10+Recall@50)/2	71.77	—	Unverified
2	TMCIR	(Recall@10+Recall@50)/2	66.56	—	Unverified
3	SPN4CIR (SPRC)	(Recall@10+Recall@50)/2	66.41	—	Unverified
4	SPRC	(Recall@10+Recall@50)/2	64.85	—	Unverified
5	Candidate Set Re-ranking	(Recall@10+Recall@50)/2	62.15	—	Unverified
6	RUTIR (BLIP B/16)	(Recall@10+Recall@50)/2	61.32	—	Unverified
7	CASE	(Recall@10+Recall@50)/2	59.73	—	Unverified
8	CaLa	(Recall@10+Recall@50)/2	57.96	—	Unverified
9	BLIP4CIR+Bi	(Recall@10+Recall@50)/2	55.4	—	Unverified
10	CLIP4Cir (v3)	(Recall@10+Recall@50)/2	55.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-VLM (base)	R@1	86.9	—	Unverified
2	RCAR	R@1	62.6	—	Unverified
3	SGRAF	R@1	58.5	—	Unverified
4	LGSGM	R@1	57.4	—	Unverified
5	VisualSparta	R@1	57.4	—	Unverified
6	TERAN MrSw	R@1	56.5	—	Unverified
7	TERAN Symm.	R@1	55.7	—	Unverified
8	VSRN	R@1	54.7	—	Unverified
9	CAMP	R@1	51.5	—	Unverified
10	SCAN i-t	R@1	44	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMCIR	(Recall@5+Recall_subset@1)/2	83.46	—	Unverified
2	SPN4CIR (SPRC)	(Recall@5+Recall_subset@1)/2	82.69	—	Unverified
3	SPRC2	(Recall@5+Recall_subset@1)/2	82.66	—	Unverified
4	SPRC	(Recall@5+Recall_subset@1)/2	81.39	—	Unverified
5	Candidate Set Re-ranking	(Recall@5+Recall_subset@1)/2	80.9	—	Unverified
6	CaLa	(Recall@5+Recall_subset@1)/2	78.74	—	Unverified
7	CASE (Pre-trained on LaSCo.Ca)	(Recall@5+Recall_subset@1)/2	78.25	—	Unverified
8	CASE	(Recall@5+Recall_subset@1)/2	77.5	—	Unverified
9	VISTA (base)	(Recall@5+Recall_subset@1)/2	75.9	—	Unverified
10	MMRet-MLLM	(Recall@5+Recall_subset@1)/2	75.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unicom+ViT-L@336px	R@1	91.2	—	Unverified
2	ROADMAP (DeiT-B)	R@1	86	—	Unverified
3	CGD (SG/GS)	R@1	84.2	—	Unverified
4	ROADMAP (ResNet-50)	R@1	83.1	—	Unverified
5	ProxyNCA++	R@1	81.4	—	Unverified
6	PNP Loss	R@1	81.1	—	Unverified
7	Cross-Batch Memory	R@1	80.6	—	Unverified
8	Smooth-AP	R@1	80.1	—	Unverified
9	NormSoftmax2048 (ResNet-50)	R@1	79.5	—	Unverified
10	EPSHN512	R@1	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	InternVL-G-FT	R@1	85.9	—	Unverified
2	InternVL-C-FT	R@1	85.2	—	Unverified
3	CN-CLIP (ViT-L/14@336px)	R@1	84.4	—	Unverified
4	R2D2 (ViT-L/14)	R@1	84.4	—	Unverified
5	CN-CLIP (ViT-H/14)	R@1	83.8	—	Unverified
6	CN-CLIP (ViT-L/14)	R@1	82.7	—	Unverified
7	CN-CLIP (ViT-B/16)	R@1	79.1	—	Unverified
8	R2D2 (ViT-B)	R@1	78.3	—	Unverified
9	Wukong (ViT-L/14)	R@1	77.4	—	Unverified
10	Wukong (ViT-B/32)	R@1	67.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Offline Diffusion	MAP	96.2	—	Unverified
2	CNN+IME layer	MAP	92	—	Unverified
3	DELF+FT+ATT+DIR+QE	MAP	90	—	Unverified
4	DIR+QE*	MAP	89	—	Unverified