Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–500 of 2239 papers

Title	Date	Tasks	Status	Hype	Score
Targeted Attack for Deep Hashing based Retrieval	Apr 15, 2020	Deep HashingImage Retrieval	CodeCode Available	1	5
Classification is a Strong Baseline for Deep Metric Learning	Nov 30, 2018	BinarizationClassification	CodeCode Available	0	5
MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval	Mar 6, 2023	Image RetrievalRetrieval	CodeCode Available	0	5
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Mar 29, 2023	Cross-Modal RetrievalDecoder	CodeCode Available	0	5
Beyond Product Quantization: Deep Progressive Quantization for Image Retrieval	Jun 16, 2019	Image RetrievalQuantization	CodeCode Available	0	5
LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning	Aug 20, 2021	Image RetrievalMetric Learning	CodeCode Available	0	5
LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task	Aug 25, 2024	Computational EfficiencyImage Augmentation	CodeCode Available	0	5
LogoNet: a fine-grained network for instance-level logo sketch retrieval	Apr 5, 2023	2kBenchmarking	CodeCode Available	0	5
Matchable Image Retrieval by Learning from Surface Reconstruction	Nov 26, 2018	3D ReconstructionImage Retrieval	CodeCode Available	0	5
Local Features and Visual Words Emerge in Activations	May 15, 2019	Image RetrievalRetrieval	CodeCode Available	0	5
Local Features and Visual Words Emerge in Activations	Jun 1, 2019	Image RetrievalRetrieval	CodeCode Available	0	5
Leveraging Unlabeled Data for Crowd Counting by Learning to Rank	Mar 8, 2018	Crowd CountingImage Retrieval	CodeCode Available	0	5
Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal	Jul 11, 2024	Image RetrievalRetrieval	CodeCode Available	0	5
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained Approach	Oct 28, 2023	Copy Detectionimage-classification	CodeCode Available	0	5
Let's Transfer Transformations of Shared Semantic Representations	Mar 2, 2019	AttributeImage Retrieval	CodeCode Available	0	5
Cross-Modality Sub-Image Retrieval using Contrastive Multimodal Image Representations	Jan 10, 2022	Content-Based Image RetrievalImage Retrieval	CodeCode Available	0	5
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Jun 11, 2024	BenchmarkingContrastive Learning	CodeCode Available	0	5
Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence	Nov 10, 2016	Image Retrieval	CodeCode Available	0	5
Learning to Learn from Web Data through Deep Semantic Embeddings	Aug 20, 2018	Image RetrievalRetrieval	CodeCode Available	0	5
Learning to Minimize the Remainder in Supervised Learning	Jan 23, 2022	image-classificationImage Classification	CodeCode Available	0	5
Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers	Oct 16, 2022	Data AugmentationImage Retrieval	CodeCode Available	0	5
Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning	Jun 19, 2023	AttributeImage Retrieval	CodeCode Available	0	5
Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax	Dec 11, 2018	Image RetrievalRetrieval	CodeCode Available	0	5
Learning Metrics from Teachers: Compact Networks for Image Embedding	Apr 7, 2019	Face Recognitionimage-classification	CodeCode Available	0	5
Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning	Feb 4, 2024	Image RetrievalMetric Learning	CodeCode Available	0	5
Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features	Jul 3, 2024	Image RetrievalRetrieval	CodeCode Available	0	5
Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild	Sep 5, 2017	Image RetrievalRetrieval	CodeCode Available	0	5
Cross-Modal Coherence for Text-to-Image Retrieval	Sep 22, 2021	Image RetrievalRetrieval	CodeCode Available	0	5
CrossLocate: Cross-modal Large-scale Visual Geo-Localization in Natural Environments using Rendered Modalities	Jan 1, 2022	Camera LocalizationCamera Pose Estimation	CodeCode Available	0	5
Batch DropBlock Network for Person Re-identification and Beyond	Nov 17, 2018	Image RetrievalMetric Learning	CodeCode Available	0	5
Learning Discriminative and Transformation Covariant Local Feature Detectors	Jul 1, 2017	Image Retrieval	CodeCode Available	0	5
Adding Cues to Binary Feature Descriptors for Visual Place Recognition	Sep 18, 2018	Image RetrievalRetrieval	CodeCode Available	0	5
Cross-Domain Image Matching with Deep Feature Maps	Apr 6, 2018	Image RetrievalRetrieval	CodeCode Available	0	5
Learning Deep Representations of Fine-grained Visual Descriptions	May 17, 2016	AttributeImage Retrieval	CodeCode Available	0	5
Learning with Average Precision: Training Image Retrieval with a Listwise Loss	Jun 18, 2019	Image RetrievalRetrieval	CodeCode Available	0	5
Learning discriminative and transformation covariant local feature detectors.	Jul 1, 2017	Image Retrieval	CodeCode Available	0	5
Cross-dimensional Weighting for Aggregated Deep Convolutional Features	Dec 13, 2015	Image Retrieval	CodeCode Available	0	5
Barcode Annotations for Medical Image Retrieval: A Preliminary Investigation	May 19, 2015	General ClassificationImage Retrieval	CodeCode Available	0	5
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching	Apr 25, 2024	BenchmarkingData Augmentation	CodeCode Available	0	5
Learning compact binary descriptors with unsupervised deep neural networks	Jun 1, 2016	Image RetrievalObject	CodeCode Available	0	5
Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval	Jan 1, 2021	Image RetrievalMetric Learning	CodeCode Available	0	5
Learning Disentangled Representations via Mutual Information Estimation	Dec 9, 2019	DisentanglementGeneral Classification	CodeCode Available	0	5
Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval	Feb 13, 2023	Image RetrievalRetrieval	CodeCode Available	0	5
A Zero-Shot Framework for Sketch-based Image Retrieval	Jul 31, 2018	Image RetrievalRetrieval	CodeCode Available	0	5
Correcting the Triplet Selection Bias for Triplet Loss	Sep 1, 2018	Face RecognitionFine-Grained Image Classification	CodeCode Available	0	5
Automating 3D Dataset Generation with Neural Radiance Fields	Mar 20, 2025	3D Pose EstimationDataset Generation	CodeCode Available	0	5
Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation	Oct 21, 2023	Answer GenerationImage Retrieval	CodeCode Available	0	5
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-Training for Vision-Language Representation	Jan 1, 2021	Contrastive LearningCross-Modal Retrieval	CodeCode Available	0	5
Looking at Outfit to Parse Clothing	Mar 4, 2017	Image RetrievalRetrieval	CodeCode Available	0	5
Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach	Aug 27, 2019	16kImage Retrieval	CodeCode Available	0	5

Show:10 25 50

← PrevPage 10 of 45Next →

All datasets ROxford (Hard)ROxford (Medium)RParis (Hard)RParis (Medium)CREPE (Compositional REPresentation Evaluation)Fashion IQ Flickr30K 1K test CIRR SOP Flickr30k-CN Oxf5k Flickr30k

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SuperGlobal	mAP	80.2	—	Unverified
2	AMES	mAP	80	—	Unverified
3	Hypergraph propagation+community selection	mAP	73	—	Unverified
4	Token	mAP	66.57	—	Unverified
5	DELG+ α QE reranking+ RRT reranking	mAP	64	—	Unverified
6	FIRe	mAP	61.2	—	Unverified
7	HOW	mAP	56.9	—	Unverified
8	ResNet101+ArcFace GLDv2-train-clean	mAP	51.6	—	Unverified
9	DELF–HQE+SP	mAP	50.3	—	Unverified
10	HesAff–rSIFT–HQE+SP	mAP	49.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	90.7	—	Unverified
2	Hypergraph propagation+Community selection	mAP	88.4	—	Unverified
3	Token	mAP	82.28	—	Unverified
4	FIRe	mAP	81.8	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	80.4	—	Unverified
6	HOW	mAP	79.4	—	Unverified
7	ResNet101+ArcFace GLDv2-train-clean	mAP	74.2	—	Unverified
8	DELF–HQE+SP	mAP	73.4	—	Unverified
9	HesAff–rSIFT–HQE+SP	mAP	71.3	—	Unverified
10	DELF–ASMK*+SP	mAP	67.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	89.7	—	Unverified
2	SuperGlobal	mAP	86.7	—	Unverified
3	Hypergraph propagation	mAP	83.3	—	Unverified
4	Token	mAP	78.56	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	77.7	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	70.3	—	Unverified
7	FIRe	mAP	70	—	Unverified
8	DELF–HQE+SP	mAP	69.3	—	Unverified
9	HOW	mAP	62.4	—	Unverified
10	R–R-MAC	mAP	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	94.9	—	Unverified
2	Hypergraph propagation	mAP	92.6	—	Unverified
3	Token	mAP	89.34	—	Unverified
4	DELG+ α QE reranking + RRT reranking	mAP	88.5	—	Unverified
5	FIRe	mAP	85.3	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	84.9	—	Unverified
7	DELF–HQE+SP	mAP	84	—	Unverified
8	HOW	mAP	81.6	—	Unverified
9	R–R-MAC	mAP	78.9	—	Unverified
10	R–GeM	mAP	77.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Swin-T (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.5	—	Unverified
2	RN-50 (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.4	—	Unverified
3	MosaiCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	41.5	—	Unverified
4	RN-50 (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	41.4	—	Unverified
5	MosaiCLIP (CC-FT)	Recall@1 (HN-Atom, UC)	40.9	—	Unverified
6	Swin-T (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	39.6	—	Unverified
7	CLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39.5	—	Unverified
8	ViT-L-14 (LAION400M)	Recall@1 (HN-Atom + HN-Comp, SC)	39.44	—	Unverified
9	NegCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39	—	Unverified
10	CLIP-FT (YFCC-FT)	Recall@1 (HN-Atom, UC)	38.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DQU-CIR	(Recall@10+Recall@50)/2	71.77	—	Unverified
2	TMCIR	(Recall@10+Recall@50)/2	66.56	—	Unverified
3	SPN4CIR (SPRC)	(Recall@10+Recall@50)/2	66.41	—	Unverified
4	SPRC	(Recall@10+Recall@50)/2	64.85	—	Unverified
5	Candidate Set Re-ranking	(Recall@10+Recall@50)/2	62.15	—	Unverified
6	RUTIR (BLIP B/16)	(Recall@10+Recall@50)/2	61.32	—	Unverified
7	CASE	(Recall@10+Recall@50)/2	59.73	—	Unverified
8	CaLa	(Recall@10+Recall@50)/2	57.96	—	Unverified
9	BLIP4CIR+Bi	(Recall@10+Recall@50)/2	55.4	—	Unverified
10	CLIP4Cir (v3)	(Recall@10+Recall@50)/2	55.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-VLM (base)	R@1	86.9	—	Unverified
2	RCAR	R@1	62.6	—	Unverified
3	SGRAF	R@1	58.5	—	Unverified
4	VisualSparta	R@1	57.4	—	Unverified
5	LGSGM	R@1	57.4	—	Unverified
6	TERAN MrSw	R@1	56.5	—	Unverified
7	TERAN Symm.	R@1	55.7	—	Unverified
8	VSRN	R@1	54.7	—	Unverified
9	CAMP	R@1	51.5	—	Unverified
10	SCAN i-t	R@1	44	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMCIR	(Recall@5+Recall_subset@1)/2	83.46	—	Unverified
2	SPN4CIR (SPRC)	(Recall@5+Recall_subset@1)/2	82.69	—	Unverified
3	SPRC2	(Recall@5+Recall_subset@1)/2	82.66	—	Unverified
4	SPRC	(Recall@5+Recall_subset@1)/2	81.39	—	Unverified
5	Candidate Set Re-ranking	(Recall@5+Recall_subset@1)/2	80.9	—	Unverified
6	CaLa	(Recall@5+Recall_subset@1)/2	78.74	—	Unverified
7	CASE (Pre-trained on LaSCo.Ca)	(Recall@5+Recall_subset@1)/2	78.25	—	Unverified
8	CASE	(Recall@5+Recall_subset@1)/2	77.5	—	Unverified
9	VISTA (base)	(Recall@5+Recall_subset@1)/2	75.9	—	Unverified
10	MMRet-MLLM	(Recall@5+Recall_subset@1)/2	75.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unicom+ViT-L@336px	R@1	91.2	—	Unverified
2	ROADMAP (DeiT-B)	R@1	86	—	Unverified
3	CGD (SG/GS)	R@1	84.2	—	Unverified
4	ROADMAP (ResNet-50)	R@1	83.1	—	Unverified
5	ProxyNCA++	R@1	81.4	—	Unverified
6	PNP Loss	R@1	81.1	—	Unverified
7	Cross-Batch Memory	R@1	80.6	—	Unverified
8	Smooth-AP	R@1	80.1	—	Unverified
9	NormSoftmax2048 (ResNet-50)	R@1	79.5	—	Unverified
10	EPSHN512	R@1	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	InternVL-G-FT	R@1	85.9	—	Unverified
2	InternVL-C-FT	R@1	85.2	—	Unverified
3	R2D2 (ViT-L/14)	R@1	84.4	—	Unverified
4	CN-CLIP (ViT-L/14@336px)	R@1	84.4	—	Unverified
5	CN-CLIP (ViT-H/14)	R@1	83.8	—	Unverified
6	CN-CLIP (ViT-L/14)	R@1	82.7	—	Unverified
7	CN-CLIP (ViT-B/16)	R@1	79.1	—	Unverified
8	R2D2 (ViT-B)	R@1	78.3	—	Unverified
9	Wukong (ViT-L/14)	R@1	77.4	—	Unverified
10	Wukong (ViT-B/32)	R@1	67.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Offline Diffusion	MAP	96.2	—	Unverified
2	CNN+IME layer	MAP	92	—	Unverified
3	DELF+FT+ATT+DIR+QE	MAP	90	—	Unverified
4	DIR+QE*	MAP	89	—	Unverified