Image Retrieval

Image retrieval is a long-standing computer vision task: given a query, find the most similar images in a large database. It is often treated as a form of fine-grained, instance-level classification, and it sits alongside classification and cross-modal retrieval as a core image recognition task. Ranking database images by visual similarity (and, where available, other criteria) lets users find relevant images efficiently, which makes retrieval central to applications such as search and recommendation.
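To make "ranking by visual similarity" concrete, here is a minimal sketch of nearest-neighbour retrieval over embedding vectors. It assumes each image has already been mapped to a non-zero feature vector by some model; the image ids, the 3-dimensional toy vectors, and the function names `cosine_similarity` and `retrieve` are made up for illustration and do not come from any paper listed on this page.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query, database, top_k=3):
    """Return the ids of the top_k database images most similar to the query."""
    scored = [(cosine_similarity(query, vec), image_id)
              for image_id, vec in database.items()]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [image_id for _, image_id in scored[:top_k]]

# Toy "embeddings" (hypothetical values, for illustration only).
database = {
    "tower_day": [0.9, 0.1, 0.0],
    "tower_night": [0.8, 0.3, 0.1],
    "beach": [0.0, 0.2, 0.9],
    "forest": [0.1, 0.9, 0.2],
}
query = [1.0, 0.2, 0.0]

print(retrieve(query, database, top_k=2))  # ['tower_day', 'tower_night']
```

A brute-force scan like this is only practical for small collections; large databases typically rely on approximate nearest-neighbour indexes instead.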

Extending CLIP for Category-to-image Retrieval in E-commerce

(Image credit: DELF)

Papers


Showing 1151–1200 of 2239 papers

Title	Date	Tasks	Status
Kernel Square-Loss Exemplar Machines for Image Retrieval	Jul 1, 2017	Image Retrieval, Retrieval	Unverified
Keypoint-Aligned Embeddings for Image Retrieval and Re-identification	Aug 26, 2020	Image Retrieval, Multi-Task Learning	Unverified
Keypoint Encoding for Improved Feature Extraction from Compressed Video at Low Bitrates	Jun 27, 2015	Image Retrieval, Keypoint Detection	Unverified
Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer	May 6, 2023	Contrastive Learning, Diversity	Unverified
Knowledge Aware Semantic Concept Expansion for Image-Text Matching	Aug 10, 2019	Common Sense Reasoning, Content-Based Image Retrieval	Unverified
Knowledge-aware Text-Image Retrieval for Remote Sensing Images	May 6, 2024	Diversity, Earth Observation	Unverified
Challenging deep image descriptors for retrieval in heterogeneous iconographic collections	Sep 19, 2019	Content-Based Image Retrieval, Image Retrieval	Unverified
CELESTIAL: Classification Enabled via Labelless Embeddings with Self-supervised Telescope Image Analysis Learning	Jan 20, 2022	Image Retrieval, Retrieval	Unverified
Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach	Jul 3, 2024	Image Retrieval	Unverified
Language Guided Local Infiltration for Interactive Image Retrieval	Apr 16, 2023	Image Retrieval, Retrieval	Unverified
Language learning using Speech to Image retrieval	Sep 9, 2019	Grounded language learning, Image Retrieval	Unverified
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding	Jul 9, 2024	Contrastive Learning, Domain Adaptation	Unverified
CBIR using Pre-Trained Neural Networks	Oct 27, 2021	Image Retrieval, Retrieval	Unverified
Large Language Model Informed Patent Image Retrieval	Apr 30, 2024	Image Retrieval, Language Modeling	Unverified
Two Birds, One Stone: Jointly Learning Binary Code for Large-Scale Face Image Retrieval and Attributes Prediction	Dec 1, 2015	Face Image Retrieval, Image Retrieval	Unverified
Large Language Models for Captioning and Retrieving Remote Sensing Images	Feb 9, 2024	Cross-Modal Retrieval, Decoder	Unverified
Large-margin Learning of Compact Binary Image Encodings	Feb 26, 2014	Content-Based Image Retrieval, General Classification	Unverified
Large Scale Deep Convolutional Neural Network Features Search with Lucene	Mar 31, 2016	Content-Based Image Retrieval, Image Retrieval	Unverified
Adversarial Attack on Deep Product Quantization Network for Image Retrieval	Feb 26, 2020	Adversarial Attack, Image Retrieval	Unverified
Two-Stage Hashing for Fast Document Retrieval	Jun 1, 2014	Image Retrieval, Information Retrieval	Unverified
UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems	Jun 1, 2013	Document Classification, Image Retrieval	Unverified
Large scale near-duplicate image retrieval using Triples of Adjacent Ranked Features (TARF) with embedded geometric information	Mar 19, 2016	Image Retrieval, Retrieval	Unverified
Vision-Language Models Learn Super Images for Efficient Partially Relevant Video Retrieval	Dec 1, 2023	Image Retrieval, Partially Relevant Video Retrieval	Unverified
Large Scale Visual Food Recognition	Mar 30, 2021	Fine-Grained Visual Recognition, Food Recognition	Unverified
CBIDR: A novel method for information retrieval combining image and data by means of TOPSIS applied to medical diagnosis	Sep 26, 2024	Content-Based Image Retrieval, Diagnostic	Unverified
LAVIS: A Library for Language-Vision Intelligence	Sep 15, 2022	Benchmarking, Image Captioning	Unverified
C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Dec 13, 2023	Image Retrieval, Pose Estimation	Unverified
CAVL: Learning Contrastive and Adaptive Representations of Vision and Language	Apr 10, 2023	Image Retrieval, Phrase Grounding	Unverified
LDOP: Local Directional Order Pattern for Robust Face Retrieval	Feb 28, 2018	Image Retrieval, Retrieval	Unverified
Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech	Jun 15, 2020	Image Retrieval, Language Acquisition	Unverified
Catching Image Retrieval Generalization	Jun 23, 2023	Image Retrieval, Metric Learning	Unverified
Learning a Facial Expression Embedding Disentangled From Identity	Jun 19, 2021	Data Augmentation, Emotion Recognition	Unverified
Case-based Similar Image Retrieval for Weakly Annotated Large Histopathological Images of Malignant Lymphoma Using Deep Metric Learning	Jul 8, 2021	Image Retrieval, Metric Learning	Unverified
Learning a Recurrent Visual Representation for Image Caption Generation	Nov 20, 2014	Caption Generation, Image Retrieval	Unverified
Captured by Captions: On Memorization and its Mitigation in CLIP Models	Feb 11, 2025	Image Retrieval, Memorization	Unverified
Learning Attributes Equals Multi-Source Domain Generalization	May 3, 2016	Attribute, Domain Generalization	Unverified
"Caption" as a Coherence Relation: Evidence and Implications	Jun 1, 2019	Diversity, Image Retrieval	Unverified
Learning-based Relational Object Matching Across Views	May 3, 2023	Graph Neural Network, Image Retrieval	Unverified
Learning Binary and Sparse Permutation-Invariant Representations for Fast and Memory Efficient Whole Slide Image Search	Aug 29, 2022	GPU, Image Retrieval	Unverified
Learning Binary Codes and Binary Weights for Efficient Classification	Mar 14, 2016	Classification, General Classification	Unverified
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision	Oct 24, 2022	cross-modal alignment, Cross-Modal Retrieval	Unverified
Learning Colour Representations of Search Queries	Jun 17, 2020	Image Retrieval	Unverified
Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks	Jun 1, 2016	Image Retrieval, Object	Unverified
Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations	Apr 20, 2022	Cross-Modal Retrieval, Image Retrieval	Unverified
Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification	Oct 17, 2023	Image Retrieval, Image-text matching	Unverified
Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch	Apr 28, 2018	Image Retrieval, Retrieval	Unverified
Learning cross space mapping via DNN using large scale click-through logs	Feb 26, 2023	image-classification, Image Classification	Unverified
Learning Deep Binary Descriptor With Multi-Quantization	Jul 1, 2017	Binarization, Image Retrieval	Unverified
Understanding Attention for Vision-and-Language Tasks	Dec 17, 2021	Image Generation, Image Retrieval	Unverified
Learning deep representation of multityped objects and tasks	Mar 4, 2016	Image Retrieval, Retrieval	Unverified



Datasets: ROxford (Hard), ROxford (Medium), RParis (Hard), RParis (Medium), CREPE (Compositional REPresentation Evaluation), Fashion IQ, Flickr30K 1K test, CIRR, SOP, Flickr30k-CN, Oxf5k, Flickr30k

Benchmark Results

ROxford (Hard)

#	Model	Metric	Claimed	Verified	Status
1	SuperGlobal	mAP	80.2	—	Unverified
2	AMES	mAP	80	—	Unverified
3	Hypergraph propagation+community selection	mAP	73	—	Unverified
4	Token	mAP	66.57	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	64	—	Unverified
6	FIRe	mAP	61.2	—	Unverified
7	HOW	mAP	56.9	—	Unverified
8	ResNet101+ArcFace GLDv2-train-clean	mAP	51.6	—	Unverified
9	DELF–HQE+SP	mAP	50.3	—	Unverified
10	HesAff–rSIFT–HQE+SP	mAP	49.7	—	Unverified

ROxford (Medium)

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	90.7	—	Unverified
2	Hypergraph propagation+Community selection	mAP	88.4	—	Unverified
3	Token	mAP	82.28	—	Unverified
4	FIRe	mAP	81.8	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	80.4	—	Unverified
6	HOW	mAP	79.4	—	Unverified
7	ResNet101+ArcFace GLDv2-train-clean	mAP	74.2	—	Unverified
8	DELF–HQE+SP	mAP	73.4	—	Unverified
9	HesAff–rSIFT–HQE+SP	mAP	71.3	—	Unverified
10	DELF–ASMK*+SP	mAP	67.8	—	Unverified

RParis (Hard)

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	89.7	—	Unverified
2	SuperGlobal	mAP	86.7	—	Unverified
3	Hypergraph propagation	mAP	83.3	—	Unverified
4	Token	mAP	78.56	—	Unverified
5	DELG+ α QE reranking + RRT reranking	mAP	77.7	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	70.3	—	Unverified
7	FIRe	mAP	70	—	Unverified
8	DELF–HQE+SP	mAP	69.3	—	Unverified
9	HOW	mAP	62.4	—	Unverified
10	R–R-MAC	mAP	59.4	—	Unverified

RParis (Medium)

#	Model	Metric	Claimed	Verified	Status
1	AMES	mAP	94.9	—	Unverified
2	Hypergraph propagation	mAP	92.6	—	Unverified
3	Token	mAP	89.34	—	Unverified
4	DELG+ α QE reranking + RRT reranking	mAP	88.5	—	Unverified
5	FIRe	mAP	85.3	—	Unverified
6	ResNet101+ArcFace GLDv2-train-clean	mAP	84.9	—	Unverified
7	DELF–HQE+SP	mAP	84	—	Unverified
8	HOW	mAP	81.6	—	Unverified
9	R–R-MAC	mAP	78.9	—	Unverified
10	R–GeM	mAP	77.2	—	Unverified
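The mAP figures in these tables are mean average precision scores. As a hedged illustration of how such a score is commonly computed, the sketch below averages precision at each rank where a relevant image appears, then averages over queries. The function names and toy rankings are hypothetical, and benchmark-specific rules (for example, ignoring certain images for a given difficulty setting) are not modelled. The result is a fraction in [0, 1], whereas leaderboards usually report it as a percentage.

```python
def average_precision(ranked_ids, relevant_ids):
    """AP for one query: mean of precision@rank over ranks holding a relevant item."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, image_id in enumerate(ranked_ids, start=1):
        if image_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    # Relevant items that were never retrieved contribute zero.
    return precision_sum / len(relevant)

def mean_average_precision(rankings, ground_truth):
    """Mean of per-query AP; both arguments are dicts keyed by query id."""
    aps = [average_precision(rankings[q], ground_truth[q]) for q in rankings]
    return sum(aps) / len(aps)

# Toy data (hypothetical): two queries with ranked results and relevant sets.
rankings = {"q1": ["a", "b", "c", "d"], "q2": ["x", "y", "z"]}
ground_truth = {"q1": ["a", "c"], "q2": ["y"]}

print(round(mean_average_precision(rankings, ground_truth), 4))  # 0.6667
```

Here q1 scores (1/1 + 2/3) / 2 and q2 scores 1/2, so the mean is 2/3.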

CREPE (Compositional REPresentation Evaluation)

#	Model	Metric	Claimed	Verified	Status
1	Swin-T (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.5	—	Unverified
2	RN-50 (MosaiCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	44.4	—	Unverified
3	MosaiCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	41.5	—	Unverified
4	RN-50 (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	41.4	—	Unverified
5	MosaiCLIP (CC-FT)	Recall@1 (HN-Atom, UC)	40.9	—	Unverified
6	Swin-T (NegCLIP, CC-12M)	Recall@1 (HN-Atom, UC)	39.6	—	Unverified
7	CLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39.5	—	Unverified
8	ViT-L-14 (LAION400M)	Recall@1 (HN-Atom + HN-Comp, SC)	39.44	—	Unverified
9	NegCLIP (YFCC-FT)	Recall@1 (HN-Atom, UC)	39	—	Unverified
10	CLIP-FT (YFCC-FT)	Recall@1 (HN-Atom, UC)	38.3	—	Unverified

Fashion IQ

#	Model	Metric	Claimed	Verified	Status
1	DQU-CIR	(Recall@10+Recall@50)/2	71.77	—	Unverified
2	TMCIR	(Recall@10+Recall@50)/2	66.56	—	Unverified
3	SPN4CIR (SPRC)	(Recall@10+Recall@50)/2	66.41	—	Unverified
4	SPRC	(Recall@10+Recall@50)/2	64.85	—	Unverified
5	Candidate Set Re-ranking	(Recall@10+Recall@50)/2	62.15	—	Unverified
6	RUTIR (BLIP B/16)	(Recall@10+Recall@50)/2	61.32	—	Unverified
7	CASE	(Recall@10+Recall@50)/2	59.73	—	Unverified
8	CaLa	(Recall@10+Recall@50)/2	57.96	—	Unverified
9	BLIP4CIR+Bi	(Recall@10+Recall@50)/2	55.4	—	Unverified
10	CLIP4Cir (v3)	(Recall@10+Recall@50)/2	55.36	—	Unverified
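The Recall@K metrics above, including the averaged form (Recall@10+Recall@50)/2, can be sketched as follows. This is a minimal illustration that assumes exactly one target image per query; the function names and toy rankings are hypothetical, and the example uses K = 1 and K = 2 only to keep the data small.

```python
def recall_at_k(rankings, targets, k):
    """Fraction of queries whose single target appears in the top-k results."""
    hits = sum(1 for q, ranked in rankings.items() if targets[q] in ranked[:k])
    return hits / len(rankings)

def averaged_recall(rankings, targets, ks=(10, 50)):
    """Mean of Recall@K over the given cut-offs, e.g. (Recall@10 + Recall@50) / 2."""
    return sum(recall_at_k(rankings, targets, k) for k in ks) / len(ks)

# Toy data (hypothetical): four queries, each with a ranked list and one target.
rankings = {
    "q1": ["a", "b", "c"],
    "q2": ["b", "a", "c"],
    "q3": ["c", "b", "a"],
    "q4": ["a", "c", "b"],
}
targets = {"q1": "a", "q2": "a", "q3": "a", "q4": "b"}

print(recall_at_k(rankings, targets, 1))               # 0.25
print(recall_at_k(rankings, targets, 2))               # 0.5
print(averaged_recall(rankings, targets, ks=(1, 2)))   # 0.375
```

Multiplying by 100 gives the percentage form used in the tables.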

Flickr30K 1K test

#	Model	Metric	Claimed	Verified	Status
1	X-VLM (base)	R@1	86.9	—	Unverified
2	RCAR	R@1	62.6	—	Unverified
3	SGRAF	R@1	58.5	—	Unverified
4	LGSGM	R@1	57.4	—	Unverified
5	VisualSparta	R@1	57.4	—	Unverified
6	TERAN MrSw	R@1	56.5	—	Unverified
7	TERAN Symm.	R@1	55.7	—	Unverified
8	VSRN	R@1	54.7	—	Unverified
9	CAMP	R@1	51.5	—	Unverified
10	SCAN i-t	R@1	44	—	Unverified

CIRR

#	Model	Metric	Claimed	Verified	Status
1	TMCIR	(Recall@5+Recall_subset@1)/2	83.46	—	Unverified
2	SPN4CIR (SPRC)	(Recall@5+Recall_subset@1)/2	82.69	—	Unverified
3	SPRC2	(Recall@5+Recall_subset@1)/2	82.66	—	Unverified
4	SPRC	(Recall@5+Recall_subset@1)/2	81.39	—	Unverified
5	Candidate Set Re-ranking	(Recall@5+Recall_subset@1)/2	80.9	—	Unverified
6	CaLa	(Recall@5+Recall_subset@1)/2	78.74	—	Unverified
7	CASE (Pre-trained on LaSCo.Ca)	(Recall@5+Recall_subset@1)/2	78.25	—	Unverified
8	CASE	(Recall@5+Recall_subset@1)/2	77.5	—	Unverified
9	VISTA (base)	(Recall@5+Recall_subset@1)/2	75.9	—	Unverified
10	MMRet-MLLM	(Recall@5+Recall_subset@1)/2	75.7	—	Unverified
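The combined metric (Recall@5+Recall_subset@1)/2 mixes a recall over the full ranking with a recall over a restricted candidate set. The sketch below assumes that Recall_subset@K is obtained by keeping only the images of a predefined per-query subset before taking the top K; that reading, the function names, and the toy data are assumptions for illustration, not a reference implementation of any benchmark's evaluation code.

```python
def recall_at_k(ranked_ids, target_id, k):
    """1.0 if the target is among the top-k of the full ranking, else 0.0."""
    return 1.0 if target_id in ranked_ids[:k] else 0.0

def recall_subset_at_k(ranked_ids, target_id, subset_ids, k):
    """Same check, after restricting the ranking to the query's candidate subset."""
    subset = set(subset_ids)
    filtered = [image_id for image_id in ranked_ids if image_id in subset]
    return 1.0 if target_id in filtered[:k] else 0.0

def combined_score(queries):
    """(mean Recall@5 + mean Recall_subset@1) / 2 over a list of query records."""
    n = len(queries)
    r5 = sum(recall_at_k(q["ranked"], q["target"], 5) for q in queries) / n
    rs1 = sum(recall_subset_at_k(q["ranked"], q["target"], q["subset"], 1)
              for q in queries) / n
    return (r5 + rs1) / 2

# Toy data (hypothetical).
queries = [
    # Target "f" is ranked 6th overall but first within its subset.
    {"ranked": ["a", "b", "c", "d", "e", "f", "g"], "target": "f", "subset": ["f", "g"]},
    # Target "n" is ranked 2nd overall and first within its subset.
    {"ranked": ["m", "n", "o", "p"], "target": "n", "subset": ["n", "p"]},
]

print(combined_score(queries))  # 0.75
```

In this toy case Recall@5 is 0.5 and Recall_subset@1 is 1.0, which shows how the subset term can reward a model that separates close candidates even when the target misses the global top 5.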

SOP

#	Model	Metric	Claimed	Verified	Status
1	Unicom+ViT-L@336px	R@1	91.2	—	Unverified
2	ROADMAP (DeiT-B)	R@1	86	—	Unverified
3	CGD (SG/GS)	R@1	84.2	—	Unverified
4	ROADMAP (ResNet-50)	R@1	83.1	—	Unverified
5	ProxyNCA++	R@1	81.4	—	Unverified
6	PNP Loss	R@1	81.1	—	Unverified
7	Cross-Batch Memory	R@1	80.6	—	Unverified
8	Smooth-AP	R@1	80.1	—	Unverified
9	NormSoftmax2048 (ResNet-50)	R@1	79.5	—	Unverified
10	EPSHN512	R@1	78.3	—	Unverified

Flickr30k-CN

#	Model	Metric	Claimed	Verified	Status
1	InternVL-G-FT	R@1	85.9	—	Unverified
2	InternVL-C-FT	R@1	85.2	—	Unverified
3	CN-CLIP (ViT-L/14@336px)	R@1	84.4	—	Unverified
4	R2D2 (ViT-L/14)	R@1	84.4	—	Unverified
5	CN-CLIP (ViT-H/14)	R@1	83.8	—	Unverified
6	CN-CLIP (ViT-L/14)	R@1	82.7	—	Unverified
7	CN-CLIP (ViT-B/16)	R@1	79.1	—	Unverified
8	R2D2 (ViT-B)	R@1	78.3	—	Unverified
9	Wukong (ViT-L/14)	R@1	77.4	—	Unverified
10	Wukong (ViT-B/32)	R@1	67.6	—	Unverified

Oxf5k

#	Model	Metric	Claimed	Verified	Status
1	Offline Diffusion	mAP	96.2	—	Unverified
2	CNN+IME layer	mAP	92	—	Unverified
3	DELF+FT+ATT+DIR+QE	mAP	90	—	Unverified
4	DIR+QE*	mAP	89	—	Unverified