SOTAVerified

Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Showing 701750 of 2239 papers

TitleStatusHype
Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval0
Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval0
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval0
Vector search with small radiuses0
Does the Performance of Text-to-Image Retrieval Models Generalize Beyond Captions-as-a-Query?Code0
Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer0
Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers0
You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval0
How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?0
Texture image retrieval using a classification and contourlet-based features0
The Claude 3 Model Family: Opus, Sonnet, Haiku0
Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval0
Asymmetric Feature Fusion for Image Retrieval0
Structure Similarity Preservation Learning for Asymmetric Image RetrievalCode0
Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport0
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing0
CFIR: Fast and Effective Long-Text To Image Retrieval for Large CorporaCode0
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency0
Large Language Models for Captioning and Retrieving Remote Sensing Images0
Measuring Machine Learning Harms from Stereotypes Requires Understanding Who Is Harmed by Which Errors in What Ways0
Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric LearningCode0
Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenizationCode0
PICS: Pipeline for Image Captioning and Search0
Cross-Modal Coordination Across a Diverse Set of Input Modalities0
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors0
Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval0
Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode0
PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion0
Cross-Modality Perturbation Synergy Attack for Person Re-identification0
Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging0
On Image Search in Histopathology0
Modality-Aware Representation Learning for Zero-shot Sketch-based Image RetrievalCode0
Analysis and Validation of Image Search Engines in Histopathology0
Benchmarking PathCLIP for Pathology Image Analysis0
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving0
Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability0
ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language TasksCode0
Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion0
Characteristics Matching Based Hash Codes Generation for Efficient Fine-grained Image Retrieval0
Recursive Distillation for Open-Set Distributed Robot Localization0
VQA4CIR: Boosting Composed Image Retrieval with Visual Question AnsweringCode0
Advancing Image Retrieval with Few-Shot Learning and Relevance FeedbackCode0
Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image RetrievalCode0
Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking0
C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation0
Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques0
Dynamic Weighted Combiner for Mixed-Modal Image RetrievalCode0
The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model0
Unveiling Objects with SOLA: An Annotation-Free Image Search on the Object Level for Automotive Data Sets0
Improve Supervised Representation Learning with Masked Image Modeling0
Show:102550
← PrevPage 15 of 45Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SuperGlobalmAP80.2Unverified
2AMESmAP80Unverified
3Hypergraph propagation+community selectionmAP73Unverified
4TokenmAP66.57Unverified
5DELG+ α QE reranking+ RRT rerankingmAP64Unverified
6FIRemAP61.2Unverified
7HOWmAP56.9Unverified
8ResNet101+ArcFace GLDv2-train-cleanmAP51.6Unverified
9DELF–HQE+SPmAP50.3Unverified
10HesAff–rSIFT–HQE+SPmAP49.7Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP90.7Unverified
2Hypergraph propagation+Community selectionmAP88.4Unverified
3TokenmAP82.28Unverified
4FIRemAP81.8Unverified
5DELG+ α QE reranking + RRT rerankingmAP80.4Unverified
6HOWmAP79.4Unverified
7ResNet101+ArcFace GLDv2-train-cleanmAP74.2Unverified
8DELF–HQE+SPmAP73.4Unverified
9HesAff–rSIFT–HQE+SPmAP71.3Unverified
10DELF–ASMK*+SPmAP67.8Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP89.7Unverified
2SuperGlobalmAP86.7Unverified
3Hypergraph propagationmAP83.3Unverified
4TokenmAP78.56Unverified
5DELG+ α QE reranking + RRT rerankingmAP77.7Unverified
6ResNet101+ArcFace GLDv2-train-cleanmAP70.3Unverified
7FIRemAP70Unverified
8DELF–HQE+SPmAP69.3Unverified
9HOWmAP62.4Unverified
10R–R-MACmAP59.4Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP94.9Unverified
2Hypergraph propagationmAP92.6Unverified
3TokenmAP89.34Unverified
4DELG+ α QE reranking + RRT rerankingmAP88.5Unverified
5FIRemAP85.3Unverified
6ResNet101+ArcFace GLDv2-train-cleanmAP84.9Unverified
7DELF–HQE+SPmAP84Unverified
8HOWmAP81.6Unverified
9R–R-MACmAP78.9Unverified
10R–GeMmAP77.2Unverified
#ModelMetricClaimedVerifiedStatus
1Swin-T (MosaiCLIP, CC-12M)Recall@1 (HN-Atom, UC)44.5Unverified
2RN-50 (MosaiCLIP, CC-12M)Recall@1 (HN-Atom, UC)44.4Unverified
3MosaiCLIP (YFCC-FT)Recall@1 (HN-Atom, UC)41.5Unverified
4RN-50 (NegCLIP, CC-12M)Recall@1 (HN-Atom, UC)41.4Unverified
5MosaiCLIP (CC-FT)Recall@1 (HN-Atom, UC)40.9Unverified
6Swin-T (NegCLIP, CC-12M)Recall@1 (HN-Atom, UC)39.6Unverified
7CLIP (YFCC-FT)Recall@1 (HN-Atom, UC)39.5Unverified
8ViT-L-14 (LAION400M)Recall@1 (HN-Atom + HN-Comp, SC)39.44Unverified
9NegCLIP (YFCC-FT)Recall@1 (HN-Atom, UC)39Unverified
10CLIP-FT (YFCC-FT)Recall@1 (HN-Atom, UC)38.3Unverified
#ModelMetricClaimedVerifiedStatus
1DQU-CIR(Recall@10+Recall@50)/271.77Unverified
2TMCIR(Recall@10+Recall@50)/266.56Unverified
3SPN4CIR (SPRC)(Recall@10+Recall@50)/266.41Unverified
4SPRC(Recall@10+Recall@50)/264.85Unverified
5Candidate Set Re-ranking(Recall@10+Recall@50)/262.15Unverified
6RUTIR (BLIP B/16)(Recall@10+Recall@50)/261.32Unverified
7CASE(Recall@10+Recall@50)/259.73Unverified
8CaLa(Recall@10+Recall@50)/257.96Unverified
9BLIP4CIR+Bi(Recall@10+Recall@50)/255.4Unverified
10CLIP4Cir (v3)(Recall@10+Recall@50)/255.36Unverified
#ModelMetricClaimedVerifiedStatus
1X-VLM (base)R@186.9Unverified
2RCARR@162.6Unverified
3SGRAFR@158.5Unverified
4LGSGMR@157.4Unverified
5VisualSpartaR@157.4Unverified
6TERAN MrSwR@156.5Unverified
7TERAN Symm.R@155.7Unverified
8VSRNR@154.7Unverified
9CAMPR@151.5Unverified
10SCAN i-tR@144Unverified
#ModelMetricClaimedVerifiedStatus
1TMCIR(Recall@5+Recall_subset@1)/283.46Unverified
2SPN4CIR (SPRC)(Recall@5+Recall_subset@1)/282.69Unverified
3SPRC2(Recall@5+Recall_subset@1)/282.66Unverified
4SPRC(Recall@5+Recall_subset@1)/281.39Unverified
5Candidate Set Re-ranking(Recall@5+Recall_subset@1)/280.9Unverified
6CaLa(Recall@5+Recall_subset@1)/278.74Unverified
7CASE (Pre-trained on LaSCo.Ca)(Recall@5+Recall_subset@1)/278.25Unverified
8CASE(Recall@5+Recall_subset@1)/277.5Unverified
9VISTA (base)(Recall@5+Recall_subset@1)/275.9Unverified
10MMRet-MLLM(Recall@5+Recall_subset@1)/275.7Unverified
#ModelMetricClaimedVerifiedStatus
1Unicom+ViT-L@336pxR@191.2Unverified
2ROADMAP (DeiT-B)R@186Unverified
3CGD (SG/GS)R@184.2Unverified
4ROADMAP (ResNet-50)R@183.1Unverified
5ProxyNCA++R@181.4Unverified
6PNP LossR@181.1Unverified
7Cross-Batch MemoryR@180.6Unverified
8Smooth-APR@180.1Unverified
9NormSoftmax2048 (ResNet-50)R@179.5Unverified
10EPSHN512R@178.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternVL-G-FTR@185.9Unverified
2InternVL-C-FTR@185.2Unverified
3CN-CLIP (ViT-L/14@336px)R@184.4Unverified
4R2D2 (ViT-L/14)R@184.4Unverified
5CN-CLIP (ViT-H/14)R@183.8Unverified
6CN-CLIP (ViT-L/14)R@182.7Unverified
7CN-CLIP (ViT-B/16)R@179.1Unverified
8R2D2 (ViT-B)R@178.3Unverified
9Wukong (ViT-L/14)R@177.4Unverified
10Wukong (ViT-B/32)R@167.6Unverified
#ModelMetricClaimedVerifiedStatus
1Offline DiffusionMAP96.2Unverified
2CNN+IME layerMAP92Unverified
3DELF+FT+ATT+DIR+QEMAP90Unverified
4DIR+QE*MAP89Unverified