SOTAVerified

Image Retrieval

Image Retrieval is a fundamental, long-standing computer vision task: given a query, find the most similar images in a large database. It is often treated as a form of fine-grained, instance-level classification. Alongside classification and cross-modal retrieval, it is a core part of image recognition. Because it ranks images by visual similarity and other criteria, image retrieval lets users find relevant images efficiently, which makes it central to applications such as search and recommendation.
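
As a rough illustration of ranking by visual similarity, the sketch below scores every database image against a query using cosine similarity and returns the closest matches. It is a minimal example, not any listed paper's method: the function names (`cosine_similarity`, `retrieve`), the file names, and the 3-dimensional descriptors are all hypothetical, and real systems use learned embeddings with many more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two descriptor vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero descriptor as matching nothing
    return dot / (norm_a * norm_b)


def retrieve(query, database, top_k=3):
    """Return the top_k (image_id, score) pairs, most similar first."""
    scored = [
        (image_id, cosine_similarity(query, descriptor))
        for image_id, descriptor in database.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy descriptors (assumed values, for illustration only).
database = {
    "tower_a.jpg": [0.9, 0.1, 0.0],
    "tower_b.jpg": [0.8, 0.2, 0.1],
    "beach.jpg": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.0]
results = retrieve(query, database, top_k=2)
print([image_id for image_id, _ in results])
```

An exhaustive scan like this is linear in the database size; large-scale systems typically replace it with an approximate nearest-neighbor index.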

Extending CLIP for Category-to-image Retrieval in E-commerce


Papers

Showing 851–900 of 2239 papers

Title | Status | Hype
Fractal Descriptors of Texture Images Based on the Triangular Prism Dimension | | 0
Fractional Local Neighborhood Intensity Pattern for Image Retrieval using Genetic Algorithm | | 0
Freehand Sketch Recognition Using Deep Features | | 0
Dual Embedding Expansion for Vehicle Re-identification | | 0
Freestyle Sketch-in-the-Loop Image Segmentation | | 0
Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | | 0
Frequency Disentangled Residual Network | | 0
From A Glance to "Gotcha": Interactive Facial Image Retrieval with Progressive Relevance Feedback | | 0
HORUS: Multimodal Large Language Models Framework for Video Retrieval at VBS 2025 | | 0
From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval | | 0
From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | | 0
Content-Based Medical Image Retrieval with Opponent Class Adaptive Margin Loss | | 0
A Revisit on Deep Hashings for Large-scale Content Based Image Retrieval | | 0
Homography augumented momentum constrastive learning for SAR image retrieval | | 0
Full-attention based Neural Architecture Search using Context Auto-regression | | 0
Full-Network Embedding in a Multimodal Embedding Pipeline | | 0
Further results on dissimilarity spaces for hyperspectral images RF-CBIR | | 0
Fuse and Attend: Generalized Embedding Learning for Art and Sketches | | 0
Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image Retrieval | | 0
G2DA: Geometry-Guided Dual-Alignment Learning for RGB-Infrared Person Re-Identification | | 0
Gabor Barcodes for Medical Image Retrieval | | 0
Garment Attribute Manipulation with Multi-level Attention | | 0
Draw and Tell: Multimodal Descriptions Outperform Verbal- or Sketch-Only Descriptions in an Image Retrieval Task | | 0
Do We Really Need Scene-specific Pose Encoders? | | 0
Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing | | 0
GeneCIS: A Benchmark for General Conditional Image Similarity | | 0
A Review on Image Texture Analysis Methods | | 0
Histopathology WSI Encoding based on GCNs for Scalable and Efficient Retrieval of Diagnostically Relevant Regions | | 0
HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels | | 0
Generalising Fine-Grained Sketch-Based Image Retrieval | | 0
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation | | 0
Generalized Multi-view Embedding for Visual Recognition and Cross-modal Retrieval | | 0
Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval | | 0
Generalized Visual Relation Detection with Diffusion Models | | 0
Generating Binary Tags for Fast Medical Image Retrieval Based on Convolutional Nets and Radon Transform | | 0
Generating Compositional Color Representations from Text | | 0
Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval | | 0
Generating Holistic 3D Scene Abstractions for Text-based Image Retrieval | | 0
Domain-invariant feature learning in brain MR imaging for content-based image retrieval | | 0
Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning | | 0
A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches | | 0
Generative Adversarial Image Synthesis with Decision Tree Latent Controller | | 0
Domain-Independent Captioning of Domain-Specific Images | | 0
Generative Attribute Controller With Conditional Filtered Generative Adversarial Networks | | 0
Classifying magnetic resonance image modalities with convolutional neural networks | | 0
A Generic Image Retrieval Method for Date Estimation of Historical Document Collections | | 0
Generative Zero-Shot Composed Image Retrieval | | 0
Genetic Algorithms for the Optimization of Diffusion Parameters in Content-Based Image Retrieval | | 0
GeoCapsNet: Aerial to Ground view Image Geo-localization using Capsule Network | | 0
Class Anchor Margin Loss for Content-Based Image Retrieval | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SuperGlobal | mAP | 80.2 | | Unverified
2 | AMES | mAP | 80 | | Unverified
3 | Hypergraph propagation+community selection | mAP | 73 | | Unverified
4 | Token | mAP | 66.57 | | Unverified
5 | DELG+ α QE reranking + RRT reranking | mAP | 64 | | Unverified
6 | FIRe | mAP | 61.2 | | Unverified
7 | HOW | mAP | 56.9 | | Unverified
8 | ResNet101+ArcFace GLDv2-train-clean | mAP | 51.6 | | Unverified
9 | DELF–HQE+SP | mAP | 50.3 | | Unverified
10 | HesAff–rSIFT–HQE+SP | mAP | 49.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AMES | mAP | 90.7 | | Unverified
2 | Hypergraph propagation+Community selection | mAP | 88.4 | | Unverified
3 | Token | mAP | 82.28 | | Unverified
4 | FIRe | mAP | 81.8 | | Unverified
5 | DELG+ α QE reranking + RRT reranking | mAP | 80.4 | | Unverified
6 | HOW | mAP | 79.4 | | Unverified
7 | ResNet101+ArcFace GLDv2-train-clean | mAP | 74.2 | | Unverified
8 | DELF–HQE+SP | mAP | 73.4 | | Unverified
9 | HesAff–rSIFT–HQE+SP | mAP | 71.3 | | Unverified
10 | DELF–ASMK*+SP | mAP | 67.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AMES | mAP | 89.7 | | Unverified
2 | SuperGlobal | mAP | 86.7 | | Unverified
3 | Hypergraph propagation | mAP | 83.3 | | Unverified
4 | Token | mAP | 78.56 | | Unverified
5 | DELG+ α QE reranking + RRT reranking | mAP | 77.7 | | Unverified
6 | ResNet101+ArcFace GLDv2-train-clean | mAP | 70.3 | | Unverified
7 | FIRe | mAP | 70 | | Unverified
8 | DELF–HQE+SP | mAP | 69.3 | | Unverified
9 | HOW | mAP | 62.4 | | Unverified
10 | R–R-MAC | mAP | 59.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AMES | mAP | 94.9 | | Unverified
2 | Hypergraph propagation | mAP | 92.6 | | Unverified
3 | Token | mAP | 89.34 | | Unverified
4 | DELG+ α QE reranking + RRT reranking | mAP | 88.5 | | Unverified
5 | FIRe | mAP | 85.3 | | Unverified
6 | ResNet101+ArcFace GLDv2-train-clean | mAP | 84.9 | | Unverified
7 | DELF–HQE+SP | mAP | 84 | | Unverified
8 | HOW | mAP | 81.6 | | Unverified
9 | R–R-MAC | mAP | 78.9 | | Unverified
10 | R–GeM | mAP | 77.2 | | Unverified
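
The mAP values in the tables above are claimed scores, and each benchmark defines its own evaluation protocol. As a reminder of what the metric measures in its textbook form, the sketch below computes average precision per query and averages it over queries. The function names and the toy rankings are hypothetical, and the sketch ignores benchmark-specific rules (for example, excluding "junk" images from a ranking), so it should not be expected to reproduce any number listed here.

```python
def average_precision(ranked_ids, relevant_ids):
    """Mean of precision@rank taken at each rank where a relevant image appears."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, image_id in enumerate(ranked_ids, start=1):
        if image_id in relevant:
            hits += 1
            precision_sum += hits / rank
    # Relevant images that were never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(rankings, ground_truth):
    """Average the per-query AP over all queries in `rankings`."""
    aps = [average_precision(rankings[q], ground_truth[q]) for q in rankings]
    return sum(aps) / len(aps)


# Toy data (assumed): two queries with their ranked results and relevant sets.
rankings = {"q1": ["a", "b", "c", "d"], "q2": ["x", "y", "z"]}
ground_truth = {"q1": ["a", "c"], "q2": ["y"]}

map_score = mean_average_precision(rankings, ground_truth)
print(f"mAP = {100 * map_score:.2f}")  # the tables appear to report percentages
```

For `q1` the relevant images sit at ranks 1 and 3, giving AP = (1/1 + 2/3) / 2; for `q2` the single relevant image is at rank 2, giving AP = 1/2.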

# | Model | Metric | Claimed | Verified | Status
1 | Swin-T (MosaiCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 44.5 | | Unverified
2 | RN-50 (MosaiCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 44.4 | | Unverified
3 | MosaiCLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 41.5 | | Unverified
4 | RN-50 (NegCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 41.4 | | Unverified
5 | MosaiCLIP (CC-FT) | Recall@1 (HN-Atom, UC) | 40.9 | | Unverified
6 | Swin-T (NegCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 39.6 | | Unverified
7 | CLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 39.5 | | Unverified
8 | ViT-L-14 (LAION400M) | Recall@1 (HN-Atom + HN-Comp, SC) | 39.44 | | Unverified
9 | NegCLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 39 | | Unverified
10 | CLIP-FT (YFCC-FT) | Recall@1 (HN-Atom, UC) | 38.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DQU-CIR | (Recall@10+Recall@50)/2 | 71.77 | | Unverified
2 | TMCIR | (Recall@10+Recall@50)/2 | 66.56 | | Unverified
3 | SPN4CIR (SPRC) | (Recall@10+Recall@50)/2 | 66.41 | | Unverified
4 | SPRC | (Recall@10+Recall@50)/2 | 64.85 | | Unverified
5 | Candidate Set Re-ranking | (Recall@10+Recall@50)/2 | 62.15 | | Unverified
6 | RUTIR (BLIP B/16) | (Recall@10+Recall@50)/2 | 61.32 | | Unverified
7 | CASE | (Recall@10+Recall@50)/2 | 59.73 | | Unverified
8 | CaLa | (Recall@10+Recall@50)/2 | 57.96 | | Unverified
9 | BLIP4CIR+Bi | (Recall@10+Recall@50)/2 | 55.4 | | Unverified
10 | CLIP4Cir (v3) | (Recall@10+Recall@50)/2 | 55.36 | | Unverified
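
The metric in the table above averages two recall values. A minimal sketch follows, assuming the common single-target definition in which Recall@K is the percentage of queries whose target image appears among the top K results. The function names (`recall_at_k`, `averaged_recall`) and the toy gallery are hypothetical, and individual benchmarks may define recall differently.

```python
def recall_at_k(rankings, targets, k):
    """Percentage of queries whose target appears in the top-k results."""
    hits = sum(1 for q, ranked in rankings.items() if targets[q] in ranked[:k])
    return 100.0 * hits / len(rankings)


def averaged_recall(rankings, targets, ks=(10, 50)):
    """Mean of Recall@K over the given cutoffs, e.g. (Recall@10+Recall@50)/2."""
    return sum(recall_at_k(rankings, targets, k) for k in ks) / len(ks)


# Toy setup (assumed): every query returns the same ranking over 60 gallery ids,
# so the target id is also its zero-based position in the ranking.
gallery = list(range(60))
rankings = {"q1": gallery, "q2": gallery, "q3": gallery, "q4": gallery}
targets = {"q1": 5, "q2": 30, "q3": 55, "q4": 0}

r10 = recall_at_k(rankings, targets, 10)  # q1 and q4 hit
r50 = recall_at_k(rankings, targets, 50)  # q1, q2 and q4 hit
score = averaged_recall(rankings, targets)
print(r10, r50, score)
```

The same `recall_at_k` helper with `k=1` corresponds to the R@1 and Recall@1 columns used in other tables on this page, under the same single-target assumption.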
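
# | Model | Metric | Claimed | Verified | Status
1 | X-VLM (base) | R@1 | 86.9 | | Unverified
2 | RCAR | R@1 | 62.6 | | Unverified
3 | SGRAF | R@1 | 58.5 | | Unverified
4 | LGSGM | R@1 | 57.4 | | Unverified
5 | VisualSparta | R@1 | 57.4 | | Unverified
6 | TERAN MrSw | R@1 | 56.5 | | Unverified
7 | TERAN Symm. | R@1 | 55.7 | | Unverified
8 | VSRN | R@1 | 54.7 | | Unverified
9 | CAMP | R@1 | 51.5 | | Unverified
10 | SCAN i-t | R@1 | 44 | | Unverified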

# | Model | Metric | Claimed | Verified | Status
1 | TMCIR | (Recall@5+Recall_subset@1)/2 | 83.46 | | Unverified
2 | SPN4CIR (SPRC) | (Recall@5+Recall_subset@1)/2 | 82.69 | | Unverified
3 | SPRC2 | (Recall@5+Recall_subset@1)/2 | 82.66 | | Unverified
4 | SPRC | (Recall@5+Recall_subset@1)/2 | 81.39 | | Unverified
5 | Candidate Set Re-ranking | (Recall@5+Recall_subset@1)/2 | 80.9 | | Unverified
6 | CaLa | (Recall@5+Recall_subset@1)/2 | 78.74 | | Unverified
7 | CASE (Pre-trained on LaSCo.Ca) | (Recall@5+Recall_subset@1)/2 | 78.25 | | Unverified
8 | CASE | (Recall@5+Recall_subset@1)/2 | 77.5 | | Unverified
9 | VISTA (base) | (Recall@5+Recall_subset@1)/2 | 75.9 | | Unverified
10 | MMRet-MLLM | (Recall@5+Recall_subset@1)/2 | 75.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Unicom+ViT-L@336px | R@1 | 91.2 | | Unverified
2 | ROADMAP (DeiT-B) | R@1 | 86 | | Unverified
3 | CGD (SG/GS) | R@1 | 84.2 | | Unverified
4 | ROADMAP (ResNet-50) | R@1 | 83.1 | | Unverified
5 | ProxyNCA++ | R@1 | 81.4 | | Unverified
6 | PNP Loss | R@1 | 81.1 | | Unverified
7 | Cross-Batch Memory | R@1 | 80.6 | | Unverified
8 | Smooth-AP | R@1 | 80.1 | | Unverified
9 | NormSoftmax2048 (ResNet-50) | R@1 | 79.5 | | Unverified
10 | EPSHN512 | R@1 | 78.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | InternVL-G-FT | R@1 | 85.9 | | Unverified
2 | InternVL-C-FT | R@1 | 85.2 | | Unverified
3 | CN-CLIP (ViT-L/14@336px) | R@1 | 84.4 | | Unverified
4 | R2D2 (ViT-L/14) | R@1 | 84.4 | | Unverified
5 | CN-CLIP (ViT-H/14) | R@1 | 83.8 | | Unverified
6 | CN-CLIP (ViT-L/14) | R@1 | 82.7 | | Unverified
7 | CN-CLIP (ViT-B/16) | R@1 | 79.1 | | Unverified
8 | R2D2 (ViT-B) | R@1 | 78.3 | | Unverified
9 | Wukong (ViT-L/14) | R@1 | 77.4 | | Unverified
10 | Wukong (ViT-B/32) | R@1 | 67.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Offline Diffusion | MAP | 96.2 | | Unverified
2 | CNN+IME layer | MAP | 92 | | Unverified
3 | DELF+FT+ATT+DIR+QE | MAP | 90 | | Unverified
4 | DIR+QE* | MAP | 89 | | Unverified