SOTAVerified

Image Retrieval

Image retrieval is a fundamental, long-standing computer vision task: given a query, find similar images in a large database. It is often treated as a form of fine-grained, instance-level classification, and it sits alongside classification and cross-modal retrieval as a core part of image recognition. By ranking database images by visual similarity and other criteria, image retrieval lets users discover relevant images efficiently, which makes it a crucial tool in applications such as search and recommendation.
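As a rough illustration of similarity-based retrieval, the sketch below ranks database images by cosine similarity between feature vectors. It assumes each image has already been turned into a fixed-length embedding by some model; the file names, the three-dimensional vectors, and the `retrieve` helper are hypothetical and are not taken from any method listed on this page.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def retrieve(query, database, top_k=3):
    """Return the top_k (image_id, score) pairs, most similar first."""
    scored = [(image_id, cosine_similarity(query, vec))
              for image_id, vec in database.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical embeddings; a real system would compute these with a trained model.
database = {
    "tower_day.jpg": [0.9, 0.1, 0.0],
    "tower_night.jpg": [0.8, 0.3, 0.1],
    "beach.jpg": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]

results = retrieve(query, database, top_k=2)
for image_id, score in results:
    print(f"{image_id}: {score:.3f}")
```

This exhaustive scan is only practical for small collections; large databases typically rely on approximate nearest-neighbour indexes or compact codes such as hashes.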

Extending CLIP for Category-to-image Retrieval in E-commerce

(Image credit: DELF)

Papers

Showing 1901–1950 of 2239 papers

| Title | Status | Hype |
| --- | --- | --- |
| Image-Image Search for Comparable Corpora Construction | | 0 |
| Unsupervised Learning of Spoken Language with Visual Context | | 0 |
| Fast Supervised Discrete Hashing and its Analysis | Code | 0 |
| Generating Holistic 3D Scene Abstractions for Text-based Image Retrieval | | 0 |
| Voronoi-based compact image descriptors: Efficient Region-of-Interest retrieval with VLAD and deep-learning-based descriptors | | 0 |
| Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images | | 0 |
| Interferences in match kernels | | 0 |
| Inverting The Generator Of A Generative Adversarial Network | | 0 |
| Compensating for Large In-Plane Rotations in Natural Images | | 0 |
| A Discriminatively Learned CNN Embedding for Person Re-identification | Code | 0 |
| On the Exploration of Convolutional Fusion Networks for Visual Recognition | | 0 |
| Efficient Diffusion on Region Manifolds: Recovering Small Objects with Compact CNN Representations | Code | 0 |
| Generalisation and Sharing in Triplet Convnets for Sketch based Visual Search | | 0 |
| Feature Extraction and Soft Computing Methods for Aerospace Structure Defect Classification | | 0 |
| Learning to Play Guess Who? and Inventing a Grounded Language as a Consequence | Code | 0 |
| Texture and Color-based Image Retrieval Using the Local Extrema Features and Riemannian Distance | | 0 |
| What Is the Best Practice for CNNs Applied to Visual Instance Retrieval? | Code | 0 |
| Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics | | 0 |
| Local Similarity-Aware Deep Feature Embedding | | 0 |
| End-to-end Learning of Deep Visual Representations for Image Retrieval | Code | 0 |
| Multi-view metric learning for multi-instance image classification | | 0 |
| Adaptive Substring Extraction and Modified Local NBNN Scoring for Binary Feature-based Local Mobile Visual Search without False Positives | | 0 |
| Learning Low Dimensional Convolutional Neural Networks for High-Resolution Remote Sensing Image Retrieval | | 0 |
| Content Based Image Retrieval (CBIR) in Remote Clinical Diagnosis and Healthcare | | 0 |
| Content-Based Image Retrieval Using Multiresolution Analysis Of Shape-Based Classified Images | | 0 |
| MinMax Radon Barcodes for Medical Image Retrieval | | 0 |
| Stacked Autoencoders for Medical Image Search | | 0 |
| Image Retrieval with Fisher Vectors of Binary Features | | 0 |
| Three Tiers Neighborhood Graph and Multi-graph Fusion Ranking for Multi-feature Image Retrieval: A Manifold Aspect | | 0 |
| Perceptual uniform descriptor and Ranking on manifold: A bridge between image representation and ranking for image retrieval | | 0 |
| Barcodes for Medical Image Retrieval Using Autoencoded Radon Transform | | 0 |
| Radon-Gabor Barcodes for Medical Image Retrieval | | 0 |
| Combining Texture and Shape Cues for Object Recognition With Minimal Supervision | | 0 |
| Automatic Visual Theme Discovery from Joint Image and Text Corpora | | 0 |
| Stochastic Learning of Multi-Instance Dictionary for Earth Mover's Distance based Histogram Comparison | | 0 |
| Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation | | 0 |
| Detecting Dominant Vanishing Points in Natural Scenes with Application to Composition-Sensitive Image Retrieval | | 0 |
| DeepDiary: Automatic Caption Generation for Lifelogging Image Streams | Code | 0 |
| Deep Hashing: A Joint Approach for Image Signature Learning | | 0 |
| Content-based image retrieval tutorial | Code | 0 |
| Multi-View Product Image Search Using Deep ConvNets Representations | | 0 |
| OnionNet: Sharing Features in Cascaded Deep Classifiers | | 0 |
| Learning Joint Representations of Videos and Sentences with Web Image Search | | 0 |
| SIFT Meets CNN: A Decade Survey of Instance Retrieval | Code | 0 |
| Aggregating Binary Local Descriptors for Image Retrieval | | 0 |
| PicHunt: Social Media Image Retrieval for Improved Law Enforcement | | 0 |
| A Shared Task on Multimodal Machine Translation and Crosslingual Image Description | | 0 |
| A Multi-media Approach to Cross-lingual Entity Knowledge Transfer | | 0 |
| Visual Relationship Detection with Language Priors | | 0 |
| SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval | | 0 |

Benchmark Results

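| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SuperGlobal | mAP | 80.2 | | Unverified |
| 2 | AMES | mAP | 80 | | Unverified |
| 3 | Hypergraph propagation+community selection | mAP | 73 | | Unverified |
| 4 | Token | mAP | 66.57 | | Unverified |
| 5 | DELG+ α QE reranking + RRT reranking | mAP | 64 | | Unverified |
| 6 | FIRe | mAP | 61.2 | | Unverified |
| 7 | HOW | mAP | 56.9 | | Unverified |
| 8 | ResNet101+ArcFace GLDv2-train-clean | mAP | 51.6 | | Unverified |
| 9 | DELF–HQE+SP | mAP | 50.3 | | Unverified |
| 10 | HesAff–rSIFT–HQE+SP | mAP | 49.7 | | Unverified |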
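
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AMES | mAP | 90.7 | | Unverified |
| 2 | Hypergraph propagation+Community selection | mAP | 88.4 | | Unverified |
| 3 | Token | mAP | 82.28 | | Unverified |
| 4 | FIRe | mAP | 81.8 | | Unverified |
| 5 | DELG+ α QE reranking + RRT reranking | mAP | 80.4 | | Unverified |
| 6 | HOW | mAP | 79.4 | | Unverified |
| 7 | ResNet101+ArcFace GLDv2-train-clean | mAP | 74.2 | | Unverified |
| 8 | DELF–HQE+SP | mAP | 73.4 | | Unverified |
| 9 | HesAff–rSIFT–HQE+SP | mAP | 71.3 | | Unverified |
| 10 | DELF–ASMK*+SP | mAP | 67.8 | | Unverified |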
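
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AMES | mAP | 89.7 | | Unverified |
| 2 | SuperGlobal | mAP | 86.7 | | Unverified |
| 3 | Hypergraph propagation | mAP | 83.3 | | Unverified |
| 4 | Token | mAP | 78.56 | | Unverified |
| 5 | DELG+ α QE reranking + RRT reranking | mAP | 77.7 | | Unverified |
| 6 | ResNet101+ArcFace GLDv2-train-clean | mAP | 70.3 | | Unverified |
| 7 | FIRe | mAP | 70 | | Unverified |
| 8 | DELF–HQE+SP | mAP | 69.3 | | Unverified |
| 9 | HOW | mAP | 62.4 | | Unverified |
| 10 | R–R-MAC | mAP | 59.4 | | Unverified |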
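
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AMES | mAP | 94.9 | | Unverified |
| 2 | Hypergraph propagation | mAP | 92.6 | | Unverified |
| 3 | Token | mAP | 89.34 | | Unverified |
| 4 | DELG+ α QE reranking + RRT reranking | mAP | 88.5 | | Unverified |
| 5 | FIRe | mAP | 85.3 | | Unverified |
| 6 | ResNet101+ArcFace GLDv2-train-clean | mAP | 84.9 | | Unverified |
| 7 | DELF–HQE+SP | mAP | 84 | | Unverified |
| 8 | HOW | mAP | 81.6 | | Unverified |
| 9 | R–R-MAC | mAP | 78.9 | | Unverified |
| 10 | R–GeM | mAP | 77.2 | | Unverified |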

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Swin-T (MosaiCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 44.5 | | Unverified |
| 2 | RN-50 (MosaiCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 44.4 | | Unverified |
| 3 | MosaiCLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 41.5 | | Unverified |
| 4 | RN-50 (NegCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 41.4 | | Unverified |
| 5 | MosaiCLIP (CC-FT) | Recall@1 (HN-Atom, UC) | 40.9 | | Unverified |
| 6 | Swin-T (NegCLIP, CC-12M) | Recall@1 (HN-Atom, UC) | 39.6 | | Unverified |
| 7 | CLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 39.5 | | Unverified |
| 8 | ViT-L-14 (LAION400M) | Recall@1 (HN-Atom + HN-Comp, SC) | 39.44 | | Unverified |
| 9 | NegCLIP (YFCC-FT) | Recall@1 (HN-Atom, UC) | 39 | | Unverified |
| 10 | CLIP-FT (YFCC-FT) | Recall@1 (HN-Atom, UC) | 38.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DQU-CIR | (Recall@10+Recall@50)/2 | 71.77 | | Unverified |
| 2 | TMCIR | (Recall@10+Recall@50)/2 | 66.56 | | Unverified |
| 3 | SPN4CIR (SPRC) | (Recall@10+Recall@50)/2 | 66.41 | | Unverified |
| 4 | SPRC | (Recall@10+Recall@50)/2 | 64.85 | | Unverified |
| 5 | Candidate Set Re-ranking | (Recall@10+Recall@50)/2 | 62.15 | | Unverified |
| 6 | RUTIR (BLIP B/16) | (Recall@10+Recall@50)/2 | 61.32 | | Unverified |
| 7 | CASE | (Recall@10+Recall@50)/2 | 59.73 | | Unverified |
| 8 | CaLa | (Recall@10+Recall@50)/2 | 57.96 | | Unverified |
| 9 | BLIP4CIR+Bi | (Recall@10+Recall@50)/2 | 55.4 | | Unverified |
| 10 | CLIP4Cir (v3) | (Recall@10+Recall@50)/2 | 55.36 | | Unverified |
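
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | X-VLM (base) | R@1 | 86.9 | | Unverified |
| 2 | RCAR | R@1 | 62.6 | | Unverified |
| 3 | SGRAF | R@1 | 58.5 | | Unverified |
| 4 | VisualSparta | R@1 | 57.4 | | Unverified |
| 5 | LGSGM | R@1 | 57.4 | | Unverified |
| 6 | TERAN MrSw | R@1 | 56.5 | | Unverified |
| 7 | TERAN Symm. | R@1 | 55.7 | | Unverified |
| 8 | VSRN | R@1 | 54.7 | | Unverified |
| 9 | CAMP | R@1 | 51.5 | | Unverified |
| 10 | SCAN i-t | R@1 | 44 | | Unverified |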

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | TMCIR | (Recall@5+Recall_subset@1)/2 | 83.46 | | Unverified |
| 2 | SPN4CIR (SPRC) | (Recall@5+Recall_subset@1)/2 | 82.69 | | Unverified |
| 3 | SPRC2 | (Recall@5+Recall_subset@1)/2 | 82.66 | | Unverified |
| 4 | SPRC | (Recall@5+Recall_subset@1)/2 | 81.39 | | Unverified |
| 5 | Candidate Set Re-ranking | (Recall@5+Recall_subset@1)/2 | 80.9 | | Unverified |
| 6 | CaLa | (Recall@5+Recall_subset@1)/2 | 78.74 | | Unverified |
| 7 | CASE (Pre-trained on LaSCo.Ca) | (Recall@5+Recall_subset@1)/2 | 78.25 | | Unverified |
| 8 | CASE | (Recall@5+Recall_subset@1)/2 | 77.5 | | Unverified |
| 9 | VISTA (base) | (Recall@5+Recall_subset@1)/2 | 75.9 | | Unverified |
| 10 | MMRet-MLLM | (Recall@5+Recall_subset@1)/2 | 75.7 | | Unverified |
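
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Unicom+ViT-L@336px | R@1 | 91.2 | | Unverified |
| 2 | ROADMAP (DeiT-B) | R@1 | 86 | | Unverified |
| 3 | CGD (SG/GS) | R@1 | 84.2 | | Unverified |
| 4 | ROADMAP (ResNet-50) | R@1 | 83.1 | | Unverified |
| 5 | ProxyNCA++ | R@1 | 81.4 | | Unverified |
| 6 | PNP Loss | R@1 | 81.1 | | Unverified |
| 7 | Cross-Batch Memory | R@1 | 80.6 | | Unverified |
| 8 | Smooth-AP | R@1 | 80.1 | | Unverified |
| 9 | NormSoftmax2048 (ResNet-50) | R@1 | 79.5 | | Unverified |
| 10 | EPSHN512 | R@1 | 78.3 | | Unverified |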
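
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternVL-G-FT | R@1 | 85.9 | | Unverified |
| 2 | InternVL-C-FT | R@1 | 85.2 | | Unverified |
| 3 | R2D2 (ViT-L/14) | R@1 | 84.4 | | Unverified |
| 4 | CN-CLIP (ViT-L/14@336px) | R@1 | 84.4 | | Unverified |
| 5 | CN-CLIP (ViT-H/14) | R@1 | 83.8 | | Unverified |
| 6 | CN-CLIP (ViT-L/14) | R@1 | 82.7 | | Unverified |
| 7 | CN-CLIP (ViT-B/16) | R@1 | 79.1 | | Unverified |
| 8 | R2D2 (ViT-B) | R@1 | 78.3 | | Unverified |
| 9 | Wukong (ViT-L/14) | R@1 | 77.4 | | Unverified |
| 10 | Wukong (ViT-B/32) | R@1 | 67.6 | | Unverified |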
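
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Offline Diffusion | MAP | 96.2 | | Unverified |
| 2 | CNN+IME layer | MAP | 92 | | Unverified |
| 3 | DELF+FT+ATT+DIR+QE | MAP | 90 | | Unverified |
| 4 | DIR+QE* | MAP | 89 | | Unverified |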
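
The leaderboards above report mAP, R@1 (Recall@1), and averaged recall scores such as (Recall@10+Recall@50)/2. The sketch below shows one common way to compute these metrics from ranked result lists. It is an illustration under stated assumptions, not the evaluation protocol of any benchmark listed here: the function names and the toy queries are hypothetical, Recall@K is taken to mean "at least one relevant item in the top K", and individual benchmarks may define relevance, ignored items, or subsets differently.

```python
def recall_at_k(ranked_ids, relevant_ids, k):
    """1.0 if any relevant item appears in the top k results, else 0.0."""
    return 1.0 if any(i in relevant_ids for i in ranked_ids[:k]) else 0.0


def average_precision(ranked_ids, relevant_ids):
    """Mean of precision@rank taken at each rank where a relevant item occurs."""
    if not relevant_ids:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, image_id in enumerate(ranked_ids, start=1):
        if image_id in relevant_ids:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant_ids)


def mean(values):
    return sum(values) / len(values)


# Two toy queries: (ranked result ids, set of relevant ids). Hypothetical data.
queries = [
    (["a", "b", "c", "d"], {"a", "c"}),
    (["x", "y", "z", "w"], {"z"}),
]

map_score = mean([average_precision(r, rel) for r, rel in queries])
r_at_1 = mean([recall_at_k(r, rel, 1) for r, rel in queries])
r_at_3 = mean([recall_at_k(r, rel, 3) for r, rel in queries])
# Averaged recall in the style of (Recall@10+Recall@50)/2, using small K for the toy data.
avg_recall = (r_at_1 + r_at_3) / 2

print(f"mAP={map_score:.4f} R@1={r_at_1:.2f} R@3={r_at_3:.2f} avg={avg_recall:.2f}")
```

The scores here are fractions in [0, 1]; the values in the tables appear to be the same quantities expressed as percentages.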