SOTAVerified

Image Retrieval

Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.

Extending CLIP for Category-to-image Retrieval in E-commerce

( Image credit: DELF )

Papers

Showing 751800 of 2239 papers

TitleStatusHype
Vision-Language Models Learn Super Images for Efficient Partially Relevant Video Retrieval0
HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual AssistanceCode0
Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce0
Reinforcement Learning from Diffusion Feedback: Q* for Image Search0
Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated ImagesCode0
Medical Image Retrieval Using Pretrained Embeddings0
Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image RetrievalCode0
From Categories to Classifiers: Name-Only Continual Learning by Exploring the Web0
Lesion Search with Self-supervised Learning0
Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image RetrievalCode0
Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval0
Training CLIP models on Data from Scientific PapersCode0
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing UnderstandingCode0
Patch-Wise Self-Supervised Visual Representation Learning: A Fine-Grained ApproachCode0
Semantic-Aware Adversarial Training for Reliable Deep Hashing RetrievalCode0
Large Language Models and Multimodal Retrieval for Visual Word Sense DisambiguationCode0
Representation Learning via Consistent Assignment of Views over Random PartitionsCode0
Evaluating the Fairness of Discriminative Foundation Models in Computer VisionCode0
Brain decoding: toward real-time reconstruction of visual perception0
Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification0
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge0
Pairwise Similarity Learning is SimPLECode0
Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning0
Topological RANSAC for instance verification and retrieval without fine-tuning0
Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets0
Sub-token ViT Embedding via Stochastic Resonance TransformersCode0
CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis0
Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images0
NEUCORE: Neural Concept Reasoning for Composed Image Retrieval0
Dark Side Augmentation: Generating Diverse Night Examples for Metric LearningCode0
Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features0
Resolving References in Visually-Grounded Dialogue via Text GenerationCode0
Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal AnalysisCode0
Decompose Semantic Shifts for Composed Image Retrieval0
Active Learning for Fine-Grained Sketch-Based Image Retrieval0
RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline0
GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning0
Dual Relation Alignment for Composed Image Retrieval0
Deep supervised hashing for fast retrieval of radio image cubes0
Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval0
Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics0
Learning Efficient Representations for Image-Based Patent Retrieval0
Towards Food Image Retrieval via Generalization-oriented Sampling and Loss Function DesignCode0
Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval0
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training0
Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval0
FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory0
FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo EmbeddingsCode0
Show:102550
← PrevPage 16 of 45Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SuperGlobalmAP80.2Unverified
2AMESmAP80Unverified
3Hypergraph propagation+community selectionmAP73Unverified
4TokenmAP66.57Unverified
5DELG+ α QE reranking+ RRT rerankingmAP64Unverified
6FIRemAP61.2Unverified
7HOWmAP56.9Unverified
8ResNet101+ArcFace GLDv2-train-cleanmAP51.6Unverified
9DELF–HQE+SPmAP50.3Unverified
10HesAff–rSIFT–HQE+SPmAP49.7Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP90.7Unverified
2Hypergraph propagation+Community selectionmAP88.4Unverified
3TokenmAP82.28Unverified
4FIRemAP81.8Unverified
5DELG+ α QE reranking + RRT rerankingmAP80.4Unverified
6HOWmAP79.4Unverified
7ResNet101+ArcFace GLDv2-train-cleanmAP74.2Unverified
8DELF–HQE+SPmAP73.4Unverified
9HesAff–rSIFT–HQE+SPmAP71.3Unverified
10DELF–ASMK*+SPmAP67.8Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP89.7Unverified
2SuperGlobalmAP86.7Unverified
3Hypergraph propagationmAP83.3Unverified
4TokenmAP78.56Unverified
5DELG+ α QE reranking + RRT rerankingmAP77.7Unverified
6ResNet101+ArcFace GLDv2-train-cleanmAP70.3Unverified
7FIRemAP70Unverified
8DELF–HQE+SPmAP69.3Unverified
9HOWmAP62.4Unverified
10R–R-MACmAP59.4Unverified
#ModelMetricClaimedVerifiedStatus
1AMESmAP94.9Unverified
2Hypergraph propagationmAP92.6Unverified
3TokenmAP89.34Unverified
4DELG+ α QE reranking + RRT rerankingmAP88.5Unverified
5FIRemAP85.3Unverified
6ResNet101+ArcFace GLDv2-train-cleanmAP84.9Unverified
7DELF–HQE+SPmAP84Unverified
8HOWmAP81.6Unverified
9R–R-MACmAP78.9Unverified
10R–GeMmAP77.2Unverified
#ModelMetricClaimedVerifiedStatus
1Swin-T (MosaiCLIP, CC-12M)Recall@1 (HN-Atom, UC)44.5Unverified
2RN-50 (MosaiCLIP, CC-12M)Recall@1 (HN-Atom, UC)44.4Unverified
3MosaiCLIP (YFCC-FT)Recall@1 (HN-Atom, UC)41.5Unverified
4RN-50 (NegCLIP, CC-12M)Recall@1 (HN-Atom, UC)41.4Unverified
5MosaiCLIP (CC-FT)Recall@1 (HN-Atom, UC)40.9Unverified
6Swin-T (NegCLIP, CC-12M)Recall@1 (HN-Atom, UC)39.6Unverified
7CLIP (YFCC-FT)Recall@1 (HN-Atom, UC)39.5Unverified
8ViT-L-14 (LAION400M)Recall@1 (HN-Atom + HN-Comp, SC)39.44Unverified
9NegCLIP (YFCC-FT)Recall@1 (HN-Atom, UC)39Unverified
10CLIP-FT (YFCC-FT)Recall@1 (HN-Atom, UC)38.3Unverified
#ModelMetricClaimedVerifiedStatus
1DQU-CIR(Recall@10+Recall@50)/271.77Unverified
2TMCIR(Recall@10+Recall@50)/266.56Unverified
3SPN4CIR (SPRC)(Recall@10+Recall@50)/266.41Unverified
4SPRC(Recall@10+Recall@50)/264.85Unverified
5Candidate Set Re-ranking(Recall@10+Recall@50)/262.15Unverified
6RUTIR (BLIP B/16)(Recall@10+Recall@50)/261.32Unverified
7CASE(Recall@10+Recall@50)/259.73Unverified
8CaLa(Recall@10+Recall@50)/257.96Unverified
9BLIP4CIR+Bi(Recall@10+Recall@50)/255.4Unverified
10CLIP4Cir (v3)(Recall@10+Recall@50)/255.36Unverified
#ModelMetricClaimedVerifiedStatus
1X-VLM (base)R@186.9Unverified
2RCARR@162.6Unverified
3SGRAFR@158.5Unverified
4LGSGMR@157.4Unverified
5VisualSpartaR@157.4Unverified
6TERAN MrSwR@156.5Unverified
7TERAN Symm.R@155.7Unverified
8VSRNR@154.7Unverified
9CAMPR@151.5Unverified
10SCAN i-tR@144Unverified
#ModelMetricClaimedVerifiedStatus
1TMCIR(Recall@5+Recall_subset@1)/283.46Unverified
2SPN4CIR (SPRC)(Recall@5+Recall_subset@1)/282.69Unverified
3SPRC2(Recall@5+Recall_subset@1)/282.66Unverified
4SPRC(Recall@5+Recall_subset@1)/281.39Unverified
5Candidate Set Re-ranking(Recall@5+Recall_subset@1)/280.9Unverified
6CaLa(Recall@5+Recall_subset@1)/278.74Unverified
7CASE (Pre-trained on LaSCo.Ca)(Recall@5+Recall_subset@1)/278.25Unverified
8CASE(Recall@5+Recall_subset@1)/277.5Unverified
9VISTA (base)(Recall@5+Recall_subset@1)/275.9Unverified
10MMRet-MLLM(Recall@5+Recall_subset@1)/275.7Unverified
#ModelMetricClaimedVerifiedStatus
1Unicom+ViT-L@336pxR@191.2Unverified
2ROADMAP (DeiT-B)R@186Unverified
3CGD (SG/GS)R@184.2Unverified
4ROADMAP (ResNet-50)R@183.1Unverified
5ProxyNCA++R@181.4Unverified
6PNP LossR@181.1Unverified
7Cross-Batch MemoryR@180.6Unverified
8Smooth-APR@180.1Unverified
9NormSoftmax2048 (ResNet-50)R@179.5Unverified
10EPSHN512R@178.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternVL-G-FTR@185.9Unverified
2InternVL-C-FTR@185.2Unverified
3CN-CLIP (ViT-L/14@336px)R@184.4Unverified
4R2D2 (ViT-L/14)R@184.4Unverified
5CN-CLIP (ViT-H/14)R@183.8Unverified
6CN-CLIP (ViT-L/14)R@182.7Unverified
7CN-CLIP (ViT-B/16)R@179.1Unverified
8R2D2 (ViT-B)R@178.3Unverified
9Wukong (ViT-L/14)R@177.4Unverified
10Wukong (ViT-B/32)R@167.6Unverified
#ModelMetricClaimedVerifiedStatus
1Offline DiffusionMAP96.2Unverified
2CNN+IME layerMAP92Unverified
3DELF+FT+ATT+DIR+QEMAP90Unverified
4DIR+QE*MAP89Unverified