Image Retrieval
Image Retrieval is a fundamental and long-standing computer vision task that involves finding images similar to a given query from a large database. It is often considered a form of fine-grained, instance-level classification. The task is integral to image recognition alongside classification and cross-modal retrieval. By leveraging visual similarity and other criteria, image retrieval enables users to efficiently discover relevant images, making it a crucial tool in applications such as search and recommendation.
Extending CLIP for Category-to-image Retrieval in E-commerce
( Image credit: DELF )
Papers
Showing 1–10 of 2239 papers
All datasetsROxford (Hard)ROxford (Medium)RParis (Hard)RParis (Medium)CREPE (Compositional REPresentation Evaluation)Fashion IQFlickr30K 1K testCIRRSOPFlickr30k-CNOxf5kFlickr30k
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BLIP-2 ViT-G (zero-shot, 1K test set) | Recall@10 | 98.9 | — | Unverified |
| 2 | BLIP-2 ViT-L (zero-shot, 1K test set) | Recall@10 | 98.9 | — | Unverified |
| 3 | HADA | Recall@10 | 98.02 | — | Unverified |
| 4 | MaMMUT (ours) | Recall@10 | 98 | — | Unverified |
| 5 | ALBEF | Recall@10 | 97.72 | — | Unverified |
| 6 | UNITER | Recall@10 | 96.76 | — | Unverified |
| 7 | ALBEF | Recall@1 | 92.6 | — | Unverified |
| 8 | LGSGM | Recall@10 | 90.2 | — | Unverified |
| 9 | GSMN | Recall@10 | 89 | — | Unverified |
| 10 | VisualSparta | Recall@10 | 88.1 | — | Unverified |