Image Classification
Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
Papers
Showing 1–10 of 10419 papers
All datasetsImageNetCIFAR-10CIFAR-100STL-10ObjectNetMNISTSVHNiNaturalist 2018ImageNet ReaLFlowers-102Clothing1Mmini WebVision 1.0
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | OmniVec2 | Top-1 Accuracy | 94.6 | — | Unverified |
| 2 | OmniVec | Top-1 Accuracy | 93.8 | — | Unverified |
| 3 | InternImage-H | Top-1 Accuracy | 92.6 | — | Unverified |
| 4 | MAWS (ViT-2B) | Top-1 Accuracy | 91.3 | — | Unverified |
| 5 | MetaFormer (MetaFormer-2,384,extra_info) | Top-1 Accuracy | 88.7 | — | Unverified |
| 6 | Hiera-H (448px) | Top-1 Accuracy | 87.3 | — | Unverified |
| 7 | MAE (ViT-H, 448) | Top-1 Accuracy | 86.8 | — | Unverified |
| 8 | SWAG (ViT H/14) | Top-1 Accuracy | 86 | — | Unverified |
| 9 | SEER (RegNet10B - finetuned - 384px) | Top-1 Accuracy | 84.7 | — | Unverified |
| 10 | MetaFormer (MetaFormer-2,384) | Top-1 Accuracy | 84.3 | — | Unverified |