Image Classification
Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
Papers
Showing 1–10 of 10419 papers
All datasetsImageNetCIFAR-10CIFAR-100STL-10ObjectNetMNISTSVHNiNaturalist 2018ImageNet ReaLFlowers-102Clothing1Mmini WebVision 1.0
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Baseline (ViT-G/14) | Accuracy | 91.78 | — | Unverified |
| 2 | ViTAE-H (MAE, 512) | Accuracy | 91.2 | — | Unverified |
| 3 | Model soups (ViT-G/14) | Accuracy | 91.2 | — | Unverified |
| 4 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Accuracy | 91.12 | — | Unverified |
| 5 | MAWS (ViT-6.5B) | Accuracy | 91.1 | — | Unverified |
| 6 | TokenLearner L/8 (24+11) | Accuracy | 91.05 | — | Unverified |
| 7 | Model soups (BASIC-L) | Accuracy | 91.03 | — | Unverified |
| 8 | Meta Pseudo Labels (EfficientNet-L2) | Accuracy | 91.02 | — | Unverified |
| 9 | FixEfficientNet-L2 | Accuracy | 90.9 | — | Unverified |
| 10 | MAWS (ViT-2B) | Accuracy | 90.9 | — | Unverified |