SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which involves classifying and localizing multiple objects within an image, image classification typically deals with single-object images. When the classification becomes very fine-grained or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1426-1450 of 10420 papers

Title | Status | Hype
CycleMLP: A MLP-like Architecture for Dense Prediction | Code | 1
Parametric Scattering Networks | Code | 1
Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition | Code | 1
Just Train Twice: Improving Group Robustness without Training Group Information | Code | 1
OODformer: Out-Of-Distribution Detection Transformer | Code | 1
Non-binary deep transfer learning for image classification | Code | 1
Rectifying the Shortcut Learning of Background for Few-Shot Learning | Code | 1
Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks | Code | 1
A Fuzzy Rank-based Ensemble of CNN Models for Classification of Cervical Cytology | Code | 1
Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion | Code | 1
Visual Parser: Representing Part-whole Hierarchies with Transformers | Code | 1
Automated Learning Rate Scheduler for Large-batch Training | Code | 1
Semi-Supervised Learning with Multi-Head Co-Training | Code | 1
GLiT: Neural Architecture Search for Global and Local Image Transformer | Code | 1
SpectralFormer: Rethinking Hyperspectral Image Classification with Transformers | Code | 1
Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification | Code | 1
Vision Xformers: Efficient Attention for Image Classification | Code | 1
Learning Debiased Representation via Disentangled Feature Augmentation | Code | 1
Hybrid Supervision Learning for Pathology Whole Slide Image Classification | Code | 1
On Bridging Generic and Personalized Federated Learning for Image Classification | Code | 1
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows | Code | 1
SIMILAR: Submodular Information Measures Based Active Learning In Realistic Scenarios | Code | 1
Global Filter Networks for Image Classification | Code | 1
Focal Self-attention for Local-Global Interactions in Vision Transformers | Code | 1
Understanding and Improving Early Stopping for Learning with Noisy Labels | Code | 1
Page 58 of 417

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
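
The "Top 1 Accuracy" figures above are percentages. As a rough illustration only, and assuming the metric follows its usual definition (the share of images whose single highest-scoring class matches the true label), it could be computed as in the sketch below. The function name top1_accuracy and the toy scores and labels are hypothetical; this page does not state the evaluation protocol behind the claimed numbers.

```python
def top1_accuracy(scores, labels):
    """Return the percentage of samples whose highest-scoring class
    equals the true label (assumed definition of Top 1 Accuracy)."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the predicted class.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes.
scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
labels = [1, 0, 2, 2]  # hypothetical ground-truth class indices

print(top1_accuracy(scores, labels))  # 75.0 (3 of 4 predictions correct)
```

Under this definition, a claimed value of 91 would mean that 91 of every 100 evaluation images received the correct label as the model's first choice.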