SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 80518075 of 10420 papers

TitleStatusHype
Deep Active Learning in the Open World0
Improved Mix-up with KL-Entropy for Learning From Noisy Labels0
Does Deep Active Learning Work in the Wild?0
Improved Image Classification with Token Fusion0
Improved Image Classification with Manifold Neural Networks0
Rethinking the Zigzag Flattening for Image Reading0
Rethinking Two Consensuses of the Transferability in Deep Learning0
Deep Active Ensemble Sampling For Image Classification0
Rethinking VLMs and LLMs for Image Classification0
Automatic Discovery and Optimization of Parts for Image Classification0
Automatic Detection and Image Recognition of Precision Agriculture for Citrus Diseases0
Improved Fine-Tuning by Better Leveraging Pre-Training Data0
Improved Few-Shot Visual Classification0
Retrain or not retrain? -- efficient pruning methods of deep CNN networks0
Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks0
Retrieval-enriched zero-shot image classification in low-resource domains0
Improved Few-Shot Image Classification Through Multiple-Choice Questions0
Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond0
Improved EATFormer: A Vision Transformer for Medical Image Classification0
Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization0
Reveal of Vision Transformers Robustness against Adversarial Attacks0
Improved Conformer-based End-to-End Speech Recognition Using Neural Architecture Search0
Reversed Active Learning based Atrous DenseNet for Pathological Image Classification0
Reverse engineering adversarial attacks with fingerprints from adversarial examples0
All-You-Can-Fit 8-Bit Flexible Floating-Point Format for Accurate and Memory-Efficient Inference of Deep Neural Networks0
Show:102550
← PrevPage 323 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified