SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 55265550 of 10420 papers

TitleStatusHype
Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images0
Attention Feature Fusion Network via Knowledge Propagation for Automated Respiratory Sound Classification0
Hashing on Nonlinear Manifolds0
Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes0
Label Consistent Transform Learning for Hyperspectral Image Classification0
Label Denoising through Cross-Model Agreement0
Label-Efficient Group Robustness via Out-of-Distribution Concept Curation0
Label-efficient Single Photon Images Classification via Active Learning0
DeepRepair: Style-Guided Repairing for DNNs in the Real-world Operational Environment0
Attention-free Spikformer: Mixing Spike Sequences with Simple Linear Transforms0
HASeparator: Hyperplane-Assisted Softmax0
Label Embedding with Partial Heterogeneous Contexts0
Label Geometry Aware Discriminator for Conditional Generative Networks0
HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images0
Cross Domain Ensemble Distillation for Domain Generalization0
Bayesian Optimization for Hyperparameters Tuning in Neural Networks0
Harvesting Mid-level Visual Concepts from Large-Scale Internet Images0
Cross-domain Deep Feature Combination for Bird Species Classification with Audio-visual Data0
Learning transformer-based heterogeneously salient graph representation for multimodal remote sensing image classification0
Learning Wake-Sleep Recurrent Attention Models0
Deep Residual Network based food recognition for enhanced Augmented Reality application0
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization0
Laconic Deep Learning Computing0
Laconic Image Classification: Human vs. Machine Performance0
Attention Enables Zero Approximation Error0
Show:102550
← PrevPage 222 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified