SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 80018050 of 10420 papers

TitleStatusHype
Enhancing Ship Classification in Optical Satellite Imagery: Integrating Convolutional Block Attention Module with ResNet for Improved Performance0
ResNetX: a more disordered and deeper network architecture0
Resolution-Based Distillation for Efficient Histology Image Classification0
Feature CAM: Interpretable AI in Image Classification0
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments0
Resource Efficient Neural Networks Using Hessian Based Pruning0
Comparison of Methods Generalizing Max- and Average-Pooling0
Responsibility: An Example-based Explainable AI approach via Training Process Inspection0
Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding0
Activation Atlas0
Restricted Boltzmann Machines for galaxy morphology classification with a quantum annealer0
RE-Tagger: A light-weight Real-Estate Image Classifier0
RetailKLIP : Finetuning OpenCLIP backbone using metric learning on a single GPU for Zero-shot retail product image classification0
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation0
Rethinking Two Consensuses of the Transferability in Deep Learning0
Rethinking and Improving Relative Position Encoding for Vision Transformer0
Retrain or not retrain? -- efficient pruning methods of deep CNN networks0
Few-Shot Non-Parametric Learning with Deep Latent Variable Model0
Few-shot medical image classification with simple shape and texture text descriptors using vision-language models0
Rethinking Client Reweighting for Selfish Federated Learning0
Concept Bottleneck with Visual Concept Filtering for Explainable Medical Image Classification0
Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification0
Few-Shot Learning of Compact Models via Task-Specific Meta Distillation0
AF-KAN: Activation Function-Based Kolmogorov-Arnold Networks for Efficient Representation Learning0
Feature Kernel Distillation0
Activate and Reject: Towards Safe Domain Generalization under Category Shift0
Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST0
Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches0
Rethinking Hard-Parameter Sharing in Multi-Domain Learning0
Rethinking Image Editing Detection in the Era of Generative AI Revolution0
Computer Vision Pipeline for Automated Antarctic Krill Analysis0
Few-shot Learning for Domain-specific Fine-grained Image Classification0
Feature Losses for Adversarial Robustness0
A Semantics-Guided Class Imbalance Learning Model for Zero-Shot Classification0
A Multi-stage Transfer Learning Framework for Diabetic Retinopathy Grading on Small Data0
Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
Few-shot Image Classification with Multi-Facet Prototypes0
Few-Shot Image Classification via Contrastive Self-Supervised Learning0
Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior0
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification0
Rethinking Persistent Homology for Visual Recognition0
Rethinking Pseudo Labels for Semi-Supervised Object Detection0
Rethinking Query, Key, and Value Embedding in Vision Transformer under Tiny Model Constraints0
Compute Less to Get More: Using ORC to Improve Sparse Filtering0
Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data0
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision0
Few-shot Image Classification based on Gradual Machine Learning0
Rethinking Soft Label in Label Distribution Learning Perspective0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
Show:102550
← PrevPage 161 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified