SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 80268050 of 10420 papers

TitleStatusHype
Activate and Reject: Towards Safe Domain Generalization under Category Shift0
Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST0
Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches0
Rethinking Hard-Parameter Sharing in Multi-Domain Learning0
Rethinking Image Editing Detection in the Era of Generative AI Revolution0
Computer Vision Pipeline for Automated Antarctic Krill Analysis0
Few-shot Learning for Domain-specific Fine-grained Image Classification0
Feature Losses for Adversarial Robustness0
A Semantics-Guided Class Imbalance Learning Model for Zero-Shot Classification0
A Multi-stage Transfer Learning Framework for Diabetic Retinopathy Grading on Small Data0
Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
Few-shot Image Classification with Multi-Facet Prototypes0
Few-Shot Image Classification via Contrastive Self-Supervised Learning0
Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior0
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification0
Rethinking Persistent Homology for Visual Recognition0
Rethinking Pseudo Labels for Semi-Supervised Object Detection0
Rethinking Query, Key, and Value Embedding in Vision Transformer under Tiny Model Constraints0
Compute Less to Get More: Using ORC to Improve Sparse Filtering0
Rethinking Semi-Supervised Federated Learning: How to co-train fully-labeled and fully-unlabeled client imaging data0
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision0
Few-shot Image Classification based on Gradual Machine Learning0
Rethinking Soft Label in Label Distribution Learning Perspective0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
Show:102550
← PrevPage 322 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified