SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
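
As a rough illustration of the "one label for the whole image" idea described above, the sketch below picks the single highest-scoring class for an image. The function name `classify`, the label names, and the scores are hypothetical and used only for illustration; a real model would produce the per-class scores.

```python
from typing import Sequence


def classify(scores: Sequence[float], labels: Sequence[str]) -> str:
    """Return the one label with the highest score for a single image.

    `scores` holds one (hypothetical) model score per class, in the same
    order as `labels`.
    """
    if not labels or len(scores) != len(labels):
        raise ValueError("scores and labels must be non-empty and equal in length")
    # Index of the largest score; ties resolve to the first such index.
    best = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best]


# Illustrative values only: the whole image gets exactly one label.
label = classify([0.1, 0.7, 0.2], ["cat", "dog", "bird"])
print(label)
```

An object detector, by contrast, would return several (label, bounding box) pairs per image rather than a single label.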

Papers

Showing 4701-4750 of 10420 papers

Title | Status | Hype
Self Pre-training with Masked Autoencoders for Medical Image Classification and Segmentation | Code | 1
Multiscale Convolutional Transformer with Center Mask Pretraining for Hyperspectral Image Classification | - | 0
Renyi Fair Information Bottleneck for Image Classification | - | 0
Uni4Eye: Unified 2D and 3D Self-supervised Pre-training via Masked Image Modeling Transformer for Ophthalmic Image Classification | - | 0
Active Self-Semi-Supervised Learning for Few Labeled Samples | - | 0
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Code | 2
Dynamic Group Transformer: A General Vision Transformer Backbone with Dynamic Group Attention | - | 0
Graph Attention Transformer Network for Multi-Label Image Classification | Code | 1
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification | Code | 1
Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective | Code | 0
Selective-Supervised Contrastive Learning with Noisy Labels | Code | 1
Art-Attack: Black-Box Adversarial Attack via Evolutionary Art | - | 0
Explaining Classifiers by Constructing Familiar Concepts | Code | 0
Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information | Code | 1
Dynamic ConvNets on Tiny Devices via Nested Sparsity | - | 0
WaveMix: Resource-efficient Token Mixing for Images | Code | 1
Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations | - | 0
Fidelity of Interpretability Methods and Perturbation Artifacts in Neural Networks | - | 0
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition | Code | 2
Dynamic Backdoors with Global Average Pooling | - | 0
FairPrune: Achieving Fairness Through Pruning for Dermatological Disease Diagnosis | - | 0
Class-Aware Contrastive Semi-Supervised Learning | Code | 1
DiT: Self-supervised Pre-training for Document Image Transformer | Code | 1
Semi-supervised Learning using Robust Loss | Code | 0
Ensembles of Vision Transformers as a New Paradigm for Automated Classification in Ecology | Code | 0
Random Quantum Neural Networks (RQNN) for Noisy Image Recognition | Code | 1
AdaFamily: A family of Adam-like adaptive gradient methods | - | 0
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification | Code | 1
MIAShield: Defending Membership Inference Attacks via Preemptive Exclusion of Members | - | 0
Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions | - | 0
Continuous-Time Meta-Learning with Forward Mode Differentiation | - | 0
ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks | Code | 0
Visual Feature Encoding for GNNs on Road Networks | - | 0
Semi-supervised Deep Learning for Image Classification with Distribution Mismatch: A Survey | - | 0
Amortized Proximal Optimization | - | 0
Evaluating the Adversarial Robustness of Adaptive Test-time Defenses | Code | 1
ESW Edge-Weights : Ensemble Stochastic Watershed Edge-Weights for Hyperspectral Image Classification | - | 0
Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond | Code | 1
Synergistic Network Learning and Label Correction for Noise-robust Image Classification | - | 0
Neuro-Inspired Deep Neural Networks with Sparse, Strong Activations | Code | 0
Relational Surrogate Loss Learning | Code | 1
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning | Code | 3
Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms | - | 0
TeachAugment: Data Augmentation Optimization Using Teacher Knowledge | Code | 1
Monogenic Wavelet Scattering Network for Texture Image Classification | - | 0
Goal-Oriented Communication for Edge Learning based on the Information Bottleneck | - | 0
RRL:Regional Rotation Layer in Convolutional Neural Networks | - | 0
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design | - | 0
New Benchmark for Household Garbage Image Recognition | - | 0
Self-Training: A Survey | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
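
Top 1 Accuracy, the metric reported above, is conventionally the percentage of test images whose single highest-scoring predicted label matches the ground-truth label. The sketch below shows that computation under this assumption; the function name `top1_accuracy` and the sample labels are hypothetical and do not come from any of the listed models.

```python
from typing import Sequence


def top1_accuracy(predicted: Sequence[str], truth: Sequence[str]) -> float:
    """Percentage of images whose top predicted label equals the true label."""
    if not truth or len(predicted) != len(truth):
        raise ValueError("predicted and truth must be non-empty and equal in length")
    # Count exact matches between each prediction and its ground-truth label.
    correct = sum(p == t for p, t in zip(predicted, truth))
    return 100.0 * correct / len(truth)


# Illustrative labels only: 3 of the 4 predictions match the ground truth.
predicted = ["cat", "dog", "bird", "dog"]
truth = ["cat", "dog", "dog", "dog"]
print(top1_accuracy(predicted, truth))
```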