SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 501550 of 10419 papers

TitleStatusHype
Designing Network Design SpacesCode1
DetCo: Unsupervised Contrastive Learning for Object DetectionCode1
Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image ClassificationCode1
Counterfactual Explanations for Medical Image Classification and Regression using Diffusion AutoencoderCode1
ACPL: Anti-curriculum Pseudo-labelling for Semi-supervised Medical Image ClassificationCode1
Combating Label Noise in Deep Learning Using AbstentionCode1
DGMIL: Distribution Guided Multiple Instance Learning for Whole Slide Image ClassificationCode1
3D Human Pose Estimation with Spatial and Temporal TransformersCode1
DHP: Differentiable Meta Pruning via HyperNetworksCode1
Diagnose Like a Pathologist: Transformer-Enabled Hierarchical Attention-Guided Multiple Instance Learning for Whole Slide Image ClassificationCode1
Combating noisy labels by agreement: A joint training method with co-regularizationCode1
Communication-Efficient and Privacy-Preserving Feature-based Federated Transfer LearningCode1
CODE-CL: Conceptor-Based Gradient Projection for Deep Continual LearningCode1
Co-Correcting: Noise-tolerant Medical Image Classification via mutual Label CorrectionCode1
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAsCode1
AlphaNet: Improved Training of Supernets with Alpha-DivergenceCode1
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image ClassificationCode1
CoCa: Contrastive Captioners are Image-Text Foundation ModelsCode1
Collaborative Transformers for Grounded Situation RecognitionCode1
Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image ClassificationCode1
CNN Filter DB: An Empirical Investigation of Trained Convolutional FiltersCode1
CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep LearningCode1
Co^2L: Contrastive Continual LearningCode1
Layer-adaptive sparsity for the Magnitude-based PruningCode1
ClusterFormer: Clustering As A Universal Visual LearnerCode1
Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image ClassificationCode1
Co2L: Contrastive Continual LearningCode1
CLR: Channel-wise Lightweight Reprogramming for Continual LearningCode1
Domain Prompt Learning for Efficiently Adapting CLIP to Unseen DomainsCode1
A deep active learning system for species identification and counting in camera trap imagesCode1
Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNetsCode1
Clusterability as an Alternative to Anchor Points When Learning with Noisy LabelsCode1
CoAtNet: Marrying Convolution and Attention for All Data SizesCode1
Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge DistillationCode1
Curriculum Temperature for Knowledge DistillationCode1
Differentiable Top-k Classification LearningCode1
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image CollectionsCode1
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed DataCode1
CLIP4IDC: CLIP for Image Difference CaptioningCode1
Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image ClassificationCode1
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object NavigationCode1
Understanding the Role of the Projector in Knowledge DistillationCode1
Addressing Failure Prediction by Learning Model ConfidenceCode1
Clean-Label Backdoor Attacks on Video Recognition ModelsCode1
Addressing Failure Detection by Learning Model ConfidenceCode1
A Closer Look at Self-Supervised Lightweight Vision TransformersCode1
CLCNet: Rethinking of Ensemble Modeling with Classification Confidence NetworkCode1
CleanNet: Transfer Learning for Scalable Image Classifier Training with Label NoiseCode1
Class Distance Weighted Cross-Entropy Loss for Ulcerative Colitis Severity EstimationCode1
Class-Difficulty Based Methods for Long-Tailed Visual RecognitionCode1
Show:102550
← PrevPage 11 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified