SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 13511400 of 10419 papers

TitleStatusHype
A survey on attention mechanisms for medical applications: are we moving towards better algorithms?Code1
Adaptive Mask Sampling and Manifold to Euclidean Subspace Learning with Distance Covariance Representation for Hyperspectral Image ClassificationCode1
DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character RecognitionCode1
CvT: Introducing Convolutions to Vision TransformersCode1
Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural NetworksCode1
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable FeaturesCode1
CycleMLP: A MLP-like Architecture for Dense PredictionCode1
batchboost: regularization for stabilizing training with resistance to underfitting & overfittingCode1
Curriculum By SmoothingCode1
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised LearningCode1
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture SearchCode1
Focal and Global Knowledge Distillation for DetectorsCode1
BSNet: Bi-Similarity Network for Few-shot Fine-grained Image ClassificationCode1
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate ShiftCode1
FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image ClassificationCode1
Focus Longer to See Better:Recursively Refined Attention for Fine-Grained Image ClassificationCode1
Forward Learning of Graph Neural NetworksCode1
Foundation Model Assisted Weakly Supervised Semantic SegmentationCode1
Fourier Image TransformerCode1
FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image ClassificationCode1
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge ComputingCode1
Bayesian continual learning and forgetting in neural networksCode1
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic SparsityCode1
Frequency Attention for Knowledge DistillationCode1
CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual RepresentationsCode1
No Routing Needed Between CapsulesCode1
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped WindowsCode1
Function-Consistent Feature DistillationCode1
Curriculum Temperature for Knowledge DistillationCode1
Bayesian Model-Agnostic Meta-LearningCode1
Bayesian Neural Network Priors RevisitedCode1
GAN-based Priors for Quantifying UncertaintyCode1
Achieving Fairness Through Channel Pruning for Dermatological Disease DiagnosisCode1
Gaussian RAM: Lightweight Image Classification via Stochastic Retina-Inspired Glimpse and Reinforcement LearningCode1
Bayesian Optimization Meets Self-DistillationCode1
General E(2)-Equivariant Steerable CNNsCode1
A Survey of Classical And Quantum Sequence ModelsCode1
Generalized Jensen-Shannon Divergence Loss for Learning with Noisy LabelsCode1
Cross-modal Adversarial ReprogrammingCode1
General Multi-label Image Classification with TransformersCode1
Active Learning for Convolutional Neural Networks: A Core-Set ApproachCode1
Cross-modulated Few-shot Image Generation for Colorectal Tissue ClassificationCode1
Fcaformer: Forward Cross Attention in Hybrid Vision TransformerCode1
Generative Hierarchical Features from Synthesizing ImagesCode1
A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled SamplesCode1
Cross-Iteration Batch NormalizationCode1
Generic Neural Architecture Search via RegressionCode1
Generic-to-Specific Distillation of Masked AutoencodersCode1
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object DetectionCode1
AutoAssist: A Framework to Accelerate Training of Deep Neural NetworksCode1
Show:102550
← PrevPage 28 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified