SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 35013550 of 10419 papers

TitleStatusHype
Shape-Aware Fine-Grained Classification of Erythroid CellsCode1
Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs0
Attribute-Guided Multi-Level Attention Network for Fine-Grained Fashion RetrievalCode0
Langevin algorithms for very deep Neural Networks with application to image classificationCode0
Saliency-Augmented Memory Completion for Continual LearningCode0
LMFLOSS: A Hybrid Loss For Imbalanced Medical Image ClassificationCode0
Hyperspherical Loss-Aware Ternary Quantization0
Image Classification with Small Datasets: Overview and BenchmarkCode1
When are Lemons Purple? The Concept Association Bias of Vision-Language Models0
Understanding and Improving the Role of Projection Head in Self-Supervised Learning0
On Calibrating Semantic Segmentation Models: Analyses and An AlgorithmCode1
Reversible Column NetworksCode2
Class Prototype-based Cleaner for Label Noise LearningCode0
Decision-making and control with diffractive optical networksCode0
MaskingDepth: Masked Consistency Regularization for Semi-supervised Monocular Depth EstimationCode1
Temporal Output Discrepancy for Loss Estimation-based Active Learning0
Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning0
Galaxy Image Classification using Hierarchical Data Learning with Weighted Sampling and Label SmoothingCode0
DDIPNet and DDIPNet+: Discriminant Deep Image Prior Networks for Remote Sensing Image Classification0
Unified Framework for Histopathology Image Augmentation and Classification via Generative Models0
Rethinking Label Smoothing on Multi-hop Question AnsweringCode0
Improving Pre-Trained Weights Through Meta-Heuristics Fine-TuningCode0
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain GeneralizationCode1
A Framework for Generalizing Critical Heat Flux Detection Models Using Unsupervised Image-to-Image Translation0
Scattering-induced entropy boost for highly-compressed optical sensing and encryption0
Test-time Adaptation in the Dynamic World with Compound Domain Knowledge Management0
From Xception to NEXcepTion: New Design Decisions and Neural Architecture SearchCode0
Instance-dependent Label Distribution Estimation for Learning with Label NoiseCode0
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image ClassificationCode0
Convolution-enhanced Evolving Attention NetworksCode1
Bayesian posterior approximation with stochastic ensemblesCode0
Backdoor Attack Detection in Computer Vision by Applying Matrix Factorization on the Weights of Deep Networks0
CLIPPO: Image-and-Language Understanding from Pixels Only0
Learning to Detect Semantic Boundaries with Image-level Class Labels0
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and LanguageCode1
SAIF: Sparse Adversarial and Imperceptible Attack Framework0
Domain Generalization by Learning and Removing Domain-specific FeaturesCode1
Design-time Fashion Popularity Forecasting in VR Environments0
Reproducible scaling laws for contrastive language-image learningCode1
Post-hoc Uncertainty Learning using a Dirichlet Meta-ModelCode1
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group PropagationCode1
Adversarial Attacks and Defences for Skin Cancer Classification0
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion0
Can a face tell us anything about an NBA prospect? -- A Deep Learning approach0
Losses over Labels: Weakly Supervised Learning via Direct Loss ConstructionCode0
Regularized Optimal Transport Layers for Generalized Global Pooling OperationsCode1
Synthetic Image Data for Deep Learning0
Quantum Phase Recognition using Quantum Tensor Networks0
A Neural ODE Interpretation of Transformer Layers0
General Adversarial Defense Against Black-box Attacks via Pixel Level and Feature Level Distribution Alignments0
Show:102550
← PrevPage 71 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified