SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
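
As a rough illustration of the single-label setting described above, the sketch below assigns each image the class with the highest score and then computes Top 1 Accuracy, the metric reported under Benchmark Results. The function names (`predict_label`, `top1_accuracy`) and the toy scores are hypothetical and chosen only for this example; a real model would supply the per-class scores.

```python
from typing import Sequence


def predict_label(scores: Sequence[float]) -> int:
    """Return the index of the highest-scoring class (one label per image)."""
    if not scores:
        raise ValueError("scores must contain at least one class")
    return max(range(len(scores)), key=lambda i: scores[i])


def top1_accuracy(all_scores: Sequence[Sequence[float]],
                  labels: Sequence[int]) -> float:
    """Percentage of images whose top-scoring class equals the true label."""
    if len(all_scores) != len(labels):
        raise ValueError("need exactly one label per image")
    if not labels:
        raise ValueError("need at least one image")
    correct = sum(predict_label(s) == y for s, y in zip(all_scores, labels))
    return 100.0 * correct / len(labels)


# Toy data (assumption): 4 images, 3 classes, made-up scores.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1 (true label is 2, so this is a miss)
]
toy_labels = [1, 0, 2, 2]

print(top1_accuracy(toy_scores, toy_labels))  # 3 of 4 correct -> 75.0
```

Object detection would instead return several (label, box) pairs per image, which is why a single argmax over class scores is enough here.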

Papers

Showing 1701-1750 of 10419 papers

Title | Status | Hype
Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition | Code | 1
Online Training Through Time for Spiking Neural Networks | Code | 1
CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Code | 1
Continual Learning Using a Kernel-Based Method Over Foundation Models | Code | 1
On the Importance of Firth Bias Reduction in Few-Shot Classification | Code | 1
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective | Code | 1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction | Code | 1
CellGAN: Conditional Cervical Cell Synthesis for Augmenting Cytopathological Image Classification | Code | 1
Ontology-guided Semantic Composition for Zero-Shot Learning | Code | 1
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image Classification | Code | 1
Astroformer: More Data Might not be all you need for Classification | Code | 1
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm | Code | 1
Open Set Recognition using Vision Transformer with an Additional Detection Head | Code | 1
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels | Code | 1
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning | Code | 1
Open-World Semi-Supervised Learning | Code | 1
Optimal Representations for Covariate Shift | Code | 1
Deep convolutional tensor network | Code | 1
Deep Fast Vision: Accelerated Deep Transfer Learning Vision Prototyping and Beyond | Code | 1
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks | Code | 1
DeepMIM: Deep Supervision for Masked Image Modeling | Code | 1
P2T: Pyramid Pooling Transformer for Scene Understanding | Code | 1
A Novel Approach for detecting Normal, COVID-19 and Pneumonia patient using only binary classifications from chest CT-Scans | Code | 1
Cervical Cytology Classification Using PCA & GWO Enhanced Deep Features Selection | Code | 1
Parameter-efficient Model Adaptation for Vision Transformers | Code | 1
Deep Transferring Quantization | Code | 1
A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration | Code | 1
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization | Code | 1
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models | Code | 1
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification | Code | 1
Age Estimation Using Expectation of Label Distribution Learning | Code | 1
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Code | 1
A Novel Convolutional Neural Network Architecture with a Continuous Symmetry | Code | 1
PatchCleanser: Certifiably Robust Defense against Adversarial Patches for Any Image Classifier | Code | 1
Deep AutoAugment | Code | 1
Contrastive Deep Supervision | Code | 1
Class Distance Weighted Cross-Entropy Loss for Ulcerative Colitis Severity Estimation | Code | 1
PAD-Net: An Efficient Framework for Dynamic Networks | Code | 1
Pay Attention to MLPs | Code | 1
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases | Code | 1
PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers | Code | 1
PDO-eConvs: Partial Differential Operator Based Equivariant Convolutions | Code | 1
Contrastive Learning of Generalized Game Representations | Code | 1
Peripheral Vision Transformer | Code | 1
CHEX: CHannel EXploration for CNN Model Compression | Code | 1
CheXFusion: Effective Fusion of Multi-View Features using Transformers for Long-Tailed Chest X-Ray Classification | Code | 1
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Code | 1
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Code | 1
LR-Net: A Block-based Convolutional Neural Network for Low-Resolution Image Classification | Code | 1
DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual Tasks | Code | 1
Page 35 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified