SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 33513400 of 10419 papers

TitleStatusHype
FV-MgNet: Fully Connected V-cycle MgNet for Interpretable Time Series Forecasting0
Image-Based Vehicle Classification by Synergizing Features from Supervised and Self-Supervised Learning Paradigms0
Weight Prediction Boosts the Convergence of AdamW0
Structured mutation inspired by evolutionary theory enriches population performance and diversity0
Deep Dependency Networks for Multi-Label Classification0
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and VideoCode4
NASiam: Efficient Representation Learning using Neural Architecture Search for Siamese NetworksCode0
Training with Mixed-Precision Floating-Point Assignments0
Does Deep Active Learning Work in the Wild?0
NP-Match: Towards a New Probabilistic Model for Semi-Supervised LearningCode1
UPop: Unified and Progressive Pruning for Compressing Vision-Language TransformersCode1
Transfer Learning and Class Decomposition for Detecting the Cognitive Decline of Alzheimer Disease0
Rethinking Soft Label in Label Distribution Learning Perspective0
Inference Time Evidences of Adversarial Attacks for Forensic on Transformers0
Reverse engineering adversarial attacks with fingerprints from adversarial examples0
Massively Scaling Heteroscedastic Classifiers0
Lateralized Learning for Multi-Class Visual Classification Tasks0
DAFD: Domain Adaptation via Feature Disentanglement for Image Classification0
NeSyFOLD: Neurosymbolic Framework for Interpretable Image ClassificationCode0
Equivariant Differentially Private Deep Learning: Why DP-SGD Needs Sparser ModelsCode0
Language-Driven Anchors for Zero-Shot Adversarial RobustnessCode0
Identifying Adversarially Attackable and Robust SamplesCode0
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual RecognitionCode2
PhaVIP: Phage VIrion Protein classification based on chaos game representation and Vision TransformerCode1
The Influences of Color and Shape Features in Visual Contrastive Learning0
Supervision Complexity and its Role in Knowledge Distillation0
MetaNO: How to Transfer Your Knowledge on Learning Hidden Physics0
Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits0
Learning to Unlearn: Instance-wise Unlearning for Pre-trained ClassifiersCode1
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationCode1
Direct Parameterization of Lipschitz-Bounded Deep NetworksCode1
PECAN: A Deterministic Certified Defense Against Backdoor Attacks0
Trainable Activations for Image ClassificationCode1
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on GradientsCode1
The Power of Linear Combinations: Learning with Random Convolutions0
Universal Domain Adaptation for Remote Sensing Image Scene ClassificationCode1
Efficient Hyperdimensional ComputingCode0
Neural networks learn to magnify areas near decision boundariesCode0
Discovering and Mitigating Visual Biases through Keyword ExplanationCode1
Explore the Power of Dropout on Few-shot Learning0
Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities0
Navigating the Pitfalls of Active Learning Evaluation: A Systematic Framework for Meaningful Performance AssessmentCode1
Discriminator-free Unsupervised Domain Adaptation for Multi-label Image ClassificationCode1
Self-Supervised Curricular Deep Learning for Chest X-Ray Image Classification0
Connecting metrics for shape-texture knowledge in computer vision0
Progressive Meta-Pooling Learning for Lightweight Image Classification Model0
Lightweight Neural Architecture Search for Temporal Convolutional Networks at the EdgeCode1
When does the student surpass the teacher? Federated Semi-supervised Learning with Teacher-Student EMA0
Local Window Attention Transformer for Polarimetric SAR Image ClassificationCode1
Combined Use of Federated Learning and Image Encryption for Privacy-Preserving Image Classification with Vision Transformer0
Show:102550
← PrevPage 68 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified