SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically deals with single-object images. When the classification becomes highly fine-grained or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
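As a rough illustration of assigning one label to a whole image, the sketch below turns a vector of per-class scores into a single prediction. The class names, the scores, and the helper names `softmax` and `classify` are hypothetical and not taken from any paper listed here; a real model would produce the scores from the image pixels.

```python
import math

def softmax(logits):
    """Convert raw class scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def classify(logits, class_names):
    """Return the single (label, probability) predicted for the whole image."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return class_names[best], probs[best]

# Hypothetical scores a model might emit for one image (assumption).
class_names = ["cat", "dog", "car"]
label, prob = classify([2.0, 0.5, -1.0], class_names)
print(label, round(prob, 3))
```

The key point is that the output is one label per image, in contrast to object detection, which would return a label and a location for each object.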

Papers

Showing 26-50 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| Iterative Quantum Feature Maps | | 0 |
| One Prototype Is Enough: Single-Prototype Activation for Interpretable Image Classification | Code | 0 |
| SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification | | 0 |
| Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | | 0 |
| DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation | | 0 |
| Robust Training with Data Augmentation for Medical Imaging Classification | | 0 |
| Efficient Transformations in Deep Learning Convolutional Neural Networks | | 0 |
| FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | | 0 |
| Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference | | 0 |
| J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor | | 0 |
| DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification | | 0 |
| Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning | | 0 |
| Compositional Attribute Imbalance in Vision Datasets | | 0 |
| One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | | 0 |
| Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks An Architecture Optimization Approach | | 0 |
| SeqPE: Transformer with Sequential Position Encoding | Code | 1 |
| Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Code | 0 |
| Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | | 0 |
| Cross-architecture universal feature coding via distribution alignment | | 0 |
| OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification | Code | 0 |
| Graph Semi-Supervised Learning for Point Classification on Data Manifolds | | 0 |
| MRI-CORE: A Foundation Model for Magnetic Resonance Imaging | | 0 |
| PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis | Code | 0 |
| Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | | 0 |
| SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 | | Unverified |
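
For readers unfamiliar with the metric, the following is a minimal sketch of how Top 1 Accuracy is commonly computed: the share of samples whose highest-scoring class matches the true label. It assumes the table's values are percentages; the function name `top1_accuracy` and the toy scores and labels are hypothetical and unrelated to the models listed.

```python
def top1_accuracy(scores, labels):
    """Percentage of samples whose highest-scoring class equals the true label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, true_label in zip(scores, labels):
        # Index of the largest score is the model's top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == true_label:
            correct += 1
    return 100.0 * correct / len(labels)

# Toy data (assumption): four samples, three classes.
scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.6, 0.3, 0.1],  # predicts class 0
]
labels = [1, 0, 2, 1]
acc = top1_accuracy(scores, labels)
print(acc)  # 3 of 4 predictions match, so this prints 75.0
```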