SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 401425 of 10419 papers

TitleStatusHype
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained DevicesCode1
HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean SpaceCode1
DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual TasksCode1
Finetuning CLIP to Reason about Pairwise DifferencesCode1
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer NetworksCode1
Real-world Adversarial Defense against Patch Attacks based on Diffusion ModelCode1
Anytime Continual Learning for Open Vocabulary ClassificationCode1
Enhancing Few-Shot Image Classification through Learnable Multi-Scale Embedding and Attention MechanismsCode1
EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image ClassificationCode1
PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical LabelsCode1
LowFormer: Hardware Efficient Design for Convolutional Transformer BackbonesCode1
FC-KAN: Function Combinations in Kolmogorov-Arnold NetworksCode1
Spatial-Aware Conformal Prediction for Trustworthy Hyperspectral Image ClassificationCode1
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba TrainingCode1
Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce ClassificationCode1
DCT-CryptoNets: Scaling Private Inference in the Frequency DomainCode1
MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image ClassificationCode1
GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small DatasetsCode1
MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt TuningCode1
Approaching Deep Learning through the Spectral Dynamics of WeightsCode1
Efficient Image-to-Image Diffusion Classifier for Adversarial RobustnessCode1
Towards flexible perception with visual memoryCode1
Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image ClassificationCode1
Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything ModelCode1
Boosting Memory Efficiency in Transfer Learning for High-Resolution Medical Image ClassificationCode1
Show:102550
← PrevPage 17 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified