SOTAVerified

Image Classification

Image classification is a fundamental visual recognition task that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 826-850 of 10419 papers

Title | Status | Hype
PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | - | 0
Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | - | 0
Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy | Code | 0
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models | Code | 0
Towards Better Multi-head Attention via Channel-wise Sample Permutation | Code | 0
Towards a More Complete Theory of Function Preserving Transforms | - | 0
GlobalMamba: Global Image Serialization for Vision Mamba | Code | 1
Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | - | 0
Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Code | 0
big.LITTLE Vision Transformer for Efficient Visual Recognition | - | 0
SkillAggregation: Reference-free LLM-Dependent Aggregation | - | 0
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | - | 0
Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning | - | 0
Robust 3D Point Clouds Classification based on Declarative Defenders | Code | 1
Understanding Robustness of Parameter-Efficient Tuning for Image Classification | Code | 0
Deep Transfer Learning: Model Framework and Error Analysis | - | 0
Diabetic retinopathy image classification method based on GreenBen data augmentation | - | 0
EG-SpikeFormer: Eye-Gaze Guided Transformer on Spiking Neural Networks for Medical Image Analysis | - | 0
Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images | - | 0
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | - | 0
Efficient Hyperparameter Importance Assessment for CNNs | - | 0
Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | - | 0
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Code | 1
Frequency-Temporal Attention Network for Remote Sensing Imagery Change Detection | Code | 0
Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
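
For readers unfamiliar with the metric in the table above, the following is a minimal sketch of how a Top 1 Accuracy figure is commonly computed: a prediction counts as correct when the highest-scoring class matches the ground-truth label. It assumes the Claimed values are percentages, which the page does not state explicitly. The function name `top1_accuracy` and the toy scores and labels are hypothetical and are not taken from any listed paper or from the site's evaluation pipeline.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the model's single (top 1) prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 4 images, 3 classes (illustrative only).
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
toy_labels = [1, 0, 2, 2]

result = top1_accuracy(toy_scores, toy_labels)
print(result)  # 75.0 (3 of 4 predictions match)
```

Under this reading, a claimed value of 90.2 would mean that roughly 90.2% of evaluation images received the correct label as the model's first choice.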