SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 576600 of 10419 papers

TitleStatusHype
Provable Uncertainty Decomposition via Higher-Order Calibration0
Torque-Aware Momentum0
VisionGRU: A Linear-Complexity RNN Model for Efficient Image AnalysisCode1
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement FilteringCode1
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation0
DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image ClassificationCode0
Predicting the Reliability of an Image Classifier under Image Distortion0
Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities0
Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image ClassificationCode0
UNEM: UNrolled Generalized EM for Transductive Few-Shot LearningCode0
PB-UAP: Hybrid Universal Adversarial Attack For Image Segmentation0
Adversarial Attack Against Images Classification based on Generative Adversarial Networks0
Sensitive Image Classification by Vision Transformers0
V"Mean"ba: Visual State Space Models only need 1 hidden dimension0
LEARN: A Unified Framework for Multi-Task Domain Adapt Few-Shot LearningCode0
FairREAD: Re-fusing Demographic Attributes after Disentanglement for Fair Medical Image Classification0
Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAGCode1
Continual Learning Using a Kernel-Based Method Over Foundation ModelsCode1
Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision TasksCode1
Till the Layers Collapse: Compressing a Deep Neural Network through the Lenses of Batch Normalization LayersCode0
PSSCL: A progressive sample selection framework with contrastive loss designed for noisy labelsCode0
Modelling Multi-modal Cross-interaction for ML-FSIC Based on Local Feature Selection0
Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models0
MBInception: A new Multi-Block Inception Model for Enhancing Image Processing Efficiency0
RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image ClassificationCode0
Show:102550
← PrevPage 24 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified