SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 101–125 of 10,419 papers

Title | Status | Hype
Asymmetric Duos: Sidekicks Improve Uncertainty | | 0
MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images | | 0
Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | Code | 4
Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior | | 0
Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation | | 0
SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Code | 0
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion | | 0
COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification | | 0
Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces | | 0
Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification | | 0
When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification | | 0
Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box Settings | Code | 0
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation | | 0
Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers | | 0
GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection | | 0
Adaptive Temperature Scaling with Conformal Prediction | | 0
Aligning Explanations with Human Communication | Code | 0
Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification | | 0
Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification | | 0
FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models | Code | 0
SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks | Code | 0
Large Language Models Implicitly Learn to See and Hear Just By Reading | | 0
KO: Kinetics-inspired Neural Optimizer with PDE Simulation Approaches | | 0
Intra-class Patch Swap for Self-Distillation | Code | 0
Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
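
The metric in the table above, Top 1 Accuracy, is conventionally the share of test images whose highest-scoring predicted class matches the ground-truth label. The sketch below shows one way to compute it from raw class scores. It assumes the score is reported as a percentage, which the claimed values (for example 90.98) suggest. The function name `top_k_accuracy` and the toy data are illustrative and do not come from the page or from any listed paper.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int = 1,
) -> float:
    """Return the percentage of samples whose true label is among
    the k highest-scoring classes (k=1 gives Top 1 Accuracy)."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one sample is required")

    hits = 0
    for row, label in zip(scores, labels):
        # Class indices ordered by descending score; sorted() is stable,
        # so ties are resolved in favour of the lower class index.
        ranked = sorted(range(len(row)), key=lambda c: -row[c])
        if label in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(labels)


# Hypothetical toy data: 4 images, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.5, 0.3, 0.2],  # predicted class 0
    [0.2, 0.3, 0.5],  # predicted class 2
    [0.6, 0.3, 0.1],  # predicted class 0
]
toy_labels = [1, 0, 1, 2]

print(top_k_accuracy(toy_scores, toy_labels, k=1))  # 50.0
print(top_k_accuracy(toy_scores, toy_labels, k=2))  # 75.0
```

With `k=1` only the first two toy images count as correct. Raising `k` to 2 also accepts the third image, whose true class has the second-highest score.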