SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 23512375 of 10420 papers

TitleStatusHype
MoMBS: Mixed-order minibatch sampling enhances model training from diverse-quality images0
Asymmetric Duos: Sidekicks Improve Uncertainty0
COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification0
Ownership Verification of DNN Models Using White-Box Adversarial Attacks with Specified Probability Manipulation0
SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond ClassificationCode0
Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior0
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion0
Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification0
Swin Transformer for Robust CGI Images Detection: Intra- and Inter-Dataset Analysis across Multiple Color Spaces0
Accelerating Targeted Hard-Label Adversarial Attacks in Low-Query Black-Box SettingsCode0
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation0
When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification0
Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification0
Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers0
FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language ModelsCode0
Beyond Linearity: Squeeze-and-Recalibrate Blocks for Few-Shot Whole Slide Image Classification0
Adaptive Temperature Scaling with Conformal Prediction0
SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision TasksCode0
Aligning Explanations with Human CommunicationCode0
GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection0
Scaling Vision Mamba Across Resolutions via Fractal Traversal0
KO: Kinetics-inspired Neural Optimizer with PDE Simulation Approaches0
Large Language Models Implicitly Learn to See and Hear Just By Reading0
Intra-class Patch Swap for Self-DistillationCode0
Enhancing Transformers Through Conditioned Embedded Tokens0
Show:102550
← PrevPage 95 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified