SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically deals with single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 851–900 of 10419 papers

| Title | Status | Hype |
| --- | --- | --- |
| What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | - | 0 |
| Time Traveling to Defend Against Adversarial Example Attacks in Image Classification | - | 0 |
| When the Small-Loss Trick is Not Enough: Multi-Label Image Classification with Noisy Labels Applied to CCTV Sewer Inspections | - | 0 |
| Explainability of Deep Neural Networks for Brain Tumor Detection | Code | 0 |
| CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | - | 0 |
| More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Code | 0 |
| JPEG Inspired Deep Learning | Code | 0 |
| QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Code | 1 |
| Optimizing Estimators of Squared Calibration Errors in Classification | - | 0 |
| Convex Distillation: Efficient Compression of Deep Networks via Convex Optimization | - | 0 |
| Parameter Efficient Fine-tuning via Explained Variance Adaptation | Code | 1 |
| FACMIC: Federated Adaptative CLIP Model for Medical Image Classification | Code | 1 |
| NegMerge: Consensual Weight Negation for Strong Machine Unlearning | Code | 1 |
| A second-order-like optimizer with adaptive gradient scaling for deep learning | Code | 0 |
| Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines | Code | 0 |
| Contrastive Learning to Fine-Tune Feature Extraction Models for the Visual Cortex | - | 0 |
| Core Tokensets for Data-efficient Sequential Training of Transformers | Code | 0 |
| Conformal Structured Prediction | Code | 0 |
| Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge | - | 0 |
| SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification | Code | 1 |
| IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification | - | 0 |
| LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | - | 0 |
| Art Forgery Detection using Kolmogorov Arnold and Convolutional Neural Networks | - | 0 |
| Control-oriented Clustering of Visual Latent Representation | - | 0 |
| Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Code | 0 |
| MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network | - | 0 |
| Impact of Regularization on Calibration and Robustness: from the Representation Space Perspective | - | 0 |
| IT^3: Idempotent Test-Time Training | - | 0 |
| A Retention-Centric Framework for Continual Learning with Guaranteed Model Developmental Safety | Code | 0 |
| Classification-Denoising Networks | - | 0 |
| Selective Transformer for Hyperspectral Image Classification | - | 0 |
| Rethinking VLMs and LLMs for Image Classification | - | 0 |
| On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions | - | 0 |
| Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | - | 0 |
| LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | - | 0 |
| Hard Negative Sample Mining for Whole Slide Image Classification | Code | 0 |
| CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration | Code | 0 |
| BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization | - | 0 |
| SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations | Code | 0 |
| Personalized Quantum Federated Learning for Privacy Image Classification | - | 0 |
| MONICA: Benchmarking on Long-tailed Medical Image Classification | Code | 1 |
| Kolmogorov-Arnold Network Autoencoders | Code | 0 |
| Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading | - | 0 |
| Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Code | 0 |
| NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Code | 0 |
| KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA | Code | 0 |
| Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies | - | 0 |
| Satellite image classification with neural quantum kernels | - | 0 |
| Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients | - | 0 |
| SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision Transformers | Code | 0 |

Benchmark Results

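| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified |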
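
Top 1 Accuracy, the metric reported in the benchmark results, is conventionally the percentage of test images whose single highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that computation, assuming per-class scores are given as plain Python lists. The function name `top1_accuracy` and the toy scores and labels are hypothetical and are not taken from any listed model or benchmark.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of examples whose highest-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one example is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the predicted class (first index wins ties).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical scores for 4 images over 3 classes (illustrative values only).
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.5, 0.4, 0.1],  # predicted class 0
]
labels = [1, 0, 1, 0]

accuracy = top1_accuracy(scores, labels)
print(f"Top 1 Accuracy: {accuracy}")
```

In this toy example three of the four predictions match their labels. A leaderboard value such as 90.98 would, under the same convention, mean that 90.98% of the evaluation images were assigned the correct label.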