SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 876900 of 10419 papers

TitleStatusHype
MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network0
Impact of Regularization on Calibration and Robustness: from the Representation Space Perspective0
IT^3: Idempotent Test-Time Training0
A Retention-Centric Framework for Continual Learning with Guaranteed Model Developmental SafetyCode0
Classification-Denoising Networks0
Selective Transformer for Hyperspectral Image Classification0
Rethinking VLMs and LLMs for Image Classification0
On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions0
Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups0
LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model0
Hard Negative Sample Mining for Whole Slide Image ClassificationCode0
CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registrationCode0
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization0
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual RepresentationsCode0
Personalized Quantum Federated Learning for Privacy Image Classification0
MONICA: Benchmarking on Long-tailed Medical Image ClassificationCode1
Kolmogorov-Arnold Network AutoencodersCode0
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading0
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-TimeCode0
NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion ModelsCode0
KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCACode0
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies0
Satellite image classification with neural quantum kernels0
Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients0
SATA: Spatial Autocorrelation Token Analysis for Enhancing the Robustness of Vision TransformersCode0
Show:102550
← PrevPage 36 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified