SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 26512675 of 10420 papers

TitleStatusHype
FedSoup: Improving Generalization and Personalization in Federated Learning via Selective Model InterpolationCode1
The importance of feature preprocessing for differentially private linear optimization0
Confidence Estimation Using Unlabeled DataCode0
Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition0
Attacking by Aligning: Clean-Label Backdoor Attacks on Object DetectionCode0
What do neural networks learn in image classification? A frequency shortcut perspectiveCode1
Interpreting and Correcting Medical Image Classification with PIP-NetCode1
Semantic-Aware Dual Contrastive Learning for Multi-label Image ClassificationCode1
As large as it gets: Learning infinitely large Filters via Neural Implicit Functions in the Fourier DomainCode0
Linearized Relative Positional EncodingCode0
Promoting Exploration in Memory-Augmented Adam using Critical MomentaCode0
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image ClassificationCode0
Systematic comparison of semi-supervised and self-supervised learning for medical image classificationCode1
R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut0
Human Action Recognition in Still Images Using ConViT0
Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-IdentificationCode1
Airway Label Prediction in Video Bronchoscopy: Capturing Temporal Dependencies Utilizing Anatomical Knowledge0
Active Learning for Object Detection with Non-Redundant Informative Sampling0
Multi-Domain Learning with Modulation Adapters0
Diffusion Models Beat GANs on Image ClassificationCode1
M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry OptimizationCode1
Does Visual Pretraining Help End-to-End Reasoning?0
Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation0
Spatial-Spectral Hyperspectral Classification based on Learnable 3D Group ConvolutionCode0
Machine learning for option pricing: an empirical investigation of network architectures0
Show:102550
← PrevPage 107 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified