SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 44014425 of 10420 papers

TitleStatusHype
Towards a Visual-Language Foundation Model for Computational Pathology0
Few-shot 1/a Anomalies Feedback : Damage Vision Mining Opportunity and Embedding Feature Imbalance0
Concept-based explainability for an EEG transformer modelCode0
An X3D Neural Network Analysis for Runner's Performance Assessment in a Wild Sporting Environment0
Sparse then Prune: Toward Efficient Vision TransformersCode0
An Intelligent Remote Sensing Image Quality Inspection SystemCode0
Post-variational quantum neural networks0
Quantized Feature Distillation for Network Quantization0
Deep learning for classification of noisy QR codes0
The importance of feature preprocessing for differentially private linear optimization0
Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition0
Attacking by Aligning: Clean-Label Backdoor Attacks on Object DetectionCode0
As large as it gets: Learning infinitely large Filters via Neural Implicit Functions in the Fourier DomainCode0
Confidence Estimation Using Unlabeled DataCode0
R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut0
Human Action Recognition in Still Images Using ConViT0
Linearized Relative Positional EncodingCode0
Promoting Exploration in Memory-Augmented Adam using Critical MomentaCode0
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image ClassificationCode0
Does Visual Pretraining Help End-to-End Reasoning?0
Multi-Domain Learning with Modulation Adapters0
Airway Label Prediction in Video Bronchoscopy: Capturing Temporal Dependencies Utilizing Anatomical Knowledge0
Active Learning for Object Detection with Non-Redundant Informative Sampling0
Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation0
Spatial-Spectral Hyperspectral Classification based on Learnable 3D Group ConvolutionCode0
Show:102550
← PrevPage 177 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified