SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that assigns a single label to an image as a whole. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When the classification becomes very fine-grained or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 6301-6325 of 10420 papers

Title | Status | Hype
Learning Transferable Visual Models From Natural Language Supervision | Code | 2
GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical Imaging | Code | 1
Class Knowledge Overlay to Visual Feature Learning for Zero-Shot Image Classification | - | 0
A Universal Model for Cross Modality Mapping by Relational Reasoning | - | 0
Highly Efficient Representation and Active Learning Framework and Its Application to Imbalanced Medical Image Classification | - | 0
Web Table Classification based on Visual Features | - | 0
Visual Word Embedding for Text Classification | Code | 0
Do Input Gradients Highlight Discriminative Features? | Code | 1
Robust Pollen Imagery Classification with Generative Modeling and Mixup Training | - | 0
Transfer Learning with Convolutional Neural Networks for Rainfall Detection in Single Images | Code | 0
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions | Code | 1
Arguments for the Unsuitability of Convolutional Neural Networks for Non-Local Tasks | - | 0
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks | Code | 2
Probabilistic Spatial Analysis in Quantitative Microscopy with Uncertainty-Aware Cell Detection using Deep Bayesian Regression of Density Maps | - | 0
Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed | Code | 0
FINE Samples for Learning with Noisy Labels | Code | 1
Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception | - | 0
Conditional Positional Encodings for Vision Transformers | Code | 1
MetaDelta: A Meta-Learning System for Few-shot Image Classification | Code | 1
Revisiting Classification Perspective on Scene Text Recognition | - | 0
CSIT-Free Model Aggregation for Federated Edge Learning via Reconfigurable Intelligent Surface | - | 0
The Uncanny Similarity of Recurrence and Depth | Code | 0
Combining Spiking Neural Network and Artificial Neural Network for Enhanced Image Classification | - | 0
A Hierarchical Conditional Random Field-based Attention Mechanism Approach for Gastric Histopathology Image Classification | - | 0
Evolving Attention with Residual Convolutions | Code | 1
Page 253 of 417

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
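
Top 1 Accuracy, the metric reported in the benchmark results, is the percentage of images for which the model's single highest-scoring class equals the true label. The sketch below shows one way to compute it in plain Python. The function name `top1_accuracy` and the example scores and labels are hypothetical and chosen only for illustration; they are not taken from any listed model or benchmark.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class matches the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # argmax over class scores; ties resolve to the lowest class index
        predicted = max(range(len(row)), key=row.__getitem__)
        correct += predicted == label
    return 100.0 * correct / len(labels)


# Hypothetical scores for 4 images over 3 classes (illustrative values only).
example_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
example_labels = [1, 0, 1, 1]

print(top1_accuracy(example_scores, example_labels))  # 75.0 (3 of 4 correct)
```

In practice the scores would come from a classifier's output over a held-out evaluation set, and the result is usually reported to one or two decimal places, as in the Claimed column.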