SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 3301–3350 of 10419 papers

Title | Status | Hype
Improved Online Conformal Prediction via Strongly Adaptive Online Learning | Code | 1
TFormer: A Transmission-Friendly ViT Model for IoT Devices | - | 0
Classification of Lung Pathologies in Neonates using Dual Tree Complex Wavelet Transform | - | 0
Learning from Noisy Labels with Decoupled Meta Label Purifier | Code | 1
Learning with Noisy labels via Self-supervised Adversarial Noisy Masking | Code | 1
Symbolic Discovery of Optimization Algorithms | Code | 0
Simple Hardware-Efficient Long Convolutions for Sequence Modeling | Code | 2
Deep Learning and Medical Imaging for COVID-19 Diagnosis: A Comprehensive Survey | - | 0
A Comprehensive Study of Modern Architectures and Regularization Approaches on CheXpert5000 | - | 0
Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data | Code | 0
A Domain Decomposition-Based CNN-DNN Architecture for Model Parallel Training Applied to Image Recognition Problems | - | 0
Semantic Image Segmentation: Two Decades of Research | - | 0
Stitchable Neural Networks | Code | 2
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies | - | 0
How to Use Dropout Correctly on Residual Networks with Batch Normalization | Code | 0
Scientific Computing with Diffractive Optical Neural Networks | - | 0
LiT Tuned Models for Efficient Species Detection | Code | 0
NephroNet: A Novel Program for Identifying Renal Cell Carcinoma and Generating Synthetic Training Images with Convolutional Neural Networks and Diffusion Models | - | 0
The LuViRA Dataset: Synchronized Vision, Radio, and Audio Sensors for Indoor Localization | Code | 1
Text recognition on images using pre-trained CNN | - | 0
Context Understanding in Computer Vision: A Survey | - | 0
Scaling Vision Transformers to 22 Billion Parameters | Code | 0
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels | - | 0
Reversible Vision Transformers | Code | 1
On Function-Coupled Watermarks for Deep Neural Networks | Code | 0
Cross-Layer Retrospective Retrieving via Layer Attention | Code | 1
Effective Data Augmentation With Diffusion Models | Code | 2
Understanding Why ViT Trains Badly on Small Datasets: An Intuitive Perspective | Code | 2
On the Ideal Number of Groups for Isometric Gradient Propagation | - | 0
Class-Incremental Learning: A Survey | Code | 2
Multipath agents for modular multitask ML systems | - | 0
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Code | 1
Rethinking Robust Contrastive Learning from the Adversarial Perspective | Code | 0
CECT: Controllable Ensemble CNN and Transformer for COVID-19 Image Classification | Code | 0
Revisiting Discriminative vs. Generative Classifiers: Theory and Implications | Code | 1
Knowledge Distillation in Vision Transformers: A Critical Review | - | 0
Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification | - | 0
CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks | Code | 1
Cluster-CAM: Cluster-Weighted Visual Interpretation of CNNs' Decision in Image Classification | - | 0
Example-Based Explainable AI and its Application for Remote Sensing Image Classification | - | 0
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers | - | 0
Spiking Synaptic Penalty: Appropriate Penalty Term for Energy-Efficient Spiking Neural Networks | - | 0
Revisiting Long-tailed Image Classification: Survey and Benchmarks with New Evaluation Metrics | - | 0
SoK: A Systematic Evaluation of Backdoor Trigger Characteristics in Image Classification | - | 0
SAAL: Sharpness-Aware Active Learning | Code | 1
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment | Code | 1
Hyperspectral Image Classification Using Deep Matrix Capsules | Code | 1
Continual Learning with Scaled Gradient Projection | Code | 1
FV-MgNet: Fully Connected V-cycle MgNet for Interpretable Time Series Forecasting | - | 0
On Suppressing Range of Adaptive Stepsizes of Adam to Improve Generalisation Performance | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
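
The Top 1 Accuracy metric in the table above can be illustrated with a minimal sketch. It assumes the usual definition (the share of images whose highest-scoring class matches the true label, expressed as a percentage). The function name `top1_accuracy` and the example scores are hypothetical and are not taken from any listed paper or from this site's evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class equals the true label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the predicted class for this image.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes (made-up values).
example_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
example_labels = [1, 0, 2, 2]

print(top1_accuracy(example_scores, example_labels))  # 3 of 4 correct -> 75.0
```

Under this reading, a claimed value of 91 would mean that 91% of evaluation images received the correct label as the single top prediction.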