SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 63516375 of 10420 papers

TitleStatusHype
Multi-Semantic Image Recognition Model and Evaluating Index for explaining the deep learning models0
Turning old models fashion again: Recycling classical CNN networks using the Lattice Transformation0
Which Design Decisions in AI-enabled Mobile Applications Contribute to Greener AI?0
DeepPSL: End-to-end perception and reasoning0
A Strong Baseline for the VIPriors Data-Efficient Image Classification Challenge0
Federated Deep Learning with Bayesian Privacy0
Predicting Driver Self-Reported Stress by Analyzing the Road Scene0
ST-MAML: A Stochastic-Task based Method for Task-Heterogeneous Meta-Learning0
Training on Test Data with Bayesian Adaptation for Covariate Shift0
Cluster Attack: Query-based Adversarial Attacks on Graphs with Graph-Dependent PriorsCode0
Audio-to-Image Cross-Modal Generation0
Accelerated PDEs for Construction and Theoretical Analysis of an SGD Extension0
Frequency Disentangled Residual Network0
Classification of COVID-19 from CXR Images in a 15-class Scenario: an Attempt to Avoid Bias in the System0
Distribution-sensitive Information Retention for Accurate Binary Neural Network0
From images in the wild to video-informed image classification0
Frequency Pooling: Shift-Equivalent and Anti-Aliasing Downsampling0
A Multi-stage Transfer Learning Framework for Diabetic Retinopathy Grading on Small Data0
FooBaR: Fault Fooling Backdoor Attack on Neural Network TrainingCode0
Partial sensitivity analysis in differential privacyCode0
Multi-Domain Few-Shot Learning and Dataset for Agricultural Applications0
Explaining Convolutional Neural Networks by Tagging Filters0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
GhostShiftAddNet: More Features from Energy-Efficient OperationsCode0
Class incremental learning for video action classification0
Show:102550
← PrevPage 255 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified