SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 33013325 of 10420 papers

TitleStatusHype
Improved Online Conformal Prediction via Strongly Adaptive Online LearningCode1
TFormer: A Transmission-Friendly ViT Model for IoT Devices0
Classification of Lung Pathologies in Neonates using Dual Tree Complex Wavelet Transform0
Learning from Noisy Labels with Decoupled Meta Label PurifierCode1
Learning with Noisy labels via Self-supervised Adversarial Noisy MaskingCode1
Symbolic Discovery of Optimization AlgorithmsCode0
Deep Learning and Medical Imaging for COVID-19 Diagnosis: A Comprehensive Survey0
Simple Hardware-Efficient Long Convolutions for Sequence ModelingCode2
A Comprehensive Study of Modern Architectures and Regularization Approaches on CheXpert50000
Semantic Image Segmentation: Two Decades of Research0
A Domain Decomposition-Based CNN-DNN Architecture for Model Parallel Training Applied to Image Recognition Problems0
Stitchable Neural NetworksCode2
Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic DataCode0
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies0
How to Use Dropout Correctly on Residual Networks with Batch NormalizationCode0
Scientific Computing with Diffractive Optical Neural Networks0
LiT Tuned Models for Efficient Species DetectionCode0
NephroNet: A Novel Program for Identifying Renal Cell Carcinoma and Generating Synthetic Training Images with Convolutional Neural Networks and Diffusion Models0
The LuViRA Dataset: Synchronized Vision, Radio, and Audio Sensors for Indoor LocalizationCode1
Text recognition on images using pre-trained CNN0
Scaling Vision Transformers to 22 Billion ParametersCode0
Context Understanding in Computer Vision: A Survey0
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels0
Reversible Vision TransformersCode1
On Function-Coupled Watermarks for Deep Neural NetworksCode0
Show:102550
← PrevPage 133 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified