SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 57265750 of 10420 papers

TitleStatusHype
SITTA: Single Image Texture Translation for Data AugmentationCode1
PVT v2: Improved Baselines with Pyramid Vision TransformerCode1
VOLO: Vision Outlooker for Visual RecognitionCode1
NN2CAM: Automated Neural Network Mapping for Multi-Precision Edge Processing on FPGA-Based Cameras0
Frequency Domain Convolutional Neural Network: Accelerated CNN for Large Diabetic Retinopathy Image Classification0
Estimating the Robustness of Classification Models by the Structure of the Learned Feature-Space0
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data TransformationsCode0
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database0
Bayesian Statistics Guided Label Refurbishment Mechanism: Mitigating Label Noise in Medical Image ClassificationCode0
Florida Wildlife Camera Trap Dataset0
P2T: Pyramid Pooling Transformer for Scene UnderstandingCode1
Multi-layered Semantic Representation Network for Multi-label Image ClassificationCode1
Fourier Transform Approximation as an Auxiliary Task for Image ClassificationCode0
Adaptive Learning Rate and Momentum for Training Deep Neural NetworksCode0
Stochastic Polyak Stepsize with a Moving Target0
The Hitchhiker's Guide to Prior-Shift AdaptationCode0
NCIS: Neural Contextual Iterative Smoothing for Purifying Adversarial Perturbations0
Policy Smoothing for Provably Robust Reinforcement Learning0
Stateful ODE-Nets using Basis Function ExpansionsCode1
On fine-tuning of Autoencoders for Fuzzy rule classifiers0
Secure Distributed Training at ScaleCode1
Brain tumor grade classification Using LSTM Neural Networks with Domain Pre-Transforms0
TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video ClassificationCode0
Segmentation of cell-level anomalies in electroluminescence images of photovoltaic modules0
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?Code1
Show:102550
← PrevPage 230 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified