SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 57015750 of 10420 papers

TitleStatusHype
Global Filter Networks for Image ClassificationCode1
Focal Self-attention for Local-Global Interactions in Vision TransformersCode1
Temporally Sorting Images from Real-World EventsCode0
Understanding Adversarial Examples Through Deep Neural Network's Response Surface and Uncertainty Regions0
One-class Steel Detector Using Patch GAN Discriminator for Visualising Anomalous Feature Map0
Exploring Robust Architectures for Deep Artificial Neural NetworksCode0
Understanding and Improving Early Stopping for Learning with Noisy LabelsCode1
Exploring Localization for Self-supervised Fine-grained Contrastive Learning0
Cells are Actors: Social Network Analysis with Classical ML for SOTA Histology Image ClassificationCode0
Adaptive Sample Selection for Robust Learning under Label NoiseCode0
Benchmarking Knowledge-driven Zero-shot LearningCode1
Zoo-Tuning: Adaptive Transfer from a Zoo of Models0
Co^2L: Contrastive Continual LearningCode1
R-Drop: Regularized Dropout for Neural NetworksCode1
Hyperspectral Remote Sensing Image Classification Based on Multi-scale Cross Graphic Convolution0
Multi-Compound Transformer for Accurate Biomedical Image SegmentationCode1
Progressive Class-based Expansion Learning For Image Classification0
Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic SparsityCode1
RAILS: A Robust Adversarial Immune-inspired Learning SystemCode1
Immuno-mimetic Deep Neural Networks (Immuno-Net)0
Can An Image Classifier Suffice For Action Recognition?Code1
Image Classification with CondenseNeXt for ARM-Based Computing PlatformsCode0
Spectral-Spatial Global Graph Reasoning for Hyperspectral Image ClassificationCode1
Scene Uncertainty and the Wellington Posterior of Deterministic Image Classifiers0
Efficient Document Image Classification Using Region-Based Graph Neural Network0
SITTA: Single Image Texture Translation for Data AugmentationCode1
PVT v2: Improved Baselines with Pyramid Vision TransformerCode1
VOLO: Vision Outlooker for Visual RecognitionCode1
NN2CAM: Automated Neural Network Mapping for Multi-Precision Edge Processing on FPGA-Based Cameras0
Frequency Domain Convolutional Neural Network: Accelerated CNN for Large Diabetic Retinopathy Image Classification0
Estimating the Robustness of Classification Models by the Structure of the Learned Feature-Space0
Classifying Textual Data with Pre-trained Vision Models through Transfer Learning and Data TransformationsCode0
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database0
Bayesian Statistics Guided Label Refurbishment Mechanism: Mitigating Label Noise in Medical Image ClassificationCode0
Florida Wildlife Camera Trap Dataset0
P2T: Pyramid Pooling Transformer for Scene UnderstandingCode1
Multi-layered Semantic Representation Network for Multi-label Image ClassificationCode1
Fourier Transform Approximation as an Auxiliary Task for Image ClassificationCode0
Adaptive Learning Rate and Momentum for Training Deep Neural NetworksCode0
Stochastic Polyak Stepsize with a Moving Target0
The Hitchhiker's Guide to Prior-Shift AdaptationCode0
NCIS: Neural Contextual Iterative Smoothing for Purifying Adversarial Perturbations0
Policy Smoothing for Provably Robust Reinforcement Learning0
Stateful ODE-Nets using Basis Function ExpansionsCode1
On fine-tuning of Autoencoders for Fuzzy rule classifiers0
Secure Distributed Training at ScaleCode1
Brain tumor grade classification Using LSTM Neural Networks with Domain Pre-Transforms0
TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video ClassificationCode0
Segmentation of cell-level anomalies in electroluminescence images of photovoltaic modules0
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?Code1
Show:102550
← PrevPage 115 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified