SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 36013650 of 10419 papers

TitleStatusHype
Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot Image ClassificationCode1
GENNAPE: Towards Generalized Neural Architecture Performance EstimatorsCode0
Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think0
AIO-P: Expanding Neural Performance Predictors Beyond Image ClassificationCode1
Exploiting Category Names for Few-Shot Classification with Vision-Language Models0
AdvMask: A Sparse Adversarial Attack Based Data Augmentation Method for Image Classification0
SimCS: Simulation for Domain Incremental Online Continual Segmentation0
Impact of Automatic Image Classification and Blind Deconvolution in Improving Text Detection Performance of the CRAFT Algorithm0
Curriculum Temperature for Knowledge DistillationCode1
Entropy-Driven Mixed-Precision Quantization for Deep Network Design0
SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image ClassificationCode0
Establishment of Neural Networks Robust to Label Noise0
Class Adaptive Network CalibrationCode1
SI-GAT: A method based on improved Graph Attention Network for sonar image classification0
Explaining Deep Convolutional Neural Networks for Image Classification by Evolving Local Interpretable Model-agnostic Explanations0
Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis0
Learning to Learn: How to Continuously Teach Humans and Machines0
RankDNN: Learning to Rank for Few-shot LearningCode1
A Call to Reflect on Evaluation Practices for Failure Detection in Image ClassificationCode1
Context-Adaptive Deep Neural Networks via Bridge-Mode Connectivity0
Semantic-Aware Local-Global Vision Transformer0
A Particle-based Sparse Gaussian Process Optimizer0
Looking at the posterior: accuracy and uncertainty of neural-network predictions0
CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text LabelsCode2
Cross-Domain Ensemble Distillation for Domain GeneralizationCode1
Differentially Private Image Classification from FeaturesCode0
Relating Regularization and Generalization through the Intrinsic Dimension of Activations0
Self-Supervised Learning based on Heat Equation0
Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision0
EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational DataCode0
Data Augmentation Vision Transformer for Fine-grained Image Classification0
AugOp: Inject Transformation into Neural Operator0
ActMAD: Activation Matching to Align Distributions for Test-Time-TrainingCode1
Word-Level Representation From Bytes For Language Modeling0
SVFormer: Semi-supervised Video Transformer for Action RecognitionCode1
Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving0
Dynamic Loss For Robust LearningCode0
Rethinking Implicit Neural Representations for Vision Learners0
Semantic Guided Level-Category Hybrid Prediction Network for Hierarchical Image Classification0
Towards Human-Interpretable Prototypes for Visual Assessment of Image Classification Models0
Neural Dependencies Emerging from Learning Massive Categories0
Multi-Spectral Image Classification with Ultra-Lean Complex-Valued Models0
Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation0
Blind Knowledge Distillation for Robust Image ClassificationCode0
Plug and Play Active Learning for Object DetectionCode1
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image ClassificationCode1
R2-MLP: Round-Roll MLP for Multi-View 3D Object RecognitionCode0
Learning to Generate Image Embeddings with User-level Differential Privacy0
An Algorithm for Routing Vectors in Sequences0
Frozen Overparameterization: A Double Descent Perspective on Transfer Learning of Deep Neural Networks0
Show:102550
← PrevPage 73 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified