SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 27012750 of 10419 papers

TitleStatusHype
Invariant Scattering Transform for Medical Imaging0
How to use model architecture and training environment to estimate the energy consumption of DL trainingCode0
Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification & Segmentation0
Learning Curves for Noisy Heterogeneous Feature-Subsampled Ridge EnsemblesCode0
A Novel Site-Agnostic Multimodal Deep Learning Model to Identify Pro-Eating Disorder Content on Social Media0
Improving the Efficiency of Human-in-the-Loop Systems: Adding Artificial to Human ExpertsCode0
Art Authentication with Vision Transformers0
Multi-Similarity Contrastive Learning0
Revisiting Computer-Aided Tuberculosis DiagnosisCode1
Distilling Large Vision-Language Model with Out-of-Distribution GeneralizabilityCode1
Benchmarking Test-Time Adaptation against Distribution Shifts in Image ClassificationCode1
The Role of Subgroup Separability in Group-Fair Medical Image ClassificationCode0
Multi-Scale U-Shape MLP for Hyperspectral Image Classification0
UX Heuristics and Checklist for Deep Learning powered Mobile Applications with Image Classification0
Adversarial Attacks on Image Classification Models: FGSM and Patch Attacks and their Impact0
Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier is All You NeedCode1
A ChatGPT Aided Explainable Framework for Zero-Shot Medical Image Diagnosis0
Multi-Scale Prototypical Transformer for Whole Slide Image Classification0
Make A Long Image Short: Adaptive Token Length for Vision Transformers0
A Neural Collapse Perspective on Feature Evolution in Graph Neural NetworksCode0
Continual Learning in Open-vocabulary Classification with Complementary Memory SystemsCode0
In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification0
Mitigating Bias: Enhancing Image Classification by Improving Model Explanations0
Why do CNNs excel at feature extraction? A mathematical explanation0
Structured Network Pruning by Measuring Filter-wise Interactions0
Review helps learn better: Temporal Supervised Knowledge Distillation0
FedDefender: Backdoor Attack Defense in Federated LearningCode1
The Forward-Forward Algorithm as a feature extractor for skin lesion classification: A preliminary study0
Query-Efficient Decision-based Black-Box Patch Attack0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
Forward-Forward Algorithm for Hyperspectral Image Classification: A Preliminary Study0
MobileViG: Graph-Based Sparse Attention for Mobile Vision ApplicationsCode1
Single-Stage Heavy-Tailed Food Classification0
Filter Pruning for Efficient CNNs via Knowledge-driven Differential Filter SamplerCode0
More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data0
Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial PredictionsCode1
Vision Through the Veil: Differential Privacy in Federated Learning for Medical Image Classification0
Designing Stable Neural Networks using Convex Analysis and ODEsCode0
Post-train Black-box Defense via Bayesian Boundary Correction0
CLIPAG: Towards Generator-Free Text-to-Image Generation0
Boosting the Generalization Ability for Hyperspectral Image Classification using Spectral-spatial Axial Aggregation Transformer0
PCDAL: A Perturbation Consistency-Driven Active Learning Approach for Medical Image Segmentation and ClassificationCode0
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional ModelsCode1
Cross-Inferential Networks for Source-free Unsupervised Domain Adaptation0
A Structurally Regularized CNN Architecture via Adaptive Subband Decomposition0
Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural RepresentationCode0
Does Saliency-Based Training bring Robustness for Deep Neural Networks in Image Classification?0
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification0
Approximated Prompt Tuning for Vision-Language Pre-trained Models0
Predictive Coding beyond Correlations0
Show:102550
← PrevPage 55 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified