SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 4651–4700 of 10420 papers

Title | Status | Hype
Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image Classification Using Transformers | | 0
Meta-Learners for Few-Shot Weakly-Supervised Medical Image Segmentation | Code | 0
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification | Code | 0
Explainable Knowledge Distillation for On-device Chest X-Ray Classification | | 0
A Multi-modal Approach to Single-modal Visual Place Classification | | 0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | | 0
Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels | | 0
Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm Corruptions | Code | 0
Architectural Vision for Quantum Computing in the Edge-Cloud Continuum | Code | 0
Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance | | 0
Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields | Code | 0
Creative Discovery using QD Search | Code | 0
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization | | 0
Pick your Poison: Undetectability versus Robustness in Data Poisoning Attacks | | 0
Boldness-Recalibration for Binary Event Predictions | | 0
Semantic Segmentation using Vision Transformers: A survey | | 0
Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models | | 0
Breast Cancer Diagnosis Using Machine Learning Techniques | | 0
Image Captioners Sometimes Tell More Than Images They See | | 0
LatentAugment: Dynamically Optimized Latent Probabilities of Data Augmentation | Code | 0
Forward-Forward Contrastive Learning | | 0
Unsupervised Mutual Transformer Learning for Multi-Gigapixel Whole Slide Image Classification | | 0
On the Impact of Data Quality on Image Classification Fairness | | 0
mAedesID: Android Application for Aedes Mosquito Species Identification using Convolutional Neural Network | | 0
FCA: Taming Long-tailed Federated Medical Image Classification by Classifier Anchoring | Code | 0
TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification | Code | 0
Detecting Novelties with Empty Classes | | 0
Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT | | 0
MMViT: Multiscale Multiview Vision Transformers | | 0
Advancing Ischemic Stroke Diagnosis: A Novel Two-Stage Approach for Blood Clot Origin Identification | | 0
PVP: Pre-trained Visual Parameter-Efficient Tuning | | 0
Tensor Decomposition for Model Reduction in Neural Networks: A Review | | 0
iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer | Code | 0
Sample-Specific Debiasing for Better Image-Text Models | | 0
Now You See Me: Robust approach to Partial Occlusions | | 0
Evaluating Adversarial Robustness on Document Image Classification | | 0
Graph Convolutional Networks based on Manifold Learning for Semi-Supervised Image Classification | | 0
The Case for Hierarchical Deep Learning Inference at the Network Edge | | 0
Improving Classification Neural Networks by using Absolute activation function (MNIST/LeNET-5 example) | Code | 0
SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models | | 0
Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification | | 0
Exploiting Patch Sizes and Resolutions for Multi-Scale Deep Learning in Mammogram Image Classification | | 0
Picking Up Quantization Steps for Compressed Image Classification | Code | 0
DeformableFormer: Classification of Endoscopic Ultrasound Guided Fine Needle Biopsy in Pancreatic Diseases | | 0
Hyperbolic Geometry in Computer Vision: A Survey | | 0
WATT-EffNet: A Lightweight and Accurate Model for Classifying Aerial Disaster Images | Code | 0
Graph based Label Enhancement for Multi-instance Multi-label learning | | 0
A baseline on continual learning methods for video action recognition | | 0
Angle based dynamic learning rate for gradient descent | Code | 0
Multi-domain learning CNN model for microscopy image classification | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
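
The benchmark metric above, Top 1 Accuracy, is conventionally the share of test images whose highest-scoring predicted class matches the true label. The page does not say how the listed models were evaluated, so the following is only a minimal sketch of that standard definition, assuming scores are reported as percentages. The function name `top_k_accuracy` and the toy data are hypothetical and are not taken from any of the listed papers.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int = 1,
) -> float:
    """Return the percentage of samples whose true label is among the
    k highest-scoring classes (k=1 gives Top 1 Accuracy)."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(labels)


# Hypothetical toy data: 4 images, 3 classes (illustration only).
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.6, 0.3, 0.1],  # predicted class 0
    [0.2, 0.3, 0.5],  # predicted class 2
    [0.5, 0.4, 0.1],  # predicted class 0
]
toy_labels = [1, 0, 1, 1]

print(top_k_accuracy(toy_scores, toy_labels))       # 50.0
print(top_k_accuracy(toy_scores, toy_labels, k=2))  # 100.0
```

On the toy data, two of the four top predictions match their labels, so Top 1 Accuracy is 50.0. Allowing the two best guesses per image recovers the other two.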