SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1451–1500 of 10,419 papers

Title | Status | Hype
Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping | Code | 1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | Code | 1
Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules | Code | 1
Learning to Generalize: Meta-Learning for Domain Generalization | Code | 1
Capsules with Inverted Dot-Product Attention Routing | Code | 1
Learning to Unlearn: Instance-wise Unlearning for Pre-trained Classifiers | Code | 1
Deep Hyperspectral Unmixing using Transformer Network | Code | 1
A General Regret Bound of Preconditioned Gradient Method for DNN Training | Code | 1
AdaScale SGD: A User-Friendly Algorithm for Distributed Training | Code | 1
Learning Visual Representations for Transfer Learning by Suppressing Texture | Code | 1
An Analysis on Ensemble Learning optimized Medical Image Classification with Deep Convolutional Neural Networks | Code | 1
Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection | Code | 1
Revisiting the Importance of Amplifying Bias for Debiasing | Code | 1
Bias Loss for Mobile Neural Networks | Code | 1
LEMUR Neural Network Dataset: Towards Seamless AutoML | Code | 1
BiasPruner: Debiased Continual Learning for Medical Image Classification | Code | 1
Bidirectional-Convolutional LSTM Based Spectral-Spatial Feature Learning for Hyperspectral Image Classification | Code | 1
Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot Image Classification | Code | 1
Bi-directional Weakly Supervised Knowledge Distillation for Whole Slide Image Classification | Code | 1
FACMIC: Federated Adaptative CLIP Model for Medical Image Classification | Code | 1
The MAMe Dataset: On the relevance of High Resolution and Variable Shape image properties | Code | 1
Deeply Coupled Cross-Modal Prompt Learning | Code | 1
Object Segmentation Without Labels with Large-Scale Generative Models | Code | 1
Deep Subdomain Adaptation Network for Image Classification | Code | 1
Big Self-Supervised Models Advance Medical Image Classification | Code | 1
Fair Contrastive Learning for Facial Attribute Classification | Code | 1
Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1
LightViT: Towards Light-Weight Convolution-Free Vision Transformers | Code | 1
Can Biases in ImageNet Models Explain Generalization? | Code | 1
AdaViT: Adaptive Tokens for Efficient Vision Transformer | Code | 1
Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics | Code | 1
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task | Code | 1
Can Language Understand Depth? | Code | 1
BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models | Code | 1
Deep Multimodal Guidance for Medical Image Classification | Code | 1
LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions | Code | 1
Extremely Lightweight Quantization Robust Real-Time Single-Image Super Resolution for Mobile Devices | Code | 1
LocalViT: Bringing Locality to Vision Transformers | Code | 1
Deep Networks with Stochastic Depth | Code | 1
Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect | Code | 1
DeepNoise: Signal and Noise Disentanglement based on Classifying Fluorescent Microscopy Images via Deep Learning | Code | 1
CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images | Code | 1
CamDiff: Camouflage Image Augmentation via Diffusion Model | Code | 1
BionoiNet: ligand-binding site classification with off-the-shelf deep neural network | Code | 1
High-parallelism Inception-like Spiking Neural Networks for Unsupervised Feature Learning | Code | 1
Deep Polynomial Neural Networks | Code | 1
Extending CAM-based XAI methods for Remote Sensing Imagery Segmentation | Code | 1
LPT: Long-tailed Prompt Tuning for Image Classification | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
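
The Top 1 Accuracy metric in the benchmark results is conventionally the percentage of images whose highest-scoring predicted class matches the ground-truth label. The sketch below illustrates that convention in plain Python. The function name `top1_accuracy` and the toy scores and labels are hypothetical and chosen only for illustration; they do not come from this page or from any listed model, and the exact evaluation protocol behind each claimed number may differ.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose arg-max class equals the label.

    scores: one row of per-class scores per image (hypothetical model output).
    labels: the ground-truth class index for each image.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("need at least one sample")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest score; ties resolve to the lowest index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 4 images, 3 classes (not real benchmark data).
example_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
example_labels = [1, 0, 2, 0]
example_result = top1_accuracy(example_scores, example_labels)
print(example_result)  # 75.0: three of the four predictions match
```

Reporting the value as a percentage matches the scale of the Claimed column, where the entries lie between 90 and 91.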