SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 12011250 of 10419 papers

TitleStatusHype
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer0
TCFormer: Visual Recognition via Token Clustering TransformerCode3
Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification0
Employing Sentence Space Embedding for Classification of Data Stream from Fake News DomainCode0
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP InversionCode0
Improving Hyperbolic Representations via Gromov-Wasserstein Regularization0
Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification0
Anticipating Future Object Compositions without Forgetting0
GeoMix: Towards Geometry-Aware Data AugmentationCode0
Backdoor Attacks against Image-to-Image Networks0
DataDream: Few-shot Guided Dataset GenerationCode2
Augmented Neural Fine-Tuning for Efficient Backdoor PurificationCode1
Deep Learning Algorithms for Early Diagnosis of Acute Lymphoblastic Leukemia0
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification0
Seq-to-Final: A Benchmark for Tuning from Sequential Distributions to a Final Time PointCode0
GPC: Generative and General Pathology Image Classifier0
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays OffCode0
On Exact Bit-level Reversible Transformers Without Changing ArchitecturesCode1
Open Vocabulary Multi-Label Video Classification0
SlideGCD: Slide-based Graph Collaborative Training with Knowledge Distillation for Whole Slide Image ClassificationCode0
CAMP: Continuous and Adaptive Learning Model in PathologyCode0
A Mathematical Framework, a Taxonomy of Modeling Paradigms, and a Suite of Learning Techniques for Neural-Symbolic SystemsCode0
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification0
Local Clustering for Lung Cancer Image Classification via Sparse Solution TechniqueCode0
Histopathological Image Classification with Cell Morphology Aware Deep Neural NetworksCode1
BiasPruner: Debiased Continual Learning for Medical Image ClassificationCode1
Enrich the content of the image Using Context-Aware Copy Paste0
GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image ClassificationCode1
Exploring the Boundaries of On-Device Inference: When Tiny Falls Short, Go Hierarchical0
MambaVision: A Hybrid Mamba-Transformer Vision BackboneCode7
The Misclassification Likelihood Matrix: Some Classes Are More Likely To Be Misclassified Than Others0
HDKD: Hybrid Data-Efficient Knowledge Distillation Network for Medical Image ClassificationCode0
FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image ClassificationCode0
Dual-stage Hyperspectral Image Classification Model with Spectral SupertokenCode1
Trainable Highly-expressive Activation FunctionsCode1
Towards a text-based quantitative and explainable histopathology image analysisCode0
Exploring Camera Encoder Designs for Autonomous Driving Perception0
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification0
CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning FusionCode0
GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images0
Hybrid Classical-Quantum architecture for vectorised image classification of hand-written sketches0
Momentum Auxiliary Network for Supervised Local LearningCode1
Wavelet Convolutions for Large Receptive FieldsCode4
An accurate detection is not all you need to combat label noise in web-noisy datasetsCode0
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance0
Evaluating the Fairness of Neural Collapse in Medical Image Classification0
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification0
Multi-Label Plant Species Classification with Self-Supervised Vision TransformersCode1
Active Label Refinement for Robust Training of Imbalanced Medical Image Classification Tasks in the Presence of High Label NoiseCode0
Leveraging Topological Guidance for Improved Knowledge DistillationCode0
Show:102550
← PrevPage 25 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified