SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When classification becomes very fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
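
As a rough illustration of the definition above, whole-image classification can be reduced to picking a single label from a vector of per-class scores. The sketch below assumes a model has already produced such scores; the label names, the score values, and the helper names `softmax` and `classify` are hypothetical and chosen only for this example.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(scores, labels):
    """Return the single most probable label and its probability."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical labels and scores for one image (not taken from any real model).
labels = ["cat", "dog", "plankton"]
scores = [1.2, 0.3, 3.1]

label, confidence = classify(scores, labels)
print(label, round(confidence, 3))
```

An object detector would instead return several labels per image, each with a bounding box, which is the distinction the description draws.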

Papers

Showing 301-325 of 10,419 papers

| Title | Status | Hype |
|---|---|---|
| PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models | | 0 |
| Open-Set Plankton Recognition | | 0 |
| Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images | | 0 |
| Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild | | 0 |
| Learning Interpretable Logic Rules from Deep Vision Models | | 0 |
| A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification | | 0 |
| Interpretable Image Classification via Non-parametric Part Prototype Learning | Code | 1 |
| Multiplicative Learning | | 0 |
| (ε, δ) Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees | Code | 0 |
| Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification | | 0 |
| Discovering Influential Neuron Path in Vision Transformers | | 0 |
| Bayesian Test-Time Adaptation for Vision-Language Models | | 0 |
| ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation | Code | 0 |
| Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework | | 0 |
| Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | | 0 |
| Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information | | 0 |
| Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching | Code | 1 |
| Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity | | 0 |
| KAN-Mixers: a new deep learning architecture for image classification | | 0 |
| MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification | | 0 |
| Tangentially Aligned Integrated Gradients for User-Friendly Explanations | | 0 |
| Measuring directional bias amplification in image captions using predictability | | 0 |
| MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | | 0 |
| Task Vector Quantization for Memory-Efficient Model Merging | Code | 0 |
| A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
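
The metric reported above, Top 1 Accuracy, is the percentage of images for which the model's highest-scoring class matches the ground-truth label. A minimal sketch of that computation follows; the function name `top1_accuracy` and the toy scores and labels are assumptions made for illustration and are unrelated to the listed models or to any particular benchmark dataset.

```python
def top1_accuracy(score_rows, true_labels):
    """Percentage of rows whose highest-scoring class index equals the true label."""
    if len(score_rows) != len(true_labels):
        raise ValueError("score_rows and true_labels must have the same length")
    if not score_rows:
        raise ValueError("at least one example is required")

    correct = 0
    for scores, truth in zip(score_rows, true_labels):
        # Index of the largest score is the top-1 prediction.
        predicted = max(range(len(scores)), key=scores.__getitem__)
        if predicted == truth:
            correct += 1
    return 100.0 * correct / len(true_labels)


# Toy data: four images, three classes (values invented for this example).
toy_scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1
]
toy_labels = [1, 0, 2, 2]

accuracy = top1_accuracy(toy_scores, toy_labels)
print(accuracy)  # 75.0 (three of four predictions are correct)
```

A claimed value of 90.2 under this definition would mean that roughly 902 of every 1,000 evaluation images receive the correct top-ranked label.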