SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 23262350 of 10420 papers

TitleStatusHype
Subspace Adaptation Prior for Few-Shot LearningCode0
PaLI-3 Vision Language Models: Smaller, Faster, StrongerCode1
Fusion framework and multimodality for the Laplacian approximation of Bayesian neural networks0
Self-supervised visual learning for analyzing firearms trafficking activities on the Web0
Leveraging Vision-Language Models for Improving Domain Generalization in Image ClassificationCode1
Strategies and impact of learning curve estimation for CNN-based image classification0
AutoVP: An Automated Visual Prompting Framework and BenchmarkCode1
Revisiting Data Augmentation for Rotational Invariance in Convolutional Neural Networks0
DualAug: Exploiting Additional Heavy Augmentation with OOD Data RejectionCode0
NeuroInspect: Interpretable Neuron-based Debugging Framework through Class-conditional VisualizationsCode0
Multiview Transformer: Rethinking Spatial Information in Hyperspectral Image Classification0
Histopathological Image Classification and Vulnerability Analysis using Federated Learning0
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configurationCode0
Human-Centered Evaluation of XAI Methods0
Efficient Adaptation of Large Vision Transformer via Adapter Re-ComposingCode1
Distributed Transfer Learning with 4th Gen Intel Xeon Processors0
Utilizing Synthetic Data for Medical Vision-Language Pre-training: Bypassing the Need for Real ImagesCode0
Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox0
SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural NetworkCode0
EViT: An Eagle Vision Transformer with Bi-Fovea Self-AttentionCode1
Text-driven Prompt Generation for Vision-Language Models in Federated Learning0
ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain0
Unleashing the power of Neural Collapse for Transferability Estimation0
What do larger image classifiers memorise?0
Transformer Fusion with Optimal TransportCode1
Show:102550
← PrevPage 94 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified