SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 5926 to 5950 of 10420 papers

| Title | Status | Hype |
| --- | --- | --- |
| Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models | | 0 |
| Text Descriptions are Compressive and Invariant Representations for Visual Learning | | 0 |
| Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | | 0 |
| Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | | 0 |
| Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | | 0 |
| Leveraging Superfluous Information in Contrastive Representation Learning | | 0 |
| Leveraging Systematic Knowledge of 2D Transformations | | 0 |
| MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | | 0 |
| Break a Lag: Triple Exponential Moving Average for Enhanced Optimization | | 0 |
| MDL-NAS: A Joint Multi-Domain Learning Framework for Vision Transformer | | 0 |
| Attack Agnostic Statistical Method for Adversarial Detection | | 0 |
| Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification | | 0 |
| LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | | 0 |
| GRASP: A Rehearsal Policy for Efficient Online Continual Learning | | 0 |
| Coupled End-to-End Transfer Learning With Generalized Fisher Information | | 0 |
| Measuring directional bias amplification in image captions using predictability | | 0 |
| AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network | | 0 |
| GraphViz2Vec: A Structure-aware Feature Generation Model to Improve Classification in GNNs | | 0 |
| DIET-SNN: Direct Input Encoding With Leakage and Threshold Optimization in Deep Spiking Neural Networks | | 0 |
| Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks | | 0 |
| Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | | 0 |
| A Hybrid Architecture for On-Device Compressive Machine Learning | | 0 |
| DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification | | 0 |
| Graph Structural Aggregation for Explainable Learning | | 0 |
| Graphs for deep learning representations | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
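
For readers unfamiliar with the metric, Top 1 Accuracy is commonly computed as the share of images whose highest-scoring class matches the true label. The following is a minimal sketch in plain Python, assuming the values in the table are percentages. The function names `top1_prediction` and `top1_accuracy` and the sample scores are hypothetical, chosen for illustration only; they do not come from any listed model or paper.

```python
from typing import Sequence


def top1_prediction(scores: Sequence[float]) -> int:
    """Return the index of the highest-scoring class (ties go to the lowest index)."""
    return max(range(len(scores)), key=lambda i: scores[i])


def top1_accuracy(all_scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of images whose top-scoring class equals the true label."""
    if len(all_scores) != len(labels):
        raise ValueError("need exactly one label per image")
    if not labels:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(top1_prediction(s) == y for s, y in zip(all_scores, labels))
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes (illustrative values only).
scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
labels = [1, 0, 2, 2]

accuracy = top1_accuracy(scores, labels)
print(f"Top-1 accuracy: {accuracy:.1f}%")
```

In this toy example three of the four predictions match their labels. Leaderboard figures such as those above are computed the same way in principle, but over a full benchmark test or validation set.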