SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
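As a rough illustration of the distinction drawn above, the following minimal Python sketch contrasts the output of a classifier (one label for the whole image) with the output of a detector (one label and box per object). The function name `classify_image`, the score values, and the box coordinates are all hypothetical and chosen only for illustration; no real model is involved.

```python
from typing import Dict, List, Tuple


def classify_image(scores: Dict[str, float]) -> str:
    """Return the single label with the highest score for the whole image."""
    return max(scores, key=scores.get)


# Hypothetical per-class scores a classifier might emit for one image.
image_scores = {"cat": 0.82, "dog": 0.15, "car": 0.03}
label = classify_image(image_scores)

# By contrast, a detector returns one (label, box) pair per object found.
# Boxes are (x_min, y_min, x_max, y_max) in pixels; the values are made up.
detections: List[Tuple[str, Tuple[int, int, int, int]]] = [
    ("cat", (12, 30, 140, 200)),
    ("dog", (150, 40, 300, 220)),
]
```

The classifier reduces the image to a single decision, while the detector keeps a variable-length list of localized objects.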

Papers

Showing 2226-2250 of 10420 papers

| Title | Status | Hype |
|---|---|---|
| Towards Few-Annotation Learning in Computer Vision: Application to Image Classification and Object Detection tasks | | 0 |
| SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification | Code | 1 |
| OmniVec: Learning robust representations with cross modal sharing | | 0 |
| Improving the Effectiveness of Deep Generative Data | | 0 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Code | 1 |
| A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis | Code | 1 |
| Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review | | 0 |
| Stacked Autoencoder Based Feature Extraction and Superpixel Generation for Multifrequency PolSAR Image Classification | | 0 |
| GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values | | 0 |
| Asymmetric Masked Distillation for Pre-Training Small Foundation Models | Code | 0 |
| MixUp-MIL: A Study on Linear & Multilinear Interpolation-Based Data Augmentation for Whole Slide Image Classification | | 0 |
| GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation | Code | 1 |
| Benchmarking a Benchmark: How Reliable is MS-COCO? | | 0 |
| Hybrid quantum image classification and federated learning for hepatic steatosis diagnosis | | 0 |
| From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models | Code | 0 |
| Thermal Face Image Classification using Deep Learning Techniques | | 0 |
| Continual Learning of Unsupervised Monocular Depth from Videos | Code | 0 |
| Detecting Spurious Correlations via Robust Visual Concepts in Real and AI-Generated Image Classification | | 0 |
| Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models | Code | 1 |
| FedSN: A Federated Learning Framework over Heterogeneous LEO Satellite Networks | | 0 |
| Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings | Code | 0 |
| InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images | Code | 1 |
| Re-weighting Tokens: A Simple and Effective Active Learning Strategy for Named Entity Recognition | | 0 |
| Attention based Dual-Branch Complex Feature Fusion Network for Hyperspectral Image Classification | Code | 1 |
| Scattering Vision Transformer: Spectral Mixing Matters | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
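For readers unfamiliar with the metric, top-1 accuracy is the share of images whose highest-scoring predicted label matches the ground-truth label. The sketch below is a minimal illustration, assuming the values in the table are percentages; the helper name `top1_accuracy` and the sample labels are hypothetical and are not taken from any listed paper.

```python
from typing import Sequence


def top1_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Percentage of samples whose top prediction equals the true label."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("at least one sample is required")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


# Made-up labels for four images; three of the four predictions are correct.
predicted_labels = ["cat", "dog", "car", "cat"]
true_labels = ["cat", "dog", "cat", "cat"]
accuracy = top1_accuracy(predicted_labels, true_labels)  # 75.0
```

On this scale, a claimed value of 90 would correspond to 90 correct top predictions out of every 100 evaluation images.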