SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 76–100 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| Towards Graph-Based Privacy-Preserving Federated Learning: ModelNet -- A ResNet-based Model Classification Dataset | | 0 |
| Optimal Weighted Convolution for Classification and Denosing | Code | 2 |
| Provably Improving Generalization of Few-Shot Models with Synthetic Data | | 0 |
| Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting | | 0 |
| SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification | | 0 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2 |
| MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection | | 0 |
| BIRD: Behavior Induction via Representation-structure Distillation | | 0 |
| Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | Code | 0 |
| Deep Modeling and Optimization of Medical Image Classification | Code | 0 |
| DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification | | 0 |
| MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification | | 0 |
| Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | Code | 1 |
| Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | | 0 |
| Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision | | 0 |
| Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning | | 0 |
| Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models | Code | 0 |
| Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models | Code | 0 |
| Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning | | 0 |
| DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models | | 0 |
| UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models | | 0 |
| Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases | | 0 |
| Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models | | 0 |
| Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Code | 0 |
| Remote Sensing Image Classification with Decoupled Knowledge Distillation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
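
The metric in the table above is top-1 accuracy: the share of test images whose highest-scoring predicted class matches the ground-truth label. Below is a minimal sketch of that computation in plain Python, assuming the claimed values are percentages. The function name `top1_accuracy` and the example scores and labels are hypothetical and chosen only for illustration; they are not taken from any listed model or benchmark.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose top-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # argmax over class scores; on ties the lowest class index wins
        predicted = max(range(len(row)), key=row.__getitem__)
        correct += predicted == label
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes (illustrative values only).
example_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
example_labels = [1, 0, 2, 2]

print(top1_accuracy(example_scores, example_labels))  # 75.0
```

Three of the four hypothetical predictions match their labels, so the sketch reports 75.0. A "Claimed" value such as 90.98 would, under the same assumption, mean that 90.98% of the benchmark's test images were classified correctly.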