SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 25012550 of 10419 papers

TitleStatusHype
Diffusion models applied to skin and oral cancer classification0
GmNet: Revisiting Gating Mechanisms From A Frequency View0
Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble ArchitecturesCode0
Neural Architecture Search by Learning a Hierarchical Search Space0
Improving (α, f)-Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distanceCode0
RBFleX-NAS: Training-Free Neural Architecture Search Using Radial Basis Function Kernel and Hyperparameter Detection0
TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting ModelsCode0
Face Spoofing Detection using Deep LearningCode0
SAFE: Self-Adjustment Federated Learning Framework for Remote Sensing Collaborative Perception0
Extensions of regret-minimization algorithm for optimal design0
VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models0
Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification0
Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation0
Explaining Domain Shifts in Language: Concept erasing for Interpretable Image ClassificationCode0
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry0
Leveraging Text-to-Image Generation for Handling Spurious Correlation0
CoRLD: Contrastive Representation Learning Of Deformable Shapes In ImagesCode0
PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image ClassificationCode0
Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation0
ARC: Anchored Representation Clouds for High-Resolution INR ClassificationCode0
Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image ClassificationCode0
Test-Time Backdoor Detection for Object Detection Models0
Utilization of Neighbor Information for Image Classification with Different Levels of Supervision0
Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems0
GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation0
Neural Edge Histogram Descriptors for Underwater Acoustic Target RecognitionCode0
Defense Against Model Stealing Based on Account-Aware Distribution DiscrepancyCode0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot ClassificationCode0
Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification0
Open-Set Plankton Recognition0
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation0
PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models0
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild0
Learning Interpretable Logic Rules from Deep Vision Models0
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images0
A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification0
Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification0
(, δ) Considered Harmful: Best Practices for Reporting Differential Privacy GuaranteesCode0
Multiplicative Learning0
Bayesian Test-Time Adaptation for Vision-Language Models0
Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity0
ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias MitigationCode0
Discovering Influential Neuron Path in Vision Transformers0
Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework0
Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X0
Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information0
KAN-Mixers: a new deep learning architecture for image classification0
Tangentially Aligned Integrated Gradients for User-Friendly Explanations0
MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification0
Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification0
Show:102550
← PrevPage 51 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified