SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 251-300 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| Geometric Median Matching for Robust k-Subset Selection from Noisy Data | | 0 |
| PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image Classification | Code | 0 |
| Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models | | 0 |
| Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems | | 0 |
| Over-the-Air Edge Inference via End-to-End Metasurfaces-Integrated Artificial Neural Networks | | 0 |
| PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization | Code | 0 |
| CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | | 0 |
| Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | | 0 |
| Expanding-and-Shrinking Binary Neural Networks | Code | 0 |
| NoProp: Training Neural Networks without Back-propagation or Forward-propagation | Code | 1 |
| KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters | Code | 0 |
| Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification | Code | 0 |
| DC-SGD: Differentially Private SGD with Dynamic Clipping through Gradient Norm Distribution Estimation | | 0 |
| FairSAM: Fair Classification on Corrupted Data Through Sharpness-Aware Minimization | | 0 |
| Diffusion models applied to skin and oral cancer classification | | 0 |
| GmNet: Revisiting Gating Mechanisms From A Frequency View | | 0 |
| Data-Free Universal Attack by Exploiting the Intrinsic Vulnerability of Deep Models | Code | 0 |
| On Large Multimodal Models as Open-World Image Classifiers | Code | 1 |
| Neural Architecture Search by Learning a Hierarchical Search Space | | 0 |
| Improving (α, f)-Byzantine Resilience in Federated Learning via layerwise aggregation and cosine distance | Code | 0 |
| Retinal Fundus Multi-Disease Image Classification using Hybrid CNN-Transformer-Ensemble Architectures | Code | 0 |
| RBFleX-NAS: Training-Free Neural Architecture Search Using Radial Basis Function Kernel and Hyperparameter Detection | | 0 |
| TS-Inverse: A Gradient Inversion Attack Tailored for Federated Time Series Forecasting Models | Code | 0 |
| SAFE: Self-Adjustment Federated Learning Framework for Remote Sensing Collaborative Perception | | 0 |
| VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models | | 0 |
| LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text | Code | 1 |
| Extensions of regret-minimization algorithm for optimal design | | 0 |
| Face Spoofing Detection using Deep Learning | Code | 0 |
| Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | | 0 |
| Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification | Code | 0 |
| Exploring the Integration of Key-Value Attention Into Pure and Hybrid Transformers for Semantic Segmentation | | 0 |
| Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations | Code | 1 |
| Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry | | 0 |
| CoRLD: Contrastive Representation Learning Of Deformable Shapes In Images | Code | 0 |
| Leveraging Text-to-Image Generation for Handling Spurious Correlation | | 0 |
| PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification | Code | 0 |
| Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation | | 0 |
| Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning | Code | 2 |
| Test-Time Backdoor Detection for Object Detection Models | | 0 |
| ARC: Anchored Representation Clouds for High-Resolution INR Classification | Code | 0 |
| Graph-Weighted Contrastive Learning for Semi-Supervised Hyperspectral Image Classification | Code | 0 |
| Utilization of Neighbor Information for Image Classification with Different Levels of Supervision | | 0 |
| Effective Dimension Aware Fractional-Order Stochastic Gradient Descent for Convex Optimization Problems | | 0 |
| Neural Edge Histogram Descriptors for Underwater Acoustic Target Recognition | Code | 0 |
| GC-Fed: Gradient Centralized Federated Learning with Partial Client Participation | | 0 |
| Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy | Code | 0 |
| TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Code | 0 |
| Goal-Oriented Source Coding using LDPC Codes for Compressed-Domain Image Classification | | 0 |
| DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation | | 0 |
| Open-Set Plankton Recognition | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
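
For readers unfamiliar with the metric, Top 1 Accuracy is the fraction of test images whose highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch in plain Python, not the evaluation code behind any entry above. The function name `top1_accuracy`, the toy scores, and the toy labels are all hypothetical, and the conversion to a percentage assumes the benchmark values are reported as percentages.

```python
def top1_accuracy(scores, labels):
    """Return the fraction of samples whose arg-max class equals the true label.

    scores: list of per-class score lists, one inner list per image
    labels: list of integer class indices, one per image
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest score is the model's single (top-1) prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return correct / len(labels)


# Hypothetical toy data: 4 images, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1
]
toy_labels = [1, 0, 1, 1]

accuracy = top1_accuracy(toy_scores, toy_labels)
print(f"Top 1 Accuracy: {100 * accuracy:.1f}")  # prints "Top 1 Accuracy: 75.0"
```

Three of the four toy predictions match their labels, so the sketch reports 75.0 on a percentage scale.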