SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 3251–3300 of 10419 papers

Title | Status | Hype
Topology Optimization of Random Memristors for Input-Aware Dynamic SNN | Code | 0
Local Binary Pattern(LBP) Optimization for Feature Extraction | - | 0
Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment | - | 0
Content-driven Magnitude-Derivative Spectrum Complementary Learning for Hyperspectral Image Classification | - | 0
Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images | - | 0
Graph Neural Networks: A suitable Alternative to MLPs in Latent 3D Medical Image Classification? | Code | 0
Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Code | 0
Quanv4EO: Empowering Earth Observation by means of Quanvolutional Neural Networks | - | 0
Adaptive Gradient Regularization: A Faster and Generalizable Optimization Technique for Deep Neural Networks | - | 0
HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification | - | 0
S-E Pipeline: A Vision Transformer (ViT) based Resilient Classification Pipeline for Medical Imaging Against Adversarial Attacks | - | 0
Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging | Code | 0
Image Classification using Fuzzy Pooling in Convolutional Kolmogorov-Arnold Networks | - | 0
Improved Few-Shot Image Classification Through Multiple-Choice Questions | - | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | - | 0
Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Code | 0
Pavement Fatigue Crack Detection and Severity Classification Based on Convolutional Neural Network | - | 0
Learning deep illumination-robust features from multispectral filter array images | Code | 0
Beyond Size and Class Balance: Alpha as a New Dataset Quality Metric for Deep Learning | - | 0
FMDNN: A Fuzzy-guided Multi-granular Deep Neural Network for Histopathological Image Classification | Code | 0
HyperbolicLR: Epoch insensitive learning rate scheduler | Code | 0
Assessing Sample Quality via the Latent Space of Generative Models | Code | 0
Toward Efficient Convolutional Neural Networks With Structured Ternary Patterns | Code | 0
Subgraph Clustering and Atom Learning for Improved Image Classification | - | 0
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | - | 0
EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | - | 0
CoAPT: Context Attribute words for Prompt Tuning | - | 0
Nutrispace: A novel color space to enhance deep learning based early detection of cucurbits nutritional deficiency | - | 0
Addressing Imbalance for Class Incremental Learning in Medical Image Classification | - | 0
Differential Privacy Mechanisms in Neural Tangent Kernel Regression | - | 0
CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data | Code | 0
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | - | 0
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders | Code | 0
LookupViT: Compressing visual information to a limited number of tokens | - | 0
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Code | 0
Adaptive Cascading Network for Continual Test-Time Adaptation | Code | 0
Non-parametric regularization for class imbalance federated medical image classification | Code | 0
FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification | - | 0
Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification | - | 0
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification | - | 0
Siamese Transformer Networks for Few-shot Image Classification | - | 0
Generalized Coverage for More Robust Low-Budget Active Learning | - | 0
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | - | 0
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion | Code | 0
Anticipating Future Object Compositions without Forgetting | - | 0
Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | - | 0
Employing Sentence Space Embedding for Classification of Data Stream from Fake News Domain | Code | 0
Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | - | 0
Backdoor Attacks against Image-to-Image Networks | - | 0
GeoMix: Towards Geometry-Aware Data Augmentation | Code | 0
Page 66 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 | - | Unverified
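
The Top 1 Accuracy metric reported above is, in its usual sense, the fraction of images whose single highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that calculation in plain Python. The function name `top1_accuracy` and the example scores and labels are hypothetical illustrations, not taken from any listed model or from this site's evaluation code.

```python
def top1_accuracy(scores, labels):
    """Fraction of samples whose highest-scoring class equals the true label.

    scores: list of per-class score lists, one per image (hypothetical input format).
    labels: list of integer ground-truth class indices.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return correct / len(labels)


# Made-up scores for 4 images over 3 classes, for illustration only.
scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1 (true label is 0, so this one is wrong)
]
labels = [1, 0, 2, 0]

print(top1_accuracy(scores, labels) * 100)  # 75.0
```

Leaderboards typically report this value as a percentage, which is how a figure such as 90.2 would be read under this assumption.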