SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 28512900 of 10419 papers

TitleStatusHype
FairDD: Fair Dataset Distillation via Synchronized Matching0
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers0
ANDHRA Bandersnatch: Training Neural Networks to Predict Parallel RealitiesCode0
MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers0
Controlling Participation in Federated Learning with FeedbackCode0
Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data0
Pruning Deep Convolutional Neural Network Using Conditional Mutual Information0
KANs for Computer Vision: An Experimental Study0
Mixture of Experts in Image Classification: What's the Sweet Spot?0
Optimized Tradeoffs for Private Prediction with Majority Ensembling0
Fall Leaf Adversarial Attack on Traffic Sign Classification0
CoA: Chain-of-Action for Generative Semantic LabelsCode0
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models0
SpikeAtConv: An Integrated Spiking-Convolutional Attention Architecture for Energy-Efficient Neuromorphic Vision Processing0
BadScan: An Architectural Backdoor Attack on Visual State Space Models0
Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models0
Creating Scalable AGI: the Open General Intelligence Framework0
Twin Trigger Generative Networks for Backdoor Attacks against Object Detection0
MUNBa: Machine Unlearning via Nash BargainingCode0
PaRCE: Probabilistic and Reconstruction-based Competency Estimation for CNN-based Image ClassificationCode0
Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual AttacksCode0
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training0
LPLgrad: Optimizing Active Learning Through Gradient Norm Sample Selection and Auxiliary Model TrainingCode0
MEGL: Multimodal Explanation-Guided Learning0
Problem-dependent convergence bounds for randomized linear gradient compression0
Invariant Shape Representation Learning For Image ClassificationCode0
Self-Supervised Learning in Deep Networks: A Pathway to Robust Few-Shot Classification0
Exploring Emerging Trends and Research Opportunities in Visual Place Recognition0
Fair Distillation: Teaching Fairness from Biased Teachers in Medical Imaging0
Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning0
Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image0
Deep Feature Response Discriminative CalibrationCode0
Multi-perspective Contrastive Logit Distillation0
Hysteresis Activation Function for Efficient InferenceCode0
Evidential Federated Learning for Skin Lesion Image Classification0
Embedding Byzantine Fault Tolerance into Federated Learning via Virtual Data-Driven Consistency Scoring PluginCode0
Adapting the Biological SSVEP Response to Artificial Neural Networks0
On the Cost of Model-Serving Frameworks: An Experimental Evaluation0
Outliers resistant image classification by anomaly detection0
ResidualDroppath: Enhancing Feature Reuse over Residual Connections0
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering0
SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision TransformersCode0
Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing ImageryCode0
Efficient Whole Slide Image Classification through Fisher Vector Representation0
ScaleNet: Scale Invariance Learning in Directed GraphsCode0
Computed tomography using meta-optics0
Semantic segmentation on multi-resolution optical and microwave data using deep learning0
Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision0
Deep Active Learning in the Open World0
Exploring Structural Nonlinearity in Binary Polariton-Based Neuromorphic Architectures0
Show:102550
← PrevPage 58 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified