SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 301350 of 10419 papers

TitleStatusHype
Open-Set Plankton Recognition0
Falcon: A Remote Sensing Vision-Language Foundation ModelCode3
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images0
(, δ) Considered Harmful: Best Practices for Reporting Differential Privacy GuaranteesCode0
Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild0
Multiplicative Learning0
A Multi-Modal Federated Learning Framework for Remote Sensing Image Classification0
Interpretable Image Classification via Non-parametric Part Prototype LearningCode1
Learning Interpretable Logic Rules from Deep Vision Models0
Extreme Learning Machines for Attention-based Multiple Instance Learning in Whole-Slide Image Classification0
Discovering Influential Neuron Path in Vision Transformers0
ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias MitigationCode0
Bayesian Test-Time Adaptation for Vision-Language Models0
Probing Network Decisions: Capturing Uncertainties and Unveiling Vulnerabilities Without Label Information0
Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X0
Membership Inference Attacks fueled by Few-Short Learning to detect privacy leakage tackling data integrity0
Double-Stage Feature-Level Clustering-Based Mixture of Experts Framework0
Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State MatchingCode1
KAN-Mixers: a new deep learning architecture for image classification0
MsaMIL-Net: An End-to-End Multi-Scale Aware Multiple Instance Learning Network for Efficient Whole Slide Image Classification0
Tangentially Aligned Integrated Gradients for User-Friendly Explanations0
Measuring directional bias amplification in image captions using predictability0
A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding0
Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification0
Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization0
Keeping Representation Similarity in Finetuning for Medical Image Analysis0
Task Vector Quantization for Memory-Efficient Model MergingCode0
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification0
Distilling Knowledge into Quantum Vision Transformers for Biomedical Image Classification0
Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals0
M^3amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing ClassificationCode1
Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy0
Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction0
Data-Free Black-Box Federated Learning via Zeroth-Order Gradient Estimation0
AF-KAN: Activation Function-Based Kolmogorov-Arnold Networks for Efficient Representation Learning0
Pathological Prior-Guided Multiple Instance Learning For Mitigating Catastrophic Forgetting in Breast Cancer Whole Slide Image Classification0
Minion Gated Recurrent Unit for Continual Learning0
Randomized based restricted kernel machine for hyperspectral image classification0
XFMamba: Cross-Fusion Mamba for Multi-View Medical Image ClassificationCode1
Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques0
Sharpness-Aware Minimization: General Analysis and Improved RatesCode0
Measurement noise scaling laws for cellular representation learningCode0
Mathematical Foundation of Interpretable Equivariant Surrogate Models0
Label Ranker: Self-Aware Preference for Classification Label Position in Visual Masked Self-Supervised Pre-Trained ModelCode0
Mamba base PKD for efficient knowledge compression0
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting0
ViKANformer: Embedding Kolmogorov Arnold Networks in Vision Transformers for Pattern-Based Learning0
Visual-RFT: Visual Reinforcement Fine-TuningCode7
AMUN: Adversarial Machine UNlearning0
Delving into Out-of-Distribution Detection with Medical Vision-Language ModelsCode1
Show:102550
← PrevPage 7 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified