SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 23512400 of 10419 papers

TitleStatusHype
What do larger image classifiers memorise?0
Enhancing Representations through Heterogeneous Self-Supervised Learning0
Progressive Neural Compression for Adaptive Image Offloading under Timing ConstraintsCode1
Tight Certified Robustness via Min-Max Representations of ReLU Neural Networks0
Activate and Reject: Towards Safe Domain Generalization under Category Shift0
PriViT: Vision Transformers for Fast Private InferenceCode0
A privacy-preserving method using secret key for convolutional neural network-based speech classification0
TiC: Exploring Vision Transformer in ConvolutionCode1
Why Do We Need Weight Decay in Modern Deep Learning?Code1
PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification0
Improved Baselines with Visual Instruction TuningCode6
DED: Diagnostic Evidence Distillation for acne severity grading on face imagesCode0
PDR-CapsNet: an Energy-Efficient Parallel Approach to Dynamic Routing in Capsule Networks0
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models0
Comparative Analysis of Imbalanced Malware Byteplot Image Classification using Transfer Learning0
SemiReward: A General Reward Model for Semi-supervised LearningCode1
ViT-ReciproCAM: Gradient and Attention-Free Visual Explanations for Vision TransformerCode1
Heterogeneous Federated Learning Using Knowledge Codistillation0
Dynamic Shuffle: An Efficient Channel Mixture Method0
IBCL: Zero-shot Model Generation for Task Trade-offs in Continual LearningCode0
Neural architecture impact on identifying temporally extended Reinforcement Learning tasks0
Deformation-Invariant Neural Network and Its Applications in Distorted Image Restoration and Analysis0
Approximately Equivariant Quantum Neural Network for p4m Group Symmetries in Images0
Inductive biases of multi-task learning and finetuning: multiple regimes of feature reuseCode0
RoFormer for Position Aware Multiple Instance Learning in Whole Slide Image ClassificationCode0
Extending CAM-based XAI methods for Remote Sensing Imagery SegmentationCode1
Trainable Noise Model as an XAI evaluation method: application on Sobol for remote sensing image segmentationCode1
An evaluation of pre-trained models for feature extraction in image classification0
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense PredictionCode2
RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network0
A Framework for Inference Inspired by Human Memory MechanismsCode1
A Comprehensive Review of Generative AI in Healthcare0
Understanding the Robustness of Randomized Feature Defense Against Query-Based Adversarial AttacksCode0
Ajwa or Medjool: a binary balanced dataset to teach machine learning عجوة أو مجدول: مجموعة بيانات متوازنة الصنفين لتدريس تعلم الآلة‏Code0
Learnable Extended Activation Function (LEAF) for Deep Neural NetworksCode0
MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from Chest X-ray Images0
PRIME: Prioritizing Interpretability in Failure Mode Extraction0
SCoRe: Submodular Combinatorial Representation Learning0
Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative PretrainingCode0
ResBit: Residual Bit Vector for Categorical Values0
Tell Me a Story! Narrative-Driven XAI with Large Language ModelsCode0
Prototype Generation: Robust Feature Visualisation for Data Independent InterpretabilityCode0
RSAM: Learning on manifolds with Riemannian Sharpness-aware Minimization0
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering0
EWasteNet: A Two-Stream Data Efficient Image Transformer Approach for E-Waste Classification0
Ultra-low-power Image Classification on Neuromorphic HardwareCode0
Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling0
Logarithm-transform aided Gaussian Sampling for Few-Shot LearningCode0
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language ModelsCode0
Enhancing Sharpness-Aware Optimization Through Variance SuppressionCode1
Show:102550
← PrevPage 48 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified