SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.
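As a minimal sketch of what "one label for the whole image" means in practice, the snippet below picks the highest-scoring class from a list of per-class scores. The function name `classify`, the label names, and the score values are hypothetical and purely illustrative; a real system would obtain the scores from a trained model.

```python
def classify(scores, labels):
    """Return the single label with the highest score for one image."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    # Index of the largest score; this is the predicted class for the image.
    best_index = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best_index]


# Hypothetical per-class scores for one image (illustrative values only).
labels = ["cat", "dog", "car"]
scores = [0.1, 0.7, 0.2]
print(classify(scores, labels))  # prints "dog"
```

Object detection would instead return several labels, each with a location, which is the distinction the paragraph above draws.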

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1-50 of 10419 papers

Title | Status | Hype
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations | - | 0
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy | - | 0
Federated Learning for Commercial Image Sources | - | 0
MUPAX: Multidimensional Problem Agnostic eXplainable AI | - | 0
Adversarial attacks to image classification systems using evolutionary algorithms | - | 0
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking | Code | 0
Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks | - | 0
FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise | - | 0
ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation | Code | 0
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks | - | 0
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning | - | 0
Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization | - | 0
SoftReMish: A Novel Activation Function for Enhanced Convolutional Neural Networks for Visual Recognition Performance | - | 0
Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic | - | 0
MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone Architecture | Code | 0
Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor | Code | 0
Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics | Code | 1
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference | Code | 0
Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models | Code | 0
FR-CapsNet: Enhancing Low-Resolution Image Classification via Frequency Routed Capsules | Code | 0
Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image Classification | Code | 0
Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networks | Code | 0
Counterfactual Influence as a Distributional Quantity | - | 0
Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding | - | 0
Disentangled representations of microscopy images | Code | 0
Iterative Quantum Feature Maps | - | 0
One Prototype Is Enough: Single-Prototype Activation for Interpretable Image Classification | Code | 0
SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification | - | 0
Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages | - | 0
DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation | - | 0
Robust Training with Data Augmentation for Medical Imaging Classification | - | 0
Efficient Transformations in Deep Learning Convolutional Neural Networks | - | 0
FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | - | 0
Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference | - | 0
J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor | - | 0
DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification | - | 0
Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning | - | 0
Compositional Attribute Imbalance in Vision Datasets | - | 0
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification | - | 0
Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks An Architecture Optimization Approach | - | 0
SeqPE: Transformer with Sequential Position Encoding | Code | 1
Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Code | 0
Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | - | 0
Cross-architecture universal feature coding via distribution alignment | - | 0
OscNet v1.5: Energy Efficient Hopfield Network on CMOS Oscillators for Image Classification | Code | 0
Graph Semi-Supervised Learning for Point Classification on Data Manifolds | - | 0
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging | - | 0
PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis | Code | 0
Boosting Adversarial Transferability for Hyperspectral Image Classification Using 3D Structure-invariant Transformation and Intermediate Feature Distance | - | 0
SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 | - | Unverified
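
The metric in the table above, Top 1 Accuracy, is the percentage of images whose highest-scoring predicted class matches the true class. A minimal sketch of that calculation follows; the function name `top1_accuracy` and the toy scores are hypothetical and are not taken from any of the listed models or their evaluation code.

```python
def top1_accuracy(score_rows, true_labels):
    """Percentage of images whose top-scoring class equals the true class."""
    if not true_labels or len(score_rows) != len(true_labels):
        raise ValueError("need one non-empty score row per true label")
    correct = 0
    for scores, truth in zip(score_rows, true_labels):
        # The prediction is the index of the largest score for this image.
        predicted = max(range(len(scores)), key=lambda i: scores[i])
        if predicted == truth:
            correct += 1
    return 100.0 * correct / len(true_labels)


# Toy example (illustrative values only): 4 images, 3 classes.
score_rows = [
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.2, 0.5, 0.3],  # predicts class 1
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.6, 0.3, 0.1],  # predicts class 0
]
true_labels = [0, 1, 2, 1]  # the last image is misclassified
print(top1_accuracy(score_rows, true_labels))  # prints 75.0
```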