SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 125 of 10419 papers

TitleStatusHype
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations0
MUPAX: Multidimensional Problem Agnostic eXplainable AI0
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy0
Federated Learning for Commercial Image Sources0
Adversarial attacks to image classification systems using evolutionary algorithms0
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network WatermarkingCode0
Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks0
FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise0
ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark EvaluationCode0
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks0
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning0
SoftReMish: A Novel Activation Function for Enhanced Convolutional Neural Networks for Visual Recognition Performance0
Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization0
Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic0
MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone ArchitectureCode0
Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual DescriptorCode0
Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and PhysicsCode1
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic InferenceCode0
Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language ModelsCode0
FR-CapsNet: Enhancing Low-Resolution Image Classification via Frequency Routed CapsulesCode0
Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networksCode0
Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image ClassificationCode0
Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding0
Counterfactual Influence as a Distributional Quantity0
Disentangled representations of microscopy imagesCode0
Show:102550
← PrevPage 1 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified