SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 22512300 of 10419 papers

TitleStatusHype
Explaining and Harnessing Adversarial ExamplesCode1
Unsupervised Domain Adaptation by BackpropagationCode1
Going Deeper with ConvolutionsCode1
Very Deep Convolutional Networks for Large-Scale Image RecognitionCode1
ImageNet Large Scale Visual Recognition ChallengeCode1
Recurrent Models of Visual AttentionCode1
OverFeat: Integrated Recognition, Localization and Detection using Convolutional NetworksCode1
Improving neural networks by preventing co-adaptation of feature detectorsCode1
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations0
Adversarial attacks to image classification systems using evolutionary algorithms0
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy0
Federated Learning for Commercial Image Sources0
MUPAX: Multidimensional Problem Agnostic eXplainable AI0
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network WatermarkingCode0
Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks0
FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise0
ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark EvaluationCode0
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks0
GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning0
Model-free Optical Processors using In Situ Reinforcement Learning with Proximal Policy Optimization0
SoftReMish: A Novel Activation Function for Enhanced Convolutional Neural Networks for Visual Recognition Performance0
Transferring Visual Explainability of Self-Explaining Models through Task Arithmetic0
MVNet: Hyperspectral Remote Sensing Image Classification Based on Hybrid Mamba-Transformer Vision Backbone ArchitectureCode0
Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual DescriptorCode0
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic InferenceCode0
Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language ModelsCode0
Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding0
Hierarchical Mask-Enhanced Dual Reconstruction Network for Few-Shot Fine-Grained Image ClassificationCode0
FR-CapsNet: Enhancing Low-Resolution Image Classification via Frequency Routed CapsulesCode0
Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networksCode0
Disentangled representations of microscopy imagesCode0
Counterfactual Influence as a Distributional Quantity0
Iterative Quantum Feature Maps0
One Prototype Is Enough: Single-Prototype Activation for Interpretable Image ClassificationCode0
SIM-Net: A Multimodal Fusion Network Using Inferred 3D Object Shape Point Clouds from RGB Images for 2D Classification0
Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages0
DRO-Augment Framework: Robustness by Synergizing Wasserstein Distributionally Robust Optimization and Data Augmentation0
Robust Training with Data Augmentation for Medical Imaging Classification0
Efficient Transformations in Deep Learning Convolutional Neural Networks0
J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor0
Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference0
FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation0
Compositional Attribute Imbalance in Vision Datasets0
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification0
DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification0
Train Once, Forget Precisely: Anchored Optimization for Efficient Post-Hoc Unlearning0
Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks An Architecture Optimization Approach0
Evaluating Cell Type Inference in Vision Language Models Under Varying Visual ContextCode0
Cross-architecture universal feature coding via distribution alignment0
Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs0
Show:102550
← PrevPage 46 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified