SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 5901-5950 of 10420 papers

Title | Status | Hype
How Out-of-Distribution Detection Learning Theory Enhances Transformer: Learnability and Reliability | | 0
Coverage Testing of Deep Learning Models using Dataset Characterization | | 0
Measuring Unintended Memorisation of Unique Private Features in Neural Networks | | 0
flexgrid2vec: Learning Efficient Visual Representations Vectors | | 0
A Hybrid Differential Evolution Approach to Designing Deep Convolutional Neural Networks for Image Classification | | 0
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding | | 0
Diabetic retinopathy image classification method based on GreenBen data augmentation | | 0
Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification | | 0
G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups | | 0
Leveraging Chat-Based Large Vision Language Models for Multimodal Out-Of-Context Detection | | 0
Measuring the Effect of Causal Disentanglement on the Adversarial Robustness of Neural Network Models | | 0
Measuring the Interpretability of Unsupervised Representations via Quantized Reversed Probing | | 0
Coupling Visual Semantics of Artificial Neural Networks and Human Brain Function via Synchronized Activations | | 0
Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | | 0
Leveraging counterfactual concepts for debugging and improving CNN model performance | | 0
Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks | | 0
Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis | | 0
Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification | | 0
Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation | | 0
GreedyNAS: Towards Fast One-Shot NAS with Greedy Supernet | | 0
Diagnosis of Alzheimer's Disease via Multi-modality 3D Convolutional Neural Network | | 0
Leveraging Internal Representations of Model for Magnetic Image Classification | | 0
Measuring the Effectiveness of Self-Supervised Learning using Calibrated Learning Curves | | 0
Measuring the Success of Diffusion Models at Imitating Human Artists | | 0
Leveraging Mid-Level Deep Representations For Predicting Face Attributes in the Wild | | 0
Diagnosis of Skin Cancer Using VGG16 and VGG19 Based Transfer Learning Models | | 0
Text Descriptions are Compressive and Invariant Representations for Visual Learning | | 0
Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks | | 0
Leveraging Semi-Supervised Learning to Enhance Data Mining for Image Classification under Limited Labeled Data | | 0
Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks | | 0
Leveraging Superfluous Information in Contrastive Representation Learning | | 0
Leveraging Systematic Knowledge of 2D Transformations | | 0
MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention | | 0
Break a Lag: Triple Exponential Moving Average for Enhanced Optimization | | 0
MDL-NAS: A Joint Multi-Domain Learning Framework for Vision Transformer | | 0
Attack Agnostic Statistical Method for Adversarial Detection | | 0
Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification | | 0
LEVIS: Large Exact Verifiable Input Spaces for Neural Networks | | 0
GRASP: A Rehearsal Policy for Efficient Online Continual Learning | | 0
Coupled End-to-End Transfer Learning With Generalized Fisher Information | | 0
Measuring directional bias amplification in image captions using predictability | | 0
AT-SNN: Adaptive Tokens for Vision Transformer on Spiking Neural Network | | 0
GraphViz2Vec: A Structure-aware Feature Generation Model to Improve Classification in GNNs | | 0
DIET-SNN: Direct Input Encoding With Leakage and Threshold Optimization in Deep Spiking Neural Networks | | 0
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks | | 0
Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups | | 0
A Hybrid Architecture for On-Device Compressive Machine Learning | | 0
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification | | 0
Graph Structural Aggregation for Explainable Learning | | 0
Graphs for deep learning representations | | 0
Page 119 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
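
The Top 1 Accuracy metric reported above is conventionally the percentage of images whose single highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that computation in plain Python. It is not the evaluation code behind any result listed here, and the function name `top1_accuracy` and the example scores and labels are hypothetical, invented purely for illustration.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of examples whose highest-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one example is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the predicted class (first index wins ties).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical per-class scores for four images over three classes (made-up values).
example_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
example_labels = [1, 0, 1, 1]

# Predictions are 1, 0, 2, 1, so three of four are correct.
print(top1_accuracy(example_scores, example_labels))  # prints 75.0
```

In this sketch ties go to the lowest class index; real evaluation scripts may differ in tie handling and in preprocessing details such as input resolution and cropping, which is one reason claimed scores are tracked separately from verified ones.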