SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
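
As a rough illustration of assigning one label to a whole image, the sketch below picks the class with the highest score. It assumes a model has already produced one score per class; the function name `classify` and the example labels are hypothetical and not taken from any listed paper.

```python
def classify(scores, labels):
    """Return the single label whose score is highest.

    `scores` holds one number per class for the whole image
    (hypothetical model output); `labels` holds the class names
    in the same order.
    """
    if len(scores) != len(labels) or not scores:
        raise ValueError("scores and labels must be non-empty and the same length")
    # Index of the largest score, i.e. the predicted class.
    best = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best]


# Example with made-up scores for three made-up classes.
predicted = classify([0.1, 0.7, 0.2], ["cat", "dog", "bird"])
```

Object detection, by contrast, would return several labels together with their locations rather than one label per image.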

Papers

Showing 1851-1900 of 10419 papers

Title | Status | Hype
Delving into Out-of-Distribution Detection with Medical Vision-Language Models | Code | 1
Rotate to Attend: Convolutional Triplet Attention Module | Code | 1
Dendritic Learning-incorporated Vision Transformer for Image Recognition | Code | 1
Depth Uncertainty in Neural Networks | Code | 1
Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes | Code | 1
Shredder: Learning Noise Distributions to Protect Inference Privacy | Code | 1
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language | Code | 1
A Fuzzy Rank-based Ensemble of CNN Models for Classification of Cervical Cytology | Code | 1
DeiT III: Revenge of the ViT | Code | 1
CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Code | 1
SageMix: Saliency-Guided Mixup for Point Clouds | Code | 1
Saliency-Regularized Deep Multi-Task Learning | Code | 1
FocusNet: Classifying Better by Focusing on Confusing Classes | Code | 1
Sample Prior Guided Robust Model Learning to Suppress Noisy Labels | Code | 1
DeepVoxNet2: Yet another CNN framework | Code | 1
Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels | Code | 1
A fuzzy distance-based ensemble of deep models for cervical cancer detection | Code | 1
CLR: Channel-wise Lightweight Reprogramming for Continual Learning | Code | 1
Defending Against Unforeseen Failure Modes with Latent Adversarial Training | Code | 1
DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | Code | 1
UniUSNet: A Promptable Framework for Universal Ultrasound Disease Prediction and Tissue Segmentation | Code | 1
Deep Transferring Quantization | Code | 1
Deep Unlearning: Fast and Efficient Gradient-free Approach to Class Forgetting | Code | 1
Deep Subdomain Adaptation Network for Image Classification | Code | 1
A Fully Tensorized Recurrent Neural Network | Code | 1
Deep Transfer Learning for Land Use and Land Cover Classification: A Comparative Study | Code | 1
Scaling Vision with Sparse Mixture of Experts | Code | 1
Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent | Code | 1
DeepViT: Towards Deeper Vision Transformer | Code | 1
SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Code | 1
Searching for Low-Bit Weights in Quantized Neural Networks | Code | 1
CMW-Net: Learning a Class-Aware Sample Weighting Mapping for Robust Deep Learning | Code | 1
Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification | Code | 1
Danish Fungi 2020 -- Not Just Another Image Recognition Dataset | Code | 1
Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation | Code | 1
Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Code | 1
Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification | Code | 1
Self-Knowledge Distillation with Progressive Refinement of Targets | Code | 1
Self Pre-training with Masked Autoencoders for Medical Image Classification and Segmentation | Code | 1
PeCLR: Self-Supervised 3D Hand Pose Estimation from monocular RGB via Equivariant Contrastive Learning | Code | 1
DarkneTZ: Towards Model Privacy at the Edge using Trusted Execution Environments | Code | 1
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters | Code | 1
Self-Supervised Learning for Large-Scale Unsupervised Image Clustering | Code | 1
Self-supervised Learning for Sonar Image Classification | Code | 1
DARTS: Differentiable Architecture Search | Code | 1
Self-training with Noisy Student improves ImageNet classification | Code | 1
Semantic-Aware Dual Contrastive Learning for Multi-label Image Classification | Code | 1
Dual-Perspective Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels | Code | 1
Semantic Generative Augmentations for Few-Shot Counting | Code | 1
Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians | Code | 1
Page 38 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 |  | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 |  | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 |  | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 |  | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 |  | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 |  | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 |  | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 |  | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 |  | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 |  | Unverified
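
For readers unfamiliar with the metric, here is a minimal sketch of how a Top 1 Accuracy figure is commonly computed. It assumes the usual definition (the percentage of images whose single highest-scoring predicted class matches the ground-truth label) and that the values above are percentages; the page itself does not state either. The function name `top1_accuracy` and the sample data are hypothetical.

```python
def top1_accuracy(predicted_labels, true_labels):
    """Percentage of samples whose top prediction equals the true label.

    Both arguments are equal-length sequences of class identifiers;
    `predicted_labels` holds only the highest-scoring class per image.
    """
    if len(predicted_labels) != len(true_labels) or not true_labels:
        raise ValueError("inputs must be non-empty and the same length")
    # Count positions where the top prediction matches the ground truth.
    correct = sum(p == t for p, t in zip(predicted_labels, true_labels))
    return 100.0 * correct / len(true_labels)


# Made-up example: three of four predictions match the true labels.
accuracy = top1_accuracy([0, 1, 2, 2], [0, 1, 1, 2])
```

Under this definition, a claimed value of 91 would mean that 91 out of every 100 evaluation images received the correct top prediction.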