SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When the classification becomes highly fine-grained or reaches the instance level, the task is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 4101-4150 of 10420 papers

Title | Status | Hype
Visual correspondence-based explanations improve AI robustness and human-AI team accuracy | Code | 1
SSIVD-Net: A Novel Salient Super Image Classification & Detection Technique for Weaponized Violence | Code | 1
Distribution Learning Based on Evolutionary Algorithm Assisted Deep Neural Networks for Imbalanced Image Classification | | 0
AMF: Adaptable Weighting Fusion with Multiple Fine-tuning for Image Classification | | 0
Adaptive occlusion sensitivity analysis for visually explaining video recognition networks | Code | 0
Few-shot Learning with Class-Covariance Metric for Hyperspectral Image Classification | Code | 1
Dynamic Channel Selection in Self-Supervised Learning | Code | 0
An Encryption Method of ConvMixer Models without Performance Degradation | | 0
Black-box Few-shot Knowledge Distillation | Code | 1
Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer | Code | 1
TransCL: Transformer Makes Strong and Flexible Compressive Learning | Code | 1
Robust Scene Inference under Noise-Blur Dual Corruptions | | 0
Spatial-Channel Token Distillation for Vision MLPs | Code | 0
SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling | | 0
Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition | Code | 1
Just Rotate it: Deploying Backdoor Attacks via Rotation Transformation | | 0
Do Perceptually Aligned Gradients Imply Adversarial Robustness? | Code | 0
Deep transfer learning method based on automatic domain alignment and moment matching | Code | 0
Efficient CNN Architecture Design Guided by Visualization | Code | 0
TinyViT: Fast Pretraining Distillation for Small Vision Transformers | | 0
Generating and Detecting True Ambiguity: A Forgotten Danger in DNN Supervision Testing | | 0
AutoDiCE: Fully Automated Distributed CNN Inference at the Edge | Code | 1
Latent Discriminant deterministic Uncertainty | Code | 1
FedDM: Iterative Distribution Matching for Communication-Efficient Federated Learning | | 0
EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching | Code | 0
Towards Accurate and Robust Classification in Continuously Transitioning Industrial Sprays with Mixup | | 0
Tailoring Self-Supervision for Supervised Learning | Code | 1
Rectifying Open-set Object Detection: A Taxonomy, Practical Applications, and Proper Evaluation | | 0
LR-Net: A Block-based Convolutional Neural Network for Low-Resolution Image Classification | Code | 1
Moment Centralization based Gradient Descent Optimizers for Convolutional Neural Networks | Code | 0
Vision Transformers: From Semantic Segmentation to Dense Prediction | Code | 3
Balanced Contrastive Learning for Long-Tailed Visual Recognition | Code | 1
Consistent Polyhedral Surrogates for Top-k Classification and Variants | | 0
Residual and Attentional Architectures for Vector-Symbols | | 0
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross Modulation | Code | 0
Multi-manifold Attention for Vision Transformers | | 0
Robustar: Interactive Toolbox Supporting Precise Data Annotation for Robust Vision Learning | Code | 1
Fully trainable Gaussian derivative convolutional layer | Code | 0
ViT-NeT: Interpretable Vision Transformers with Neural Tree Decoder | Code | 1
Zero-Shot Temporal Action Detection via Vision-Language Prompting | Code | 1
Improving Deep Neural Network Random Initialization Through Neuronal Rewiring | Code | 0
Generative Adversarial Networks Based on Transformer Encoder and Convolution Block for Hyperspectral Image Classification | | 0
Progress and limitations of deep networks to recognize objects in unusual poses | Code | 1
Sound Randomized Smoothing in Floating-Point Arithmetics | Code | 0
QSAN: A Near-term Achievable Quantum Self-Attention Network | | 0
Universal Adaptive Data Augmentation | | 0
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation | Code | 1
Learning Discriminative Representation via Metric Learning for Imbalanced Medical Image Classification | | 0
Provably Adversarially Robust Nearest Prototype Classifiers | Code | 0
Current Trends in Deep Learning for Earth Observation: An Open-source Benchmark Arena for Image Classification | Code | 2
Page 83 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
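
The benchmark rows report Top 1 Accuracy without saying how it is computed or on which dataset. As a rough illustration only, the sketch below uses the usual definition: the percentage of images whose highest-scoring class matches the true label. The function name `top1_accuracy` and the example scores and labels are hypothetical and are not taken from any of the listed models.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of examples whose top-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one example")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the class with the highest score (first one wins on ties).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes (illustrative data only).
example_scores = [
    [0.1, 0.7, 0.2],  # top class: 1
    [0.8, 0.1, 0.1],  # top class: 0
    [0.3, 0.3, 0.4],  # top class: 2
    [0.2, 0.5, 0.3],  # top class: 1
]
example_labels = [1, 0, 1, 1]

accuracy = top1_accuracy(example_scores, example_labels)
print(f"Top 1 Accuracy: {accuracy:.1f}")  # 3 of 4 correct -> 75.0
```

Under this definition a claimed value such as 90.98 would mean that 90.98% of the evaluation images received the correct label as their single top prediction.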