SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 6151–6200 of 10420 papers

Title | Status | Hype
Exploiting Invariance in Training Deep Neural Networks | Code | 0
DAP: Detection-Aware Pre-training with Weak Supervision | Code | 0
MT3: Meta Test-Time Training for Self-Supervised Test-Time Adaption | Code | 1
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition | Code | 1
Model-Contrastive Federated Learning | Code | 1
Learning Representational Invariances for Data-Efficient Action Recognition | Code | 1
Rethinking Spatial Dimensions of Vision Transformers | Code | 1
Automating Defense Against Adversarial Attacks: Discovery of Vulnerabilities and Application of Multi-INT Imagery to Protect Deployed Models | | 0
Data Augmentation in a Hybrid Approach for Aspect-Based Sentiment Analysis | Code | 0
"Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us? | | 0
Selective Output Smoothing Regularization: Regularize Neural Networks by Softening Output Distributions | | 0
Rethinking Neural Operations for Diverse Tasks | Code | 1
Classifying Video based on Automatic Content Detection Overview | | 0
ViViT: A Video Vision Transformer | Code | 1
Capsule Network is Not More Robust than Convolutional Network | | 0
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding | Code | 1
CvT: Introducing Convolutions to Vision Transformers | Code | 1
[Re] Rigging the Lottery: Making All Tickets Winners | Code | 1
Explaining Representation by Mutual Information | | 0
BA^2M: A Batch Aware Attention Module for Image Classification | | 0
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking | Code | 1
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking | Code | 1
On the benefits of robust models in modulation recognition | | 0
Going Deeper Into Face Detection: A Survey | | 0
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification | Code | 1
Explore the Knowledge contained in Network Weights to Obtain Sparse Neural Networks | | 0
Understanding Robustness of Transformers for Image Classification | | 0
Distilling Object Detectors via Decoupled Features | Code | 1
MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning | Code | 1
Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification | | 0
Unsupervised Robust Domain Adaptation without Source Data | | 0
A Comprehensive Review of Image Analysis Methods for Microorganism Counting: From Classical Image Processing to Deep Learning Approaches | | 0
ECINN: Efficient Counterfactuals from Invertible Neural Networks | Code | 0
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Code | 2
Spatial-spectral Hyperspectral Image Classification via Multiple Random Anchor Graphs Ensemble Learning | | 0
Preserve, Promote, or Attack? GNN Explanation via Topology Perturbation | | 0
Self-Supervised Training Enhances Online Continual Learning | | 0
Contrast to Divide: Self-Supervised Pre-Training for Learning with Noisy Labels | Code | 1
Diverse Branch Block: Building a Convolution as an Inception-like Unit | Code | 1
Factors of Influence for Transfer Learning across Diverse Appearance Domains and Task Types | | 0
W2WNet: a two-module probabilistic Convolutional Neural Network with embedded data cleansing functionality | | 0
AutoMix: Unveiling the Power of Mixup for Stronger Classifiers | Code | 1
EPRNet: Efficient Pyramid Representation Network for Real-Time Street Scene Segmentation | Code | 0
Enhanced Gradient for Differentiable Architecture Search | | 0
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search | Code | 1
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition | Code | 1
Scaling Local Self-Attention for Parameter Efficient Visual Backbones | Code | 1
Characterizing and Improving the Robustness of Self-Supervised Learning through Background Augmentations | | 0
DeepViT: Towards Deeper Vision Transformer | Code | 1
Deep Neural Networks Learn Meta-Structures from Noisy Labels in Semantic Segmentation | | 0
Page 124 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
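
The leaderboard reports Top 1 Accuracy without saying how it is computed. Under the usual definition, it is the percentage of images whose highest-scoring predicted class matches the ground-truth label. The minimal Python sketch below illustrates that definition on toy data. The function name `top1_accuracy`, the input layout (one row of class scores per image) and the percentage scale are assumptions made for illustration, not details taken from this page or from any listed model.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose top-scoring class equals the label.

    `scores` holds one row of per-class scores per image (hypothetical layout);
    `labels` holds the ground-truth class index for each image.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data: 4 images, 3 classes (made-up scores, not benchmark results).
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
labels = [1, 0, 1, 1]

accuracy = top1_accuracy(scores, labels)
print(accuracy)  # 75.0 (3 of 4 predictions match)
```

Under this reading, a claimed value such as 90.98 would mean that about 91 of every 100 evaluation images received the correct top prediction.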