SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 3451–3500 of 10419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Building Vision Transformers with Hierarchy Aware Feature Aggregation | | 0 |
| Boosting Whole Slide Image Classification from the Perspectives of Distribution, Correlation and Magnification | | 0 |
| DIME-FM: DIstilling Multimodal and Efficient Foundation Models | | 0 |
| LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer Normalization | Code | 1 |
| Growing a Brain with Sparsity-Inducing Generation for Continual Learning | Code | 0 |
| Adaptive Image Anonymization in the Context of Image Classification with Neural Networks | | 0 |
| Tiny Updater: Towards Efficient Neural Network-Driven Software Updating | Code | 0 |
| LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval | Code | 1 |
| Adaptive and Background-Aware Vision Transformer for Real-Time UAV Tracking | Code | 1 |
| Personalized Semantics Excitation for Federated Image Classification | | 0 |
| Automated Knowledge Distillation via Monte Carlo Tree Search | Code | 0 |
| Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation | | 0 |
| XiNet: Efficient Neural Networks for tinyML | | 0 |
| Self-supervised Pre-training for Mirror Detection | | 0 |
| Scene-Aware Label Graph Learning for Multi-Label Image Classification | | 0 |
| Rethinking Fast Fourier Convolution in Image Inpainting | | 0 |
| Vision HGNN: An Image is More than a Graph of Nodes | Code | 1 |
| Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models | Code | 0 |
| Memory-Friendly Scalable Super-Resolution via Rewinding Lottery Ticket Hypothesis | | 0 |
| A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories | Code | 0 |
| Bias-Eliminating Augmentation Learning for Debiased Federated Learning | | 0 |
| PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification | Code | 1 |
| RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training | | 0 |
| Deep Factorized Metric Learning | Code | 1 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | | 0 |
| Evolved Part Masking for Self-Supervised Learning | | 0 |
| Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks | Code | 1 |
| DropKey for Vision Transformer | | 0 |
| Boundary Unlearning: Rapid Forgetting of Deep Networks via Shifting the Decision Boundary | | 0 |
| MDL-NAS: A Joint Multi-Domain Learning Framework for Vision Transformer | | 0 |
| DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction | Code | 1 |
| iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition | Code | 0 |
| Quantum-Inspired Spectral-Spatial Pyramid Network for Hyperspectral Image Classification | | 0 |
| ViewNet: A Novel Projection-Based Backbone With View Pooling for Few-Shot Point Cloud Classification | Code | 1 |
| AdaptiveMix: Improving GAN Training via Feature Space Shrinkage | Code | 1 |
| PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training | Code | 0 |
| Initialization Noise in Image Gradients and Saliency Maps | | 0 |
| Co-Training 2L Submodels for Visual Recognition | | 0 |
| ProD: Prompting-To-Disentangle Domain Knowledge for Cross-Domain Few-Shot Image Classification | | 0 |
| A General Regret Bound of Preconditioned Gradient Method for DNN Training | Code | 1 |
| Efficient On-device Training via Gradient Filtering | Code | 1 |
| GoogLe2Net: Going Transverse with Convolutions | | 0 |
| Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data | Code | 1 |
| Chest X-Ray Images Classification with CNN | Code | 0 |
| TransIFC: Invariant Cues-aware Feature Concentration Learning for Efficient Fine-grained Bird Image Classification | | 0 |
| DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning | | 0 |
| Machine Learning and Thermography Applied to the Detection and Classification of Cracks in Building | | 0 |
| Learning Multimodal Data Augmentation in Feature Space | Code | 1 |
| On Learning the Structure of Clusters in Graphs | | 0 |
| Thermal Heating in ReRAM Crossbar Arrays: Challenges and Solutions | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
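
The Top 1 Accuracy metric in these results is conventionally the percentage of test images whose highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that computation in plain Python. The function name `top1_accuracy` and the example scores and labels are hypothetical illustrations, not taken from any listed model or benchmark.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label.

    scores: one row of per-class scores (logits or probabilities) per image.
    labels: the ground-truth class index for each image.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical scores for four images over three classes (illustration only).
example_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
example_labels = [1, 0, 1, 1]

example_accuracy = top1_accuracy(example_scores, example_labels)
print(f"Top 1 Accuracy: {example_accuracy:.2f}")
```

In this toy example, three of the four predictions match their labels. Published numbers such as those in the table are computed over a full evaluation set. The evaluation protocol (image resolution, cropping, ensembling) can differ between entries, which is one reason a claimed value may remain unverified.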