SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 34513475 of 10420 papers

TitleStatusHype
Building Vision Transformers with Hierarchy Aware Feature Aggregation0
Boosting Whole Slide Image Classification from the Perspectives of Distribution, Correlation and Magnification0
DIME-FM : DIstilling Multimodal and Efficient Foundation Models0
Growing a Brain with Sparsity-Inducing Generation for Continual LearningCode0
XiNet: Efficient Neural Networks for tinyML0
LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer NormalizationCode1
Scene-Aware Label Graph Learning for Multi-Label Image Classification0
Tiny Updater: Towards Efficient Neural Network-Driven Software UpdatingCode0
Adaptive Image Anonymization in the Context of Image Classification with Neural Networks0
Adaptive and Background-Aware Vision Transformer for Real-Time UAV TrackingCode1
Temporal-Coded Spiking Neural Networks with Dynamic Firing Threshold: Learning with Event-Driven Backpropagation0
Automated Knowledge Distillation via Monte Carlo Tree SearchCode0
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse RetrievalCode1
FCCNs: Fully Complex-valued Convolutional Networks using Complex-valued Color Model and Loss FunctionCode1
Self-supervised Pre-training for Mirror Detection0
Rethinking Fast Fourier Convolution in Image Inpainting0
Vision HGNN: An Image is More than a Graph of NodesCode1
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training0
Memory-Friendly Scalable Super-Resolution via Rewinding Lottery Ticket Hypothesis0
A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet CategoriesCode0
Bias-Eliminating Augmentation Learning for Debiased Federated Learning0
PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image ClassificationCode1
Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN ModelsCode0
Deep Factorized Metric LearningCode1
ViewNet: A Novel Projection-Based Backbone With View Pooling for Few-Shot Point Cloud ClassificationCode1
Show:102550
← PrevPage 139 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified