SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 53265350 of 10420 papers

TitleStatusHype
Tiered Pruning for Efficient Differentialble Inference-Aware Neural Architecture Search0
Robust Collaborative Learning with Linear Gradient OverheadCode0
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification0
PicT: A Slim Weakly Supervised Vision Transformer for Pavement Distress ClassificationCode0
Relaxed Attention for Transformer Models0
Frequency Dropout: Feature-Level Regularization via Randomized Filtering0
Fine-grained Classification of Solder Joints with α-skew Jensen-Shannon Divergence0
On the Adversarial Transferability of ConvMixer Models0
S^3R: Self-supervised Spectral Regression for Hyperspectral Histopathology Image Classification0
On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks0
Semantic Segmentation using Neural Ordinary Differential Equations0
Towards Bridging the Performance Gaps of Joint Energy-based ModelsCode0
Enhance the Visual Representation via Discrete Adversarial TrainingCode0
Deep tensor networks with matrix product operators0
A Mosquito is Worth 16x16 Larvae: Evaluation of Deep Learning Architectures for Mosquito Larvae ClassificationCode0
Continual Learning with Dependency Preserving Hypernetworks0
Continual Learning for Class- and Domain-Incremental Semantic Segmentation0
Confidence-Guided Data Augmentation for Improved Semi-Supervised Training0
Top-Tuning: a study on transfer learning for an efficient alternative to fine tuning for image classification with fast kernel methods0
A Continual Development Methodology for Large-scale Multitask Dynamic ML SystemsCode0
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition0
Medical Image Segmentation using LeViT-UNet++: A Case Study on GI Tract Data0
Deep Reinforcement Learning for Task Offloading in UAV-Aided Smart Farm Networks0
OmniVL:One Foundation Model for Image-Language and Video-Language Tasks0
PaLI: A Jointly-Scaled Multilingual Language-Image Model0
Show:102550
← PrevPage 214 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified