SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 31763200 of 10420 papers

TitleStatusHype
Exploiting LMM-based knowledge for image classification tasks0
Blockout: Dynamic Model Selection for Hierarchical Deep Networks0
Does Haze Removal Help CNN-based Image Classification?0
Exploiting Local Features from Deep Networks for Image Retrieval0
Dirichlet-based Histogram Feature Transform for Image Classification0
ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers0
Does Robustness on ImageNet Transfer to Downstream Tasks?0
Blockchain Enabled Trustless API Marketplace0
Does Visual Pretraining Help End-to-End Reasoning?0
Blink: Fast and Generic Collectives for Distributed ML0
Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks0
Exploiting Kernel Compression on BNNs0
Exploiting Nontrivial Connectivity for Automatic Speech Recognition0
Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs0
Domain2Vec: Deep Domain Generalization0
Domain Adaptation and Image Classification via Deep Conditional Adaptation Network0
Exploiting Category Names for Few-Shot Classification with Vision-Language Models0
An Efficient and Small Convolutional Neural Network for Pest Recognition -- ExquisiteNet0
Domain Adaptive Monocular Depth Estimation With Semantic Information0
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models0
Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers0
Domain Adaptive Transfer Learning on Visual Attention Aware Data Augmentation for Fine-grained Visual Categorization0
Explicitly Modeled Attention Maps for Image Classification0
Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models0
Directing DNNs Attention for Facial Attribution Classification using Gradient-weighted Class Activation Mapping0
Show:102550
← PrevPage 128 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified