SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 4326-4350 of 10420 papers

| Title | Status | Hype |
|---|---|---|
| Delving into the Openness of CLIP | Code | 0 |
| Effects of Auxiliary Knowledge on Continual Learning | | 0 |
| Supernet Training for Federated Image Classification under System Heterogeneity | Code | 0 |
| Distributional loss for convolutional neural network regression and application to GNSS multi-path estimation | | 0 |
| Learning rich optical embeddings for privacy-preserving lensless image classification | | 0 |
| YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection | | 0 |
| A memory-efficient neural ODE framework based on high-level adjoint differentiation | Code | 1 |
| VL-BEiT: Generative Vision-Language Pretraining | | 0 |
| Prefix Conditioning Unifies Language and Label Supervision | | 0 |
| Leveraging Systematic Knowledge of 2D Transformations | | 0 |
| Optimizing Relevance Maps of Vision Transformers Improves Robustness | Code | 1 |
| CVM-Cervix: A Hybrid Cervical Pap-Smear Image Classification Framework Using CNN, Visual Transformer and Multilayer Perceptron | | 0 |
| Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset | | 0 |
| Analysis of Catastrophic Forgetting for Random Orthogonal Transformation Tasks in the Overparameterized Regime | | 0 |
| Federated Learning in Non-IID Settings Aided by Differentially Private Synthetic Data | Code | 0 |
| Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide Detection | | 0 |
| CLIP4IDC: CLIP for Image Difference Captioning | Code | 1 |
| Vision GNN: An Image is Worth Graph of Nodes | Code | 4 |
| Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction | Code | 1 |
| Star algorithm for NN ensembling | Code | 0 |
| Dataset Distillation using Neural Feature Regression | Code | 0 |
| Transformer with Fourier Integral Attentions | | 0 |
| Deep learning pipeline for image classification on mobile phones | | 0 |
| Asynchronous Hierarchical Federated Learning | | 0 |
| FHIST: A Benchmark for Few-shot Classification of Histological Images | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
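
The page does not define the Top 1 Accuracy metric reported above. Assuming the usual definition (the percentage of images whose highest-scoring predicted class matches the ground-truth label), a minimal sketch of the computation might look like the following. The function name `top1_accuracy` and the example scores and labels are hypothetical and serve only as illustration; they are not taken from any listed paper or benchmark.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label.

    Assumption: `scores[i][c]` is the model's score for class `c` on sample `i`,
    and `labels[i]` is the ground-truth class index for sample `i`.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical scores for 4 images over 3 classes (illustrative values only).
example_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
example_labels = [1, 0, 2, 2]

print(top1_accuracy(example_scores, example_labels))  # 75.0
```

In this toy example three of the four predictions match their labels, so the result is 75.0. The Claimed column above would hold the analogous figure computed over a full evaluation set.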