SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 32763300 of 10420 papers

TitleStatusHype
A2S-NAS: Asymmetric Spectral-Spatial Neural Architecture Search For Hyperspectral Image Classification0
StudyFormer : Attention-Based and Dynamic Multi View Classifier for X-ray images0
Real-Time Damage Detection in Fiber Lifting Ropes Using Lightweight Convolutional Neural NetworksCode0
A Gradient Boosting Approach for Training Convolutional and Deep Neural NetworksCode0
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia EntitiesCode1
DISCO: Distributed Inference with Sparse Communications0
Stress and Adaptation: Applying Anna Karenina Principle in Deep Learning for Image Classification0
Deep Active Learning in the Presence of Label Noise: A Survey0
Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers0
Analysis of Real-Time Hostile Activitiy Detection from Spatiotemporal Features Using Time Distributed Deep CNNs, RNNs and Attention-Based Mechanisms0
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge ComputingCode1
Model-based feature selection for neural networks: A mixed-integer programming approach0
CMVAE: Causal Meta VAE for Unsupervised Meta-LearningCode0
Domain-Specific Pre-training Improves Confidence in Whole Slide Image ClassificationCode0
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization0
MedViT: A Robust Vision Transformer for Generalized Medical Image ClassificationCode2
Deep Selector-JPEG: Adaptive JPEG Image Compression for Computer Vision in Image classification with Human Vision Criteria0
Gradient-based Wang-Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space0
Random Padding Data Augmentation0
GPT4MIA: Utilizing Generative Pre-trained Transformer (GPT-3) as A Plug-and-Play Transductive Model for Medical Image Analysis0
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers0
Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image ClassificationCode1
THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic CompressionCode1
Fossil Image Identification using Deep Learning Ensembles of Data Augmented MultiviewsCode0
Efficiency 360: Efficient Vision TransformersCode1
Show:102550
← PrevPage 132 of 417Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified