SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that assigns a single label to an image as a whole. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with images that contain a single object. When classification becomes very fine-grained or reaches the instance level, it is often treated as image retrieval, which also involves finding similar images in a large database.
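As a rough illustration of "one label for the whole image", the sketch below turns a vector of per-class scores into a single predicted label. The class names and score values are hypothetical, and the model that would produce such scores is out of scope here; this only shows the final decision step under those assumptions.

```python
import math
from typing import Sequence


def softmax(logits: Sequence[float]) -> list[float]:
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(logits: Sequence[float], class_names: Sequence[str]) -> tuple[str, float]:
    """Return the single most likely label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return class_names[best], probs[best]


# Hypothetical class names and scores for one image (not from any real model).
CLASS_NAMES = ["cat", "dog", "car"]
label, confidence = classify([2.0, 0.5, -1.0], CLASS_NAMES)
print(label, round(confidence, 3))
```

An object detector, by contrast, would return a list of (label, bounding box) pairs rather than one label per image.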

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 3001-3050 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| Multistage Relation Network With Dual-Metric for Few-Shot Hyperspectral Image Classification | Code | 1 |
| MMViT: Multiscale Multiview Vision Transformers | - | 0 |
| Deep Fast Vision: Accelerated Deep Transfer Learning Vision Prototyping and Beyond | Code | 1 |
| ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot Learning | Code | 1 |
| From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping | Code | 1 |
| PVP: Pre-trained Visual Parameter-Efficient Tuning | - | 0 |
| Tensor Decomposition for Model Reduction in Neural Networks: A Review | - | 0 |
| Advancing Ischemic Stroke Diagnosis: A Novel Two-Stage Approach for Blood Clot Origin Identification | - | 0 |
| Sample-Specific Debiasing for Better Image-Text Models | - | 0 |
| iMixer: hierarchical Hopfield network implies an invertible, implicit and iterative MLP-Mixer | Code | 0 |
| Bayesian Optimization Meets Self-Distillation | Code | 1 |
| Evaluating Adversarial Robustness on Document Image Classification | - | 0 |
| Graph Convolutional Networks based on Manifold Learning for Semi-Supervised Image Classification | - | 0 |
| AwesomeMeta+: A Mixed-Prototyping Meta-Learning System Supporting AI Application Design Anywhere | Code | 1 |
| Now You See Me: Robust approach to Partial Occlusions | - | 0 |
| Function-Consistent Feature Distillation | Code | 1 |
| MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer | Code | 1 |
| Improving Classification Neural Networks by using Absolute activation function (MNIST/LeNET-5 example) | Code | 0 |
| The Case for Hierarchical Deep Learning Inference at the Network Edge | - | 0 |
| Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification | - | 0 |
| SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models | - | 0 |
| Learning Partial Correlation based Deep Visual Representation for Image Classification | Code | 1 |
| Exploiting Patch Sizes and Resolutions for Multi-Scale Deep Learning in Mammogram Image Classification | - | 0 |
| WATT-EffNet: A Lightweight and Accurate Model for Classifying Aerial Disaster Images | Code | 0 |
| DeformableFormer: Classification of Endoscopic Ultrasound Guided Fine Needle Biopsy in Pancreatic Diseases | - | 0 |
| Hyperbolic Geometry in Computer Vision: A Survey | - | 0 |
| Picking Up Quantization Steps for Compressed Image Classification | Code | 0 |
| Graph based Label Enhancement for Multi-instance Multi-label learning | - | 0 |
| Learning Self-Supervised Representations for Label Efficient Cross-Domain Knowledge Transfer on Diabetic Retinopathy Fundus Images | Code | 0 |
| Multi-domain learning CNN model for microscopy image classification | - | 0 |
| Learning Bottleneck Concepts in Image Classification | Code | 1 |
| Backpropagation-free Training of Deep Physical Neural Networks | - | 0 |
| Get Rid Of Your Trail: Remotely Erasing Backdoors in Federated Learning | - | 0 |
| A baseline on continual learning methods for video action recognition | - | 0 |
| Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning? | - | 0 |
| Angle based dynamic learning rate for gradient descent | Code | 0 |
| DCN-T: Dual Context Network with Transformer for Hyperspectral Image Classification | Code | 1 |
| Baybayin Character Instance Detection | - | 0 |
| ContraCluster: Learning to Classify without Labels by Contrastive Self-Supervision and Prototype-Based Semi-Supervision | - | 0 |
| Quantum machine learning for image classification | - | 0 |
| Hyperbolic Image-Text Representations | Code | 1 |
| Performance of GAN-based augmentation for deep learning COVID-19 image classification | Code | 0 |
| Do humans and machines have the same eyes? Human-machine perceptual differences on image classification | - | 0 |
| Visual Instruction Tuning | Code | 6 |
| OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images | - | 0 |
| Self-Supervised Learning from Non-Object Centric Images with a Geometric Transformation Sensitive Architecture | Code | 0 |
| Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization | Code | 0 |
| A Survey on Few-Shot Class-Incremental Learning | - | 0 |
| Chain of Thought Prompt Tuning in Vision Language Models | - | 0 |
| Autoencoders with Intrinsic Dimension Constraints for Learning Low Dimensional Image Representations | - | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified |
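
Top 1 Accuracy, the metric reported in the benchmark results, counts a prediction as correct only when the highest-scoring class matches the ground-truth label. A minimal sketch of that computation follows; the function name and the toy scores are illustrative assumptions and are not taken from any benchmark's evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class equals the true label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        predicted = max(range(len(row)), key=row.__getitem__)  # argmax over classes
        correct += int(predicted == label)
    return 100.0 * correct / len(labels)


# Toy example: 4 images, 3 classes (hypothetical scores).
toy_scores = [
    [0.9, 0.05, 0.05],  # predicts class 0
    [0.1, 0.7, 0.2],    # predicts class 1
    [0.3, 0.3, 0.4],    # predicts class 2
    [0.6, 0.3, 0.1],    # predicts class 0, true label is 1
]
toy_labels = [0, 1, 2, 1]
print(top1_accuracy(toy_scores, toy_labels))  # 75.0
```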