SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 801850 of 10419 papers

TitleStatusHype
Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image ClassificationCode1
Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz NetworksCode1
Category Query Learning for Human-Object Interaction ClassificationCode1
Automated Relational Meta-learningCode1
ESPT: A Self-Supervised Episodic Spatial Pretext Task for Improving Few-Shot LearningCode1
EVA-CLIP: Improved Training Techniques for CLIP at ScaleCode1
A New Semi-supervised Learning Benchmark for Classifying View and Diagnosing Aortic Stenosis from EchocardiogramsCode1
TransCenter: Transformers with Dense Representations for Multiple-Object TrackingCode1
Causal Transportability for Visual RecognitionCode1
Evaluation of Deep Neural Network Domain Adaptation Techniques for Image RecognitionCode1
Evolutionary Neural AutoML for Deep LearningCode1
Evolving Attention with Residual ConvolutionsCode1
Can An Image Classifier Suffice For Action Recognition?Code1
Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision TransformerCode1
Deep Unlearning: Fast and Efficient Gradient-free Approach to Class ForgettingCode1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot InteractionCode1
All you need is a good initCode1
3D U^2-Net: A 3D Universal U-Net for Multi-Domain Medical Image SegmentationCode1
EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep learning representations with expert knowledge graphs: the MonuMAI cultural heritage use caseCode1
Explaining and Harnessing Adversarial ExamplesCode1
Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene ClassificationCode1
An In-depth Study of Stochastic BackpropagationCode1
A Dual-Direction Attention Mixed Feature Network for Facial Expression RecognitionCode1
Cervical Cytology Classification Using PCA & GWO Enhanced Deep Features SelectionCode1
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image EncodingCode1
Exploiting Label Skews in Federated Learning with Model ConcatenationCode1
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy LabelsCode1
Exploring Vision Transformers for Fine-grained ClassificationCode1
Extremely Lightweight Quantization Robust Real-Time Single-Image Super Resolution for Mobile DevicesCode1
Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningCode1
Deep Subdomain Adaptation Network for Image ClassificationCode1
Channel Importance Matters in Few-Shot Image ClassificationCode1
Automated detection of COVID-19 cases from chest X-ray images using deep neural network and XGBoostCode1
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution DetectionCode1
An Open-source Tool for Hyperspectral Image Augmentation in TensorflowCode1
PAD-Net: An Efficient Framework for Dynamic NetworksCode1
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax DiseasesCode1
Fast and Private Inference of Deep Neural Networks by Co-designing Activation FunctionsCode1
Adaptive and Background-Aware Vision Transformer for Real-Time UAV TrackingCode1
CheXWorld: Exploring Image World Modeling for Radiograph Representation LearningCode1
CHiLS: Zero-Shot Image Classification with Hierarchical Label SetsCode1
Automated Learning Rate Scheduler for Large-batch TrainingCode1
Deep Transfer Learning for Land Use and Land Cover Classification: A Comparative StudyCode1
Fast Fishing: Approximating BAIT for Efficient and Scalable Deep Active Image ClassificationCode1
Advancing Vision Transformers with Group-Mix AttentionCode1
Fast Hierarchical Games for Image ExplanationsCode1
Advantages and Bottlenecks of Quantum Machine Learning for Remote SensingCode1
Class Adaptive Network CalibrationCode1
DeepViT: Towards Deeper Vision TransformerCode1
Deep Reinforcement Learning for Band Selection in Hyperspectral Image ClassificationCode1
Show:102550
← PrevPage 17 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified