SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 11511200 of 10419 papers

TitleStatusHype
Nested Collaborative Learning for Long-Tailed Visual RecognitionCode1
VGGIN-Net: Deep Transfer Network for Imbalanced Breast Cancer DatasetCode1
Parameter-efficient Model Adaptation for Vision TransformersCode1
CNN Filter DB: An Empirical Investigation of Trained Convolutional FiltersCode1
CHEX: CHannel EXploration for CNN Model CompressionCode1
A Novel Approach for detecting Normal, COVID-19 and Pneumonia patient using only binary classifications from chest CT-ScansCode1
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language ModelCode1
Knowledge Mining with Scene Text for Fine-Grained RecognitionCode1
How to Robustify Black-Box ML Models? A Zeroth-Order Optimization PerspectiveCode1
Uncertainty-aware Contrastive Distillation for Incremental Semantic SegmentationCode1
A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network CalibrationCode1
Moving Window Regression: A Novel Approach to Ordinal RegressionCode1
Semisupervised Cross-scale Graph Prototypical Network for Hyperspectral Image ClassificationCode1
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision TransformersCode1
DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image ClassificationCode1
Improving Generalization in Federated Learning by Seeking Flat MinimaCode1
FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and CorrectionCode1
Test-time Adaptation with Slot-Centric ModelsCode1
ViM: Out-Of-Distribution with Virtual-logit MatchingCode1
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object NavigationCode1
Do Deep Networks Transfer Invariances Across Classes?Code1
Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?Code1
DATA: Domain-Aware and Task-Aware Self-supervised LearningCode1
DocXClassifier: High Performance Explainable Deep Network for Document Image ClassificationCode1
Open Set Recognition using Vision Transformer with an Additional Detection HeadCode1
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine SynergyCode1
Scalable Penalized Regression for Noise Detection in Learning with Noisy LabelsCode1
Energy-Latency Attacks via Sponge PoisoningCode1
Deep AutoAugmentCode1
Active Token MixerCode1
Deep Multimodal Guidance for Medical Image ClassificationCode1
Self Pre-training with Masked Autoencoders for Medical Image Classification and SegmentationCode1
Graph Attention Transformer Network for Multi-Label Image ClassificationCode1
Selective-Supervised Contrastive Learning with Noisy LabelsCode1
PASS: Part-Aware Self-Supervised Pre-Training for Person Re-IdentificationCode1
Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal InformationCode1
WaveMix: Resource-efficient Token Mixing for ImagesCode1
Class-Aware Contrastive Semi-Supervised LearningCode1
DiT: Self-supervised Pre-training for Document Image TransformerCode1
Random Quantum Neural Networks (RQNN) for Noisy Image RecognitionCode1
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image ClassificationCode1
Evaluating the Adversarial Robustness of Adaptive Test-time DefensesCode1
Attribute Descent: Simulating Object-Centric Datasets on the Content Level and BeyondCode1
Relational Surrogate Loss LearningCode1
TeachAugment: Data Augmentation Optimization Using Teacher KnowledgeCode1
ChimeraMix: Image Classification on Small Datasets via Masked Feature MixingCode1
DataMUX: Data Multiplexing for Neural NetworksCode1
Multi-task UNet: Jointly Boosting Saliency Prediction and Disease Classification on Chest X-ray ImagesCode1
ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image ClassificationCode1
Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest FiltersCode1
Show:102550
← PrevPage 24 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified