Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 126–150 of 10419 papers

Title	Date	Tasks	Status	Hype
Learning Transferable Visual Models From Natural Language Supervision	Feb 26, 2021	Action RecognitionBenchmarking	CodeCode Available	2
GroupMamba: Efficient Group-Based Visual State Space Model	Jul 18, 2024	image-classificationImage Classification	CodeCode Available	2
Fixing the train-test resolution discrepancy: FixEfficientNet	Mar 18, 2020	Data AugmentationImage Classification	CodeCode Available	2
Fixing the train-test resolution discrepancy	Jun 14, 2019	Data AugmentationFine-Grained Image Classification	CodeCode Available	2
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence	Jan 21, 2020	Image ClassificationPseudo Label	CodeCode Available	2
Fast Vision Transformers with HiLo Attention	May 26, 2022	BenchmarkingEfficient ViTs	CodeCode Available	2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification	Aug 25, 2024	Computational EfficiencyHyperspectral Image Classification	CodeCode Available	2
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence	Jan 21, 2020	Image ClassificationPseudo Label	CodeCode Available	2
ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification	Jan 21, 2022	BIG-bench Machine Learningimage-classification	CodeCode Available	2
MobileOne: An Improved One millisecond Mobile Backbone	Jun 8, 2022	Efficient Neural NetworkGaze Estimation	CodeCode Available	2
EMR-Merging: Tuning-Free High-Performance Model Merging	May 23, 2024	Image ClassificationImage Retrieval	CodeCode Available	2
FasterViT: Fast Vision Transformers with Hierarchical Attention	Jun 9, 2023	Image Classificationobject-detection	CodeCode Available	2
HAIR: Hypernetworks-based All-in-One Image Restoration	Aug 15, 2024	5-Degradation Blind All-in-One Image RestorationAll	CodeCode Available	2
ALBench: A Framework for Evaluating Active Learning in Object Detection	Jul 27, 2022	Active Learningimage-classification	CodeCode Available	2
MogaNet: Multi-order Gated Aggregation Network	Nov 7, 2022	3D Human Pose EstimationImage Classification	CodeCode Available	2
Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources	Jan 4, 2022	Domain Adaptationdomain classification	CodeCode Available	2
Efficient Multi-Scale Attention Module with Cross-Spatial Learning	May 23, 2023	Dimensionality Reductionimage-classification	CodeCode Available	2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification	Dec 1, 2024	Computational Efficiencyimage-classification	CodeCode Available	2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery	Sep 18, 2021	Change DetectionDecoder	CodeCode Available	2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications	Jun 21, 2022	Image ClassificationObject Detection	CodeCode Available	2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer	Mar 8, 2022	Image Classificationobject-detection	CodeCode Available	2
Effective Data Augmentation With Diffusion Models	Feb 7, 2023	Data AugmentationDiversity	CodeCode Available	2
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification	Jul 4, 2024	DescriptiveDiversity	CodeCode Available	2
Dilated Neighborhood Attention Transformer	Sep 29, 2022	Image ClassificationInstance Segmentation	CodeCode Available	2
Agent Attention: On the Integration of Softmax and Linear Attention	Dec 14, 2023	Computational Efficiencyimage-classification	CodeCode Available	2

Show:10 25 50

← PrevPage 6 of 417Next →

All datasets ImageNet CIFAR-10 CIFAR-100 STL-10 ObjectNet MNIST SVHN iNaturalist 2018 ImageNet ReaL Flowers-102 Clothing1M mini WebVision 1.0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CoCa (finetuned)	Top 1 Accuracy	91	—	Unverified
2	Model soups (BASIC-L)	Top 1 Accuracy	90.98	—	Unverified
3	Model soups (ViT-G/14)	Top 1 Accuracy	90.94	—	Unverified
4	DaViT-G	Top 1 Accuracy	90.4	—	Unverified
5	Meta Pseudo Labels (EfficientNet-L2)	Top 1 Accuracy	90.2	—	Unverified
6	DaViT-H	Top 1 Accuracy	90.2	—	Unverified
7	SwinV2-G	Top 1 Accuracy	90.17	—	Unverified
8	MAWS (ViT-6.5B)	Top 1 Accuracy	90.1	—	Unverified
9	Florence-CoSwin-H	Top 1 Accuracy	90.05	—	Unverified
10	RevCol-H	Top 1 Accuracy	90	—	Unverified