Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 10420 papers

Title	Date	Tasks	Status	Hype
Focal Modulation Networks	Mar 22, 2022	image-classificationImage Classification	CodeCode Available	2
Fast Vision Transformers with HiLo Attention	May 26, 2022	BenchmarkingEfficient ViTs	CodeCode Available	2
Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources	Jan 4, 2022	Domain Adaptationdomain classification	CodeCode Available	2
ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification	Jan 21, 2022	BIG-bench Machine Learningimage-classification	CodeCode Available	2
Frontiers in Intelligent Colonoscopy	Oct 22, 2024	Image Captioning	CodeCode Available	2
HAIR: Hypernetworks-based All-in-One Image Restoration	Aug 15, 2024	5-Degradation Blind All-in-One Image RestorationAll	CodeCode Available	2
Efficient Multi-Scale Attention Module with Cross-Spatial Learning	May 23, 2023	Dimensionality Reductionimage-classification	CodeCode Available	2
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality	Nov 22, 2024	Efficient Neural NetworkImage Classification	CodeCode Available	2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery	Sep 18, 2021	Change DetectionDecoder	CodeCode Available	2
Agent Attention: On the Integration of Softmax and Linear Attention	Dec 14, 2023	Computational Efficiencyimage-classification	CodeCode Available	2
Effective Data Augmentation With Diffusion Models	Feb 7, 2023	Data AugmentationDiversity	CodeCode Available	2
EMR-Merging: Tuning-Free High-Performance Model Merging	May 23, 2024	Image ClassificationImage Retrieval	CodeCode Available	2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications	Jun 21, 2022	Image ClassificationObject Detection	CodeCode Available	2
FasterViT: Fast Vision Transformers with Hierarchical Attention	Jun 9, 2023	Image Classificationobject-detection	CodeCode Available	2
Fixing the train-test resolution discrepancy	Jun 14, 2019	Data AugmentationFine-Grained Image Classification	CodeCode Available	2
MogaNet: Multi-order Gated Aggregation Network	Nov 7, 2022	3D Human Pose EstimationImage Classification	CodeCode Available	2
Dilated Neighborhood Attention Transformer	Sep 29, 2022	Image ClassificationInstance Segmentation	CodeCode Available	2
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence	Jan 21, 2020	Image ClassificationPseudo Label	CodeCode Available	2
DEYO: DETR with YOLO for End-to-End Object Detection	Feb 26, 2024	DecoderGPU	CodeCode Available	2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning	Dec 16, 2024	DeepFake Detectiondiffusion-generated faces detection	CodeCode Available	2
GalLoP: Learning Global and Local Prompts for Vision-Language Models	Jul 1, 2024	DiversityDomain Generalization	CodeCode Available	2
Generalized Parametric Contrastive Learning	Sep 26, 2022	Contrastive LearningDomain Generalization	CodeCode Available	2
GIT: A Generative Image-to-text Transformer for Vision and Language	May 27, 2022	DecoderImage Captioning	CodeCode Available	2
Global Context Vision Transformers	Jun 20, 2022	image-classificationImage Classification	CodeCode Available	2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification	Aug 25, 2024	Computational EfficiencyHyperspectral Image Classification	CodeCode Available	2

Show:10 25 50

← PrevPage 5 of 417Next →

All datasets ImageNet CIFAR-10 CIFAR-100 STL-10 ObjectNet MNIST SVHN iNaturalist 2018 ImageNet ReaL Flowers-102 Clothing1M mini WebVision 1.0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	CoCa (finetuned)	Top 1 Accuracy	91	—	Unverified
2	Model soups (BASIC-L)	Top 1 Accuracy	90.98	—	Unverified
3	Model soups (ViT-G/14)	Top 1 Accuracy	90.94	—	Unverified
4	DaViT-G	Top 1 Accuracy	90.4	—	Unverified
5	Meta Pseudo Labels (EfficientNet-L2)	Top 1 Accuracy	90.2	—	Unverified
6	DaViT-H	Top 1 Accuracy	90.2	—	Unverified
7	SwinV2-G	Top 1 Accuracy	90.17	—	Unverified
8	MAWS (ViT-6.5B)	Top 1 Accuracy	90.1	—	Unverified
9	Florence-CoSwin-H	Top 1 Accuracy	90.05	—	Unverified
10	Meta Pseudo Labels (EfficientNet-B6-Wide)	Top 1 Accuracy	90	—	Unverified