SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1826-1850 of 10420 papers

| Title | Status | Hype |
| --- | --- | --- |
| SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification | Code | 1 |
| Offline Writer Identification Using Convolutional Neural Network Activation Features | | 0 |
| Searching a Lightweight Network Architecture for Thermal Infrared Pedestrian Tracking | | 0 |
| Enhancing Continuous Domain Adaptation with Multi-Path Transfer Curriculum | | 0 |
| Investigating the Robustness of Vision Transformers against Label Noise in Medical Image Classification | | 0 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Code | 2 |
| MV-Swin-T: Mammogram Classification with Multi-view Swin Transformer | Code | 1 |
| Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification | | 0 |
| EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration | Code | 0 |
| Key Design Choices in Source-Free Unsupervised Domain Adaptation: An In-depth Empirical Analysis | | 0 |
| A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends | | 0 |
| Foveated Retinotopy Improves Classification and Localization in CNNs | | 0 |
| G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups | | 0 |
| SoK: Analyzing Adversarial Examples: A Framework to Study Adversary Knowledge | | 0 |
| PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models | Code | 0 |
| Partial Search in a Frozen Network is Enough to Find a Strong Lottery Ticket | | 0 |
| Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates | | 0 |
| How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | | 0 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Code | 1 |
| Integrating kNN with Foundation Models for Adaptable and Privacy-Aware Image Classification | Code | 0 |
| Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Code | 1 |
| Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers | Code | 1 |
| CowScape: Quantitative reconstruction of the conformational landscape of biological macromolecules from cryo-EM data | | 0 |
| Efficient Multimodal Learning from Data-centric Perspective | Code | 5 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Code | 1 |
Page 74 of 417

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
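
The benchmark rows above report Top 1 Accuracy as a percentage. As a rough illustration of what that metric measures, the sketch below counts how often the highest-scoring class matches the true label. The function name `top1_accuracy` and the toy scores and labels are hypothetical and used only for illustration; they are not taken from any listed paper or from the benchmark itself.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class equals the true label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 4 images, 3 classes (not real benchmark output).
toy_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
toy_labels = [1, 0, 1, 1]

print(top1_accuracy(toy_scores, toy_labels))  # 75.0
```

Three of the four toy predictions match their labels, so the result is 75.0. Reported leaderboard numbers are computed the same way in principle, over a full evaluation set rather than a handful of samples.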