SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 501550 of 10419 papers

TitleStatusHype
Practical Continual Forgetting for Pre-trained Vision ModelsCode2
Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning0
Shape-Based Single Object Classification Using Ensemble Method Classifiers0
HydraMix: Multi-Image Feature Mixing for Small Data Image Classification0
MIAFEx: An Attention-based Feature Extraction Method for Medical Image ClassificationCode0
IDEA: Image Description Enhanced CLIP-AdapterCode0
Balance Divergence for Knowledge Distillation0
A Low-cost and Ultra-lightweight Binary Neural Network for Traffic Signal Recognition0
Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins0
deepTerra -- AI Land Classification Made Easy0
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal UnderstandingCode2
PRKAN: Parameter-Reduced Kolmogorov-Arnold Networks0
Uncertainty Guarantees on Automated Precision Weeding using Conformal Prediction0
Adaptive Noise-Tolerant Network for Image Segmentation0
Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures0
LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive ClassifierCode0
Kolmogorov-Arnold networks for metal surface defect classification0
Averaged Adam accelerates stochastic optimization in the training of deep neural network approximations for partial differential equation and optimal control problemsCode0
TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response ScenariosCode2
Merging Feed-Forward Sublayers for Compressed TransformersCode1
A CT Image Classification Network Framework for Lung Tumors Based on Pre-trained MobileNetV2 Model and Transfer learning, And Its Application and Market Analysis in the Medical field0
A 1Mb mixed-precision quantized encoder for image classification and patch-based compression0
A New Perspective on Privacy Protection in Federated Learning with Granular-Ball ComputingCode0
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image ClassificationCode2
Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and BenchmarksCode1
Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection0
Discrete Wavelet Transform-Based Capsule Network for Hyperspectral Image Classification0
Planarian Neural Networks: Evolutionary Patterns from Basic Bilateria Shaping Modern Artificial Neural Network Architectures0
Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video ClassificationCode0
MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention0
Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback0
FTA-FTL: A Fine-Tuned Aggregation Federated Transfer Learning Scheme for Lithology Microscopic Image ClassificationCode0
Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective0
Deep-Relative-Trust-Based Diffusion for Decentralized Deep Learning0
FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models0
Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-500
A Separable Self-attention Inspired by the State Space Model for Computer VisionCode0
Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model0
A Multi-task Supervised Compression Model for Split ComputingCode0
M3amba: Memory Mamba is All You Need for Whole Slide Image Classification0
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation0
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning0
Directional Label Diffusion Model for Learning from Noisy LabelsCode0
Multi-modal Vision Pre-training for Medical Image Analysis0
Towards Universal Dataset Distillation via Task-Driven Diffusion0
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning0
Saliuitl: Ensemble Salience Guided Recovery of Adversarial Patches against CNNsCode0
HistoFS: Non-IID Histopathologic Whole Slide Image Classification via Federated Style Transfer with RoI-Preserving0
LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models0
Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights0
Show:102550
← PrevPage 11 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified