SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 2951-2975 of 10420 papers

| Title | Status | Hype |
| --- | --- | --- |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | | 0 |
| Noise Adaption Network for Morse Code Image Classification | Code | 0 |
| A Combinatorial Approach to Neural Emergent Communication | | 0 |
| Spatial-Temporal Search for Spiking Neural Networks | | 0 |
| Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | | 0 |
| Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | | 0 |
| Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning | Code | 0 |
| New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture | | 0 |
| Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | | 0 |
| Benchmarking Large Language Models for Image Classification of Marine Mammals | Code | 0 |
| Altogether: Image Captioning via Re-aligning Alt-text | | 0 |
| KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Code | 0 |
| Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection | | 0 |
| Efficient Neural Network Training via Subset Pretraining | | 0 |
| Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy | Code | 0 |
| ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | | 0 |
| Bayesian Concept Bottleneck Models with LLM Priors | Code | 0 |
| P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | | 0 |
| Visual Representation Learning Guided By Multi-modal Prior Knowledge | | 0 |
| Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Code | 0 |
| Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | | 0 |
| Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection | Code | 0 |
| Comparative Evaluation of Clustered Federated Learning Methods | Code | 0 |
| On the Influence of Shape, Texture and Color for Learning Semantic Segmentation | | 0 |
| How Do Training Methods Influence the Utilization of Vision Models? | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
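
The Top 1 Accuracy metric in the table above is conventionally the share of images whose highest-scoring predicted class matches the ground-truth label. The page does not state how the listed values were computed or on which dataset, so the following is only a minimal sketch of the usual definition. It assumes the values are percentages, and the function name `top1_accuracy` and the toy data are hypothetical.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose arg-max class equals the true label.

    `scores[i][c]` is the model's score for class `c` on sample `i`
    (hypothetical input layout, chosen for illustration only).
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest score is the top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data: 4 samples, 3 classes (made up for illustration).
scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1
]
labels = [1, 0, 1, 1]

print(top1_accuracy(scores, labels))  # 75.0 (3 of 4 predictions match)
```

Under this definition, a claimed value such as 90.98 would mean that roughly 91 of every 100 evaluation images received the correct label as the model's single best guess.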