SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically applies to single-object images. When classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 2951-3000 of 10419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Spatial-Temporal Search for Spiking Neural Networks | | 0 |
| A Combinatorial Approach to Neural Emergent Communication | | 0 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | | 0 |
| New Insight in Cervical Cancer Diagnosis Using Convolution Neural Network Architecture | | 0 |
| Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | | 0 |
| Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing | | 0 |
| Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning | Code | 0 |
| Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | | 0 |
| KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Code | 0 |
| Benchmarking Large Language Models for Image Classification of Marine Mammals | Code | 0 |
| Altogether: Image Captioning via Re-aligning Alt-text | | 0 |
| Data Obfuscation through Latent Space Projection (LSP) for Privacy-Preserving AI Governance: Case Studies in Medical Diagnosis and Finance Fraud Detection | | 0 |
| Efficient Neural Network Training via Subset Pretraining | | 0 |
| P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | | 0 |
| Bayesian Concept Bottleneck Models with LLM Priors | Code | 0 |
| Visual Representation Learning Guided By Multi-modal Prior Knowledge | | 0 |
| Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy | Code | 0 |
| ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | | 0 |
| Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Code | 0 |
| Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection | Code | 0 |
| Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | | 0 |
| Comparative Evaluation of Clustered Federated Learning Methods | Code | 0 |
| On the Influence of Shape, Texture and Color for Learning Semantic Segmentation | | 0 |
| How Do Training Methods Influence the Utilization of Vision Models? | Code | 0 |
| A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification | | 0 |
| LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Code | 0 |
| Performance of Gaussian Mixture Model Classifiers on Embedded Feature Spaces | Code | 0 |
| Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Code | 0 |
| Augmentation Policy Generation for Image Classification Using Large Language Models | | 0 |
| Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | | 0 |
| PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | | 0 |
| Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy | Code | 0 |
| Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Code | 0 |
| Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | | 0 |
| SkillAggregation: Reference-free LLM-Dependent Aggregation | | 0 |
| SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models | Code | 0 |
| big.LITTLE Vision Transformer for Efficient Visual Recognition | | 0 |
| Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | | 0 |
| Towards Better Multi-head Attention via Channel-wise Sample Permutation | Code | 0 |
| Towards a More Complete Theory of Function Preserving Transforms | | 0 |
| Understanding Robustness of Parameter-Efficient Tuning for Image Classification | Code | 0 |
| Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning | | 0 |
| EG-SpikeFormer: Eye-Gaze Guided Transformer on Spiking Neural Networks for Medical Image Analysis | | 0 |
| Diabetic retinopathy image classification method based on GreenBen data augmentation | | 0 |
| Deep Transfer Learning: Model Framework and Error Analysis | | 0 |
| Efficient Hyperparameter Importance Assessment for CNNs | | 0 |
| Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | | 0 |
| Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | | 0 |
| Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images | | 0 |
| What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
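
The results above are reported as Top 1 Accuracy. As a rough illustration of what that metric usually means (the percentage of images whose highest-scoring class matches the true label), here is a minimal sketch in plain Python. The function name `top1_accuracy` and the example scores and labels are hypothetical and are not taken from any listed paper or from this site's evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class equals the true label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("need at least one sample")
    correct = 0
    for row, label in zip(scores, labels):
        # argmax over class scores; the first index wins on ties
        predicted = max(range(len(row)), key=row.__getitem__)
        correct += int(predicted == label)
    return 100.0 * correct / len(labels)


# Hypothetical per-class scores for 4 images over 3 classes (illustrative only)
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.2, 0.3, 0.5],  # predicted class 2
    [0.6, 0.3, 0.1],  # predicted class 0
]
labels = [1, 0, 2, 1]  # the last image is misclassified

accuracy = top1_accuracy(scores, labels)
print(f"{accuracy:.1f}")  # 75.0
```

On this scale, a claimed value such as 90.4 would correspond to roughly 904 correct predictions per 1000 evaluation images, assuming the figure is a percentage.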