SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classifying and localizing multiple objects within an image, image classification typically deals with single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
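
As a rough illustration of "one label for the whole image", the sketch below turns a vector of per-class scores into a single predicted label. The label names and score values are hypothetical, and the scores stand in for the output of some classifier that is not shown here. This is a minimal sketch of the general idea, not the method of any paper listed on this page.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(scores, labels):
    """Return the single most probable label and its probability."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical labels and scores for one image (assumed values).
labels = ["cat", "dog", "bird"]
scores = [2.0, 0.5, -1.0]

label, confidence = classify(scores, labels)
print(label, round(confidence, 3))
```

An object detector, by contrast, would return several (label, bounding box) pairs for the same image rather than one label.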

Papers

Showing 801-850 of 10419 papers

Title | Status | Hype
Development of CNN Architectures using Transfer Learning Methods for Medical Image Classification | - | 0
KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements | Code | 0
Altogether: Image Captioning via Re-aligning Alt-text | - | 0
Frontiers in Intelligent Colonoscopy | Code | 2
Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy | Code | 0
Efficient Neural Network Training via Subset Pretraining | - | 0
Visual Representation Learning Guided By Multi-modal Prior Knowledge | - | 0
Bayesian Concept Bottleneck Models with LLM Priors | Code | 0
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | - | 0
P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | - | 0
AutoTrain: No-code training for state-of-the-art models | Code | 7
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Code | 0
Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection | Code | 0
Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion | Code | 2
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation | - | 0
On the Influence of Shape, Texture and Color for Learning Semantic Segmentation | - | 0
Comparative Evaluation of Clustered Federated Learning Methods | Code | 0
How Do Training Methods Influence the Utilization of Vision Models? | Code | 0
A Hybrid Feature Fusion Deep Learning Framework for Leukemia Cancer Detection in Microscopic Blood Sample Using Gated Recurrent Unit and Uncertainty Quantification | - | 0
Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Code | 0
Performance of Gaussian Mixture Model Classifiers on Embedded Feature Spaces | Code | 0
Augmentation Policy Generation for Image Classification Using Large Language Models | - | 0
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning | Code | 0
Is Less More? Exploring Token Condensation as Training-free Adaptation for CLIP | Code | 1
Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Code | 1
PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | - | 0
Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | - | 0
Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy | Code | 0
SGLP: A Similarity Guided Fast Layer Partition Pruning for Compressing Large Deep Models | Code | 0
Towards Better Multi-head Attention via Channel-wise Sample Permutation | Code | 0
Towards a More Complete Theory of Function Preserving Transforms | - | 0
GlobalMamba: Global Image Serialization for Vision Mamba | Code | 1
Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models? | - | 0
Ensemble of ConvNeXt V2 and MaxViT for Long-Tailed CXR Classification with View-Based Aggregation | Code | 0
big.LITTLE Vision Transformer for Efficient Visual Recognition | - | 0
SkillAggregation: Reference-free LLM-Dependent Aggregation | - | 0
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning | - | 0
Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning | - | 0
Robust 3D Point Clouds Classification based on Declarative Defenders | Code | 1
Understanding Robustness of Parameter-Efficient Tuning for Image Classification | Code | 0
Deep Transfer Learning: Model Framework and Error Analysis | - | 0
Diabetic retinopathy image classification method based on GreenBen data augmentation | - | 0
EG-SpikeFormer: Eye-Gaze Guided Transformer on Spiking Neural Networks for Medical Image Analysis | - | 0
Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images | - | 0
Accelerated Distributed Stochastic Non-Convex Optimization over Time-Varying Directed Networks | - | 0
Efficient Hyperparameter Importance Assessment for CNNs | - | 0
Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | - | 0
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Code | 1
Frequency-Temporal Attention Network for Remote Sensing Imagery Change Detection | Code | 0
Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
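
Top 1 Accuracy is commonly defined as the percentage of images whose highest-scoring predicted class equals the ground-truth class. The sketch below computes it for a handful of made-up predictions. The function name and the sample values are illustrative assumptions, and the evaluation protocol behind the claimed numbers in the table is not stated on this page.

```python
def top1_accuracy(predicted, actual):
    """Percentage of samples whose predicted class matches the true class."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("at least one sample is required")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


# Hypothetical class indices for four images (assumed values).
predicted = [3, 1, 2, 2]
actual = [3, 1, 0, 2]

accuracy = top1_accuracy(predicted, actual)
print(f"Top 1 Accuracy: {accuracy:.2f}")
```

Here three of the four predictions match, so the function returns 75.0, which is on the same percentage scale as the Claimed column.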