SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
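As a rough illustration of the distinction drawn above, the sketch below contrasts the output of a classifier (one label for the whole image) with that of a detector (a list of labeled boxes). The label set, the `classify` helper, and the `Detection` type are hypothetical names introduced here for illustration; they do not come from the source.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

# Hypothetical label set, used only for this illustration.
LABELS = ("cat", "dog", "bird")


def classify(scores: Sequence[float], labels: Sequence[str] = LABELS) -> str:
    """Return the single label with the highest score for the whole image."""
    if len(scores) != len(labels):
        raise ValueError("need exactly one score per label")
    # Index of the largest score; ties resolve to the first such index.
    best = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best]


@dataclass
class Detection:
    """One detected object: a label plus where it is in the image."""
    label: str
    box: Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


# Classification yields one label per image.
image_label = classify([0.1, 0.7, 0.2])

# Detection (shown only for contrast) yields a label and a box per object.
detections = [
    Detection("dog", (10.0, 20.0, 110.0, 220.0)),
    Detection("cat", (150.0, 40.0, 260.0, 200.0)),
]
```

Here `image_label` is a single string, while `detections` holds one entry per object, which is the structural difference the description points at.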

Papers

Showing 5301-5350 of 10420 papers

Title | Status | Hype
Deep Learning based CNN Model for Classification and Detection of Individuals Wearing Face Mask | | 0
Integrating Propositional and Relational Label Side Information for Hierarchical Zero-Shot Image Classification | | 0
Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification | | 0
Integration and Performance Analysis of Artificial Intelligence and Computer Vision Based on Deep Learning Algorithms | | 0
Integration of Roadside Camera Images and Weather Data for Monitoring Winter Road Surface Conditions | | 0
Intelligent Cervical Spine Fracture Detection Using Deep Learning Methods | | 0
Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity | | 0
Deep learning based prediction of Alzheimer's disease from magnetic resonance images | | 0
Hierarchical Vision Transformer with Prototypes for Interpretable Medical Image Classification | | 0
Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models | | 0
Hierarchical Transfer Convolutional Neural Networks for Image Classification | | 0
Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification | | 0
Hierarchical Sparse Attention Framework for Computationally Efficient Classification of Biological Cells | | 0
Improved and Explainable Cervical Cancer Classification using Ensemble Pooling of Block Fused Descriptors | | 0
Hierarchical Side-Tuning for Vision Transformers | | 0
Interleaving Learning, with Application to Neural Architecture Search | | 0
Deep Learning for Active Region Classification: A Systematic Study from Convolutional Neural Networks to Vision Transformers | | 0
Attentive CutMix: An Enhanced Data Augmentation Approach for Deep Learning Based Image Classification | | 0
AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | | 0
Learning Dependency Structures for Weak Supervision Models | | 0
Interpolation between CNNs and ResNets | | 0
Learning Discriminative Representation via Metric Learning for Imbalanced Medical Image Classification | | 0
Learning Fine-grained Features via a CNN Tree for Large-scale Classification | | 0
Hierarchical Semantic Tree Concept Whitening for Interpretable Image Classification | | 0
Hierarchical ResNeXt Models for Breast Cancer Histology Image Classification | | 0
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | | 0
Interpretable Deep Models for Cardiac Resynchronisation Therapy Response Prediction | | 0
Interpretable Failure Detection with Human-Level Concepts | | 0
Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts | | 0
Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification | | 0
Cross-Modal Information Maximization for Medical Imaging: CMIM | | 0
Hierarchical Representation based Query-Specific Prototypical Network for Few-Shot Image Classification | | 0
Interpretable Mammographic Image Classification using Case-Based Reasoning and Deep Learning | | 0
Attention Tree: Learning Hierarchies of Visual Features for Large-Scale Image Recognition | | 0
Deep learning for lithological classification of carbonate rock micro-CT images | | 0
Learning Deep Context-Network Architectures for Image Annotation | | 0
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning | | 0
Cross-Modal Concept Learning and Inference for Vision-Language Models | | 0
Bonseyes AI Pipeline -- bringing AI to you. End-to-end integration of data, algorithms and deployment tools | | 0
Interpreting Equivariant Representations | | 0
Interpreting Interpretations: Organizing Attribution Methods by Criteria | | 0
Interpreting the Predictions of Complex ML Models by Layer-wise Relevance Propagation | | 0
Auxiliary Cross-Modal Representation Learning with Triplet Loss Functions for Online Handwriting Recognition | | 0
Deep Learning Generalization and the Convex Hull of Training Sets | | 0
Interventional Black-Box Explanations | | 0
Deep Learning Generalization, Extrapolation, and Over-parameterization | | 0
Hierarchical Image Classification with A Literally Toy Dataset | | 0
Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs | | 0
Robustness via Deep Low-Rank Representations | | 0
Attention Spiking Neural Networks | | 0
Page 107 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified
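
The metric reported above, Top 1 Accuracy, is conventionally the percentage of test images whose highest-scoring predicted class matches the ground-truth class. The sketch below shows one way it could be computed; the function name `top1_accuracy` and the toy scores are illustrative assumptions, not the evaluation code behind any entry in the table.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], targets: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class equals the target class."""
    if len(scores) != len(targets):
        raise ValueError("scores and targets must have the same length")
    if not targets:
        raise ValueError("need at least one sample")
    correct = 0
    for row, target in zip(scores, targets):
        # Index of the largest score; ties resolve to the first such index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == target:
            correct += 1
    return 100.0 * correct / len(targets)


# Toy data (made up): four samples, two classes.
toy_scores = [[0.1, 0.9], [0.8, 0.2], [0.3, 0.7], [0.6, 0.4]]
toy_targets = [1, 0, 0, 0]
toy_result = top1_accuracy(toy_scores, toy_targets)  # 3 of 4 correct -> 75.0
```

The result is expressed as a percentage so that it is on the same scale as the Claimed column.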