SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 15511600 of 10419 papers

TitleStatusHype
Leveraging Cross-Modal Neighbor Representation for Improved CLIP ClassificationCode1
A Method of Moments Embedding Constraint and its Application to Semi-Supervised LearningCode0
SPLICE -- Streamlining Digital Pathology Image Processing0
SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes0
Lacunarity Pooling Layers for Plant Image Classification using Texture AnalysisCode0
IMWA: Iterative Model Weight Averaging Benefits Class-Imbalanced Learning Tasks0
Vision Transformer-based Adversarial Domain AdaptationCode0
Efficient Multi-Model Fusion with Adversarial Complementary Representation Learning0
MiMICRI: Towards Domain-centered Counterfactual Explanations of Cardiovascular Image Classification ModelsCode0
MoDE: CLIP Data Experts via Clustering0
Rethinking model prototyping through the MedMNIST+ dataset collectionCode1
Brain Storm Optimization Based Swarm Learning for Diabetic Retinopathy Image Classification0
Compressed Meta-Optical Encoder for Image Classification0
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models0
Deep multi-prototype capsule networks0
A review of deep learning-based information fusion techniques for multimodal medical image classification0
CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision ModelsCode1
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
Pyramid Hierarchical Transformer for Hyperspectral Image ClassificationCode1
A Comprehensive Survey for Hyperspectral Image Classification: The Evolution from Conventional to Transformers and Mamba ModelsCode1
Learning-to-learn enables rapid learning with phase-change memory-based in-memory computing0
WangLab at MEDIQA-M3G 2024: Multimodal Medical Answer Generation using Large Language Models0
CKD: Contrastive Knowledge Distillation from A Sample-wise PerspectiveCode0
EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder0
I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning0
3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image ClassificationCode0
Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing0
On-board classification of underwater images using hybrid classical-quantum CNN based method0
Next Generation Loss Function for Image ClassificationCode1
A Hybrid Generative and Discriminative PointNet on Unordered Point Sets0
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment0
A Perspective on Deep Vision Performance with Standard Image and Video Codecs0
Concept Induction using LLMs: a user experiment for assessment0
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene0
GhostNetV3: Exploring the Training Strategies for Compact Models0
InfoMatch: Entropy Neural Estimation for Semi-Supervised Image ClassificationCode1
Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured0
Supervised Contrastive Vision Transformer for Breast Histopathological Image Classification0
MobileNetV4 -- Universal Models for the Mobile EcosystemCode3
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language ModelsCode2
Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal TransportCode0
Tree Bandits for Generative Bayes0
Assessing The Impact of CNN Auto Encoder-Based Image Denoising on Image Classification TasksCode0
Vocabulary-free Image Classification and Semantic SegmentationCode0
A variable metric proximal stochastic gradient method: an application to classification problemsCode0
Distributed Federated Learning-Based Deep Learning Model for Privacy MRI Brain Tumor Detection0
Pseudo-label Learning with Calibrated Confidence Using an Energy-based Model0
Interaction as Explanation: A User Interaction-based Method for Explaining Image Classification Models0
Coreset Selection for Object Detection0
Show:102550
← PrevPage 32 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified