SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 80518100 of 10420 papers

TitleStatusHype
Few-Shot Learning Approach on Tuberculosis Classification Based on Chest X-Ray Images0
A fast dynamic graph convolutional network and CNN parallel network for hyperspectral image classification0
Computer vision and machine learning for medical image analysis: recent advances, challenges, and way forward0
Revisiting Metric Learning for Few-Shot Image Classification0
Feature Whitening via Gradient Transformation for Improved Convergence0
Rethinking the Zigzag Flattening for Image Reading0
Rethinking Two Consensuses of the Transferability in Deep Learning0
Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities0
Rethinking VLMs and LLMs for Image Classification0
Few-shot Image Classification with Multi-Facet Prototypes0
Few-Shot Image Classification via Contrastive Self-Supervised Learning0
Foveated Retinotopy Improves Classification and Localization in CNNs0
A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification0
Compute Less to Get More: Using ORC to Improve Sparse Filtering0
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision0
Retrieval-enriched zero-shot image classification in low-resource domains0
Few-shot Image Classification based on Gradual Machine Learning0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization0
Few-Shot Image Classification Along Sparse Graphs0
Reveal of Vision Transformers Robustness against Adversarial Attacks0
FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients0
Reversed Active Learning based Atrous DenseNet for Pathological Image Classification0
Reverse engineering adversarial attacks with fingerprints from adversarial examples0
FedDropoutAvg: Generalizable federated learning for histopathology image classification0
Unauthorized AI cannot Recognize Me: Reversible Adversarial Example0
Few-Shot Hyperspectral Image Classification With Unknown Classes Using Multitask Deep Learning0
Computed tomography using meta-optics0
A Self-Supervised Feature Map Augmentation (FMA) Loss and Combined Augmentations Finetuning to Efficiently Improve the Robustness of CNNs0
LAM3D: Leveraging Attention for Monocular 3D Object Detection0
Review of AlexNet for Medical Image Classification0
A Test Statistic Estimation-based Approach for Establishing Self-interpretable CNN-based Binary Classifiers0
Rice Leaf Disease Detection: A Comparative Study Between CNN, Transformer and Non-neural Network Architectures0
Revisiting Adversarial Risk0
Revisiting Adversarial Robustness of Classifiers With a Reject Option0
Revisiting a kNN-based Image Classification System with High-capacity Storage0
Robust Brain MRI Image Classification with SIBOW-SVM0
RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network0
Scene Uncertainty and the Wellington Posterior of Deterministic Image Classifiers0
Revisiting Data Augmentation for Rotational Invariance in Convolutional Neural Networks0
Federated Learning for Commercial Image Sources0
The Efficacy of SHIELD under Different Threat Models0
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross ModulationCode0
A self-interpretable module for deep image classification on small dataCode0
Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty MeasurementCode0
Few and Fewer: Learning Better from Few Examples Using Fewer Base ClassesCode0
Feedback Gradient Descent: Efficient and Stable Optimization with Orthogonality for DNNsCode0
Compressing Vision Transformers for Low-Resource Visual LearningCode0
A second-order-like optimizer with adaptive gradient scaling for deep learningCode0
Compressing Deep CNNs using Basis Representation and Spectral Fine-tuningCode0
Show:102550
← PrevPage 162 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified