SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 29513000 of 10419 papers

TitleStatusHype
One-Shot Online Testing of Deep Neural Networks Based on Distribution Shift Detection0
Causal Analysis for Robust Interpretability of Neural Networks0
SuSana Distancia is all you need: Enforcing class separability in metric learning via two novel distance-based loss functions for few-shot image classification0
Learning More Discriminative Local Descriptors for Few-shot Learning0
Predictive Models from Quantum Computer Benchmarks0
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks0
Enhancing Performance of Vision Transformers on Small Datasets through Local Inductive Bias Incorporation0
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity0
To transfer or not transfer: Unified transferability metric and analysis0
Two-in-One: A Model Hijacking Attack Against Text Generation Models0
Saturated Non-Monotonic Activation Functions0
Meta-Optimization for Higher Model Generalizability in Single-Image Depth Prediction0
Feature Channel Adaptive Enhancement for Fine-Grained Visual Classification0
Meta-Learners for Few-Shot Weakly-Supervised Medical Image SegmentationCode0
Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image Classification Using Transformers0
Learning Semi-supervised Gaussian Mixture Models for Generalized Category DiscoveryCode1
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classificationCode0
Towards Effective Visual Representations for Partial-Label LearningCode1
Explainable Knowledge Distillation for On-device Chest X-Ray Classification0
DPMLBench: Holistic Evaluation of Differentially Private Machine LearningCode1
A Multi-modal Approach to Single-modal Visual Place Classification0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception0
Architectural Vision for Quantum Computing in the Edge-Cloud ContinuumCode0
CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide ImagesCode1
Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels0
DC3DCD: unsupervised learning for multiclass 3D point cloud change detectionCode1
Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance0
Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm CorruptionsCode0
LABO: Towards Learning Optimal Label Regularization via Bi-level Optimization0
Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive FieldsCode0
Creative Discovery using QD SearchCode0
Pick your Poison: Undetectability versus Robustness in Data Poisoning Attacks0
Boldness-Recalibration for Binary Event Predictions0
Reduction of Class Activation Uncertainty with Background InformationCode1
Semantic Segmentation using Vision Transformers: A survey0
Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models0
Breast Cancer Diagnosis Using Machine Learning Techniques0
LatentAugment: Dynamically Optimized Latent Probabilities of Data AugmentationCode0
Forward-Forward Contrastive Learning0
Image Captioners Sometimes Tell More Than Images They See0
Unsupervised Mutual Transformer Learning for Multi-Gigapixel Whole Slide Image Classification0
mAedesID: Android Application for Aedes Mosquito Species Identification using Convolutional Neural Network0
Long-Tailed Recognition by Mutual Information Maximization between Latent Features and Ground-Truth LabelsCode1
On the Impact of Data Quality on Image Classification Fairness0
CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual RepresentationsCode1
FCA: Taming Long-tailed Federated Medical Image Classification by Classifier AnchoringCode0
TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image ClassificationCode0
Detecting Novelties with Empty Classes0
Instruction-ViT: Multi-Modal Prompts for Instruction Learning in ViT0
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained modelsCode1
Show:102550
← PrevPage 60 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified