SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to assign a single label to an image as a whole. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 3401-3450 of 10419 papers

Title | Status | Hype
Boosting Medical Image Classification with Segmentation Foundation Model | - | 0
Saliency-guided and Patch-based Mixup for Long-tailed Skin Cancer Image Classification | - | 0
Robust Image Classification in the Presence of Out-of-Distribution and Adversarial Samples Using Attractors in Neural Networks | - | 0
Comparison of fine-tuning strategies for transfer learning in medical image classification | - | 0
Adaptive Randomized Smoothing: Certified Adversarial Robustness for Multi-Step Defences | Code | 0
Forgetting Order of Continual Learning: Examples That are Learned First are Forgotten Last | - | 0
AI-Based Copyright Detection Of An Image In a Video Using Degree Of Similarity And Image Hashing | - | 0
Large-Scale Evaluation of Open-Set Image Classification Techniques | Code | 0
LaCoOT: Layer Collapse through Optimal Transport | - | 0
The Penalized Inverse Probability Measure for Conformal Classification | - | 0
MirrorCheck: Efficient Adversarial Defense for Vision-Language Models | - | 0
How Out-of-Distribution Detection Learning Theory Enhances Transformer: Learnability and Reliability | - | 0
Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency | Code | 0
Transformation-Dependent Adversarial Attacks | - | 0
Multi-Teacher Multi-Objective Meta-Learning for Zero-Shot Hyperspectral Band Selection | - | 0
A^2-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | - | 0
Intelligent Multi-View Test Time Augmentation | Code | 0
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | - | 0
Accurate Explanation Model for Image Classifiers using Class Association Embedding | Code | 0
AdaNCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | - | 0
Fairness-Aware Meta-Learning via Nash Bargaining | - | 0
EEG-ImageNet: An Electroencephalogram Dataset and Benchmarks with Image Visual Stimuli of Multi-Granularity Labels | Code | 0
DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification | - | 0
Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | - | 0
Equivariant Neural Tangent Kernels | - | 0
Multi-Objective Neural Architecture Search for In-Memory Computing | - | 0
Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer Vision | Code | 0
Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification | - | 0
Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification | - | 0
REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning | - | 0
Classification Metrics for Image Explanations: Towards Building Reliable XAI-Evaluations | Code | 0
A Novel Time Series-to-Image Encoding Approach for Weather Phenomena Classification | - | 0
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Code | 0
Data-Free Generative Replay for Class-Incremental Learning on Imbalanced Data | Code | 0
Cooperative Meta-Learning with Gradient Augmentation | Code | 0
OCCAM: Towards Cost-Efficient and Accuracy-Aware Image Classification Inference | - | 0
ReDistill: Residual Encoded Distillation for Peak Memory Reduction | - | 0
Can Language Models Use Forecasting Strategies? | - | 0
Tiny models from tiny data: Textual and null-text inversion for few-shot distillation | Code | 0
Exploiting LMM-based knowledge for image classification tasks | - | 0
Identification of Stone Deterioration Patterns with Large Multimodal Models | Code | 0
Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review | - | 0
Exploring Effects of Hyperdimensional Vectors for Tsetlin Machines | - | 0
CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations | - | 0
Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models | - | 0
DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery | Code | 0
Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | - | 0
Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | - | 0
Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | - | 0
Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification | - | 0
Page 69 of 209

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
6 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | RevCol-H | Top 1 Accuracy | 90 | - | Unverified
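
Top 1 Accuracy, the metric reported in the benchmark results above, is conventionally the percentage of images whose highest-scoring predicted class matches the ground-truth label. The sketch below illustrates that definition only. The function name `top1_accuracy`, the nested-list score format, and the toy data are assumptions made for illustration; they are not taken from any listed model or from the benchmark's own evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose top-scoring class equals the label.

    `scores[i][c]` is the (hypothetical) model score for class `c` on sample `i`;
    `labels[i]` is the integer index of the true class for sample `i`.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score; ties resolve to the lowest class index.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy example (made-up scores for 4 images over 3 classes).
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
labels = [1, 0, 2, 2]
accuracy = top1_accuracy(scores, labels)
print(accuracy)  # 3 of 4 predictions match, so this prints 75.0
```

Published evaluations typically compute the same quantity over a full validation set with a tensor library; the pure-Python version here trades speed for having no dependencies.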