SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 17011750 of 10419 papers

TitleStatusHype
Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention0
Tensor network compressibility of convolutional models0
Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images0
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding0
Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations0
SynerMix: Synergistic Mixup Solution for Enhanced Intra-Class Cohesion and Inter-Class Separability in Image ClassificationCode0
MTP: Advancing Remote Sensing Foundation Model via Multi-Task PretrainingCode3
Bridge the Modality and Capability Gaps in Vision-Language Model Selection0
Leveraging feature communication in federated learning for remote sensing image classification0
Building Optimal Neural Architectures using Interpretable KnowledgeCode0
SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image ClassificationCode0
Using evolutionary computation to optimize task performance of unclocked, recurrent Boolean circuits in FPGAs0
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images0
Improved EATFormer: A Vision Transformer for Medical Image Classification0
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language ModelsCode1
Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification0
Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningCode1
SEVEN: Pruning Transformer Model by Reserving SentinelsCode0
IPCL: Iterative Pseudo-Supervised Contrastive Learning to Improve Self-Supervised Feature RepresentationCode0
Posterior Uncertainty Quantification in Neural Networks using Data AugmentationCode0
A Systematic Review of Generalization Research in Medical Image Classification0
Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting0
Leveraging Spatial and Semantic Feature Extraction for Skin Cancer Diagnosis with Capsule Networks and Graph Neural Networks0
Image and Point-cloud Classification for Jet Analysis in High-Energy Physics: A survey0
Continual Forgetting for Pre-trained Vision ModelsCode2
Better (pseudo-)labels for semi-supervised instance segmentation0
Multiple Teachers-Meticulous Student: A Domain Adaptive Meta-Knowledge Distillation Model for Medical Image ClassificationCode0
Potential of Domain Adaptation in Machine Learning in Ecology and Hydrology to Improve Model Extrapolability0
Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation0
Understanding Robustness of Visual State Space Models for Image ClassificationCode0
RetMIL: Retentive Multiple Instance Learning for Histopathological Whole Slide Image Classification0
Automatic location detection based on deep learningCode0
Forward Learning of Graph Neural NetworksCode1
When Training-Free NAS Meets Vision Transformer: A Neural Tangent Kernel Perspective0
InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised LearningCode1
PALM: Pushing Adaptive Learning Rate Mechanisms for Continual Test-Time AdaptationCode0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
Fast and reliable uncertainty quantification with neural network ensembles for industrial image classification0
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-trainingCode1
Deep Learning for Multi-Level Detection and Localization of Myocardial Scars Based on Regional Strain Validated on Virtual Patients0
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision TransformersCode1
Frozen Feature Augmentation for Few-Shot Image Classification0
Learning on JPEG-LDPC Compressed Images: Classifying with Syndromes0
Achieving Pareto Optimality using Efficient Parameter Reduction for DNNs in Resource-Constrained Edge Environment0
Can We Talk Models Into Seeing the World Differently?Code1
CardioCaps: Attention-based Capsule Network for Class-Imbalanced Echocardiogram ClassificationCode0
XCoOp: Explainable Prompt Learning for Computer-Aided Diagnosis via Concept-guided Context Optimization0
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?Code1
Transformers Get Stable: An End-to-End Signal Propagation Theory for Language ModelsCode1
Randomized Principal Component Analysis for Hyperspectral Image Classification0
Show:102550
← PrevPage 35 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified