SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 51100 of 10419 papers

TitleStatusHype
Can We Infer Confidential Properties of Training Data from LLMs?0
DeepTraverse: A Depth-First Search Inspired Network for Algorithmic Visual Understanding0
Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers0
ScalableHD: Scalable and High-Throughput Hyperdimensional Computing Inference on Multi-Core CPUs0
InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck MambaCode1
Hyperspectral Image Classification via Transformer-based Spectral-Spatial Attention Decoupling and Adaptive GatingCode0
Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image ClassificationCode0
Biologically Inspired Deep Learning Approaches for Fetal Ultrasound Image Classification0
Hyperbolic Dual Feature Augmentation for Open-Environment0
An Adaptive Method Stabilizing Activations for Enhanced GeneralizationCode0
Mind the Gap: Removing the Discretization Gap in Differentiable Logic Gate Networks0
Improving Memory Efficiency for Training KANs via Meta LearningCode0
pFedSOP : Accelerating Training Of Personalized Federated Learning Using Second-Order Optimization0
Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification0
SAFE: Finding Sparse and Flat Minima to Improve PruningCode1
Rewriting the Budget: A General Framework for Black-Box Attacks Under Cost AsymmetryCode0
FPDANet: A Multi-Section Classification Model for Intelligent Screening of Fetal Ultrasound0
Eigenspectrum Analysis of Neural Networks without Aspect Ratio BiasCode1
Interpretable Few-Shot Image Classification via Prototypical Concept-Guided Mixture of LoRA Experts0
Recent Advances in Medical Image Classification0
KOALA++: Efficient Kalman-Based Optimization of Neural Networks with Gradient-Covariance Products0
Enhancing Interpretable Image Classification Through LLM Agents and Conditional Concept Bottleneck Models0
Quantifying task-relevant representational similarity using decision variable correlation0
Structured Pruning and Quantization for Learned Image CompressionCode0
OD3: Optimization-free Dataset Distillation for Object DetectionCode1
Towards Graph-Based Privacy-Preserving Federated Learning: ModelNet -- A ResNet-based Model Classification Dataset0
Optimal Weighted Convolution for Classification and DenosingCode2
Provably Improving Generalization of Few-Shot Models with Synthetic Data0
Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting0
SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification0
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language ModelsCode2
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection0
BIRD: Behavior Induction via Representation-structure Distillation0
Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You NeedCode0
Deep Modeling and Optimization of Medical Image ClassificationCode0
DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification0
MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification0
Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic SegmentationCode1
Leveraging Diffusion Models for Synthetic Data Augmentation in Protein Subcellular Localization Classification0
Frequency-Adaptive Discrete Cosine-ViT-ResNet Architecture for Sparse-Data Vision0
Task-Oriented Low-Label Semantic Communication With Self-Supervised Learning0
Diagnosing and Mitigating Modality Interference in Multimodal Large Language ModelsCode0
Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation ModelsCode0
Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning0
DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models0
UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models0
Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases0
Differential Privacy Analysis of Decentralized Gossip Averaging under Varying Threat Models0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed EnvironmentsCode0
Remote Sensing Image Classification with Decoupled Knowledge Distillation0
Show:102550
← PrevPage 2 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified