SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 201250 of 10419 papers

TitleStatusHype
What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale0
ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale StagesCode1
Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation0
ThyroidEffi 1.0: A Cost-Effective System for High-Performance Multi-Class Thyroid Carcinoma Classification0
Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization0
MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework0
Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models0
Bayesian continual learning and forgetting in neural networksCode1
Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification0
CheXWorld: Exploring Image World Modeling for Radiograph Representation LearningCode1
Human-aligned Deep Learning: Explainability, Causality, and Biological Inspiration0
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image AnalysisCode1
Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification0
Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image ClassificationCode0
Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign ClassificationCode0
Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision0
FLIP Reasoning ChallengeCode0
GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image ClassificationCode0
Exploring Video-Based Driver Activity Recognition under Noisy LabelsCode0
Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey0
3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image ClassificationCode0
Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models0
Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification0
LEMUR Neural Network Dataset: Towards Seamless AutoMLCode1
Sparse Deformable Mamba for Hyperspectral Image Classification0
An Efficient Quantum Classifier Based on Hamiltonian RepresentationsCode0
MGS: Markov Greedy Sums for Accurate Low-Bitwidth Floating-Point Accumulation0
A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification0
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges0
Comparative Analysis of Different Methods for Classifying Polychromatic Sketches0
FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations0
MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition0
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural NetworksCode1
Identifying regions of interest in whole slide images of renal cell carcinoma0
Memory-Modular Classification: Learning to Generalize with Memory ReplacementCode0
Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-GradientsCode0
Gaze-Guided Learning: Avoiding Shortcut Bias in Visual ClassificationCode0
Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability0
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model0
Federated Learning for Medical Image Classification: A Comprehensive Benchmark0
Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image ClassificationCode0
MASS: MoErging through Adaptive Subspace SelectionCode0
Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology ImagesCode0
Adaptive Classification of Interval-Valued Time Series0
HQViT: Hybrid Quantum Vision Transformer for Image Classification0
LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection0
Deep Ensembling of Multiband Images for Earth Remote Sensing and Foramnifera DataCode0
Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs0
A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning0
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning0
Show:102550
← PrevPage 5 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified