SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 24512500 of 10419 papers

TitleStatusHype
Expert Kernel Generation Network Driven by Contextual Mapping for Hyperspectral Image ClassificationCode0
Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification0
Quantum Computing Supported Adversarial Attack-Resilient Autonomous Vehicle Perception Module for Traffic Sign ClassificationCode0
FLIP Reasoning ChallengeCode0
GLUSE: Enhanced Channel-Wise Adaptive Gated Linear Units SE for Onboard Satellite Earth Observation Image ClassificationCode0
Exploring Video-Based Driver Activity Recognition under Noisy LabelsCode0
Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision0
Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification0
3D Wavelet Convolutions with Extended Receptive Fields for Hyperspectral Image ClassificationCode0
Diversity-Driven Learning: Tackling Spurious Correlations and Data Heterogeneity in Federated Models0
Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey0
An Efficient Quantum Classifier Based on Hamiltonian RepresentationsCode0
Sparse Deformable Mamba for Hyperspectral Image Classification0
MGS: Markov Greedy Sums for Accurate Low-Bitwidth Floating-Point Accumulation0
Comparative Analysis of Different Methods for Classifying Polychromatic Sketches0
Hypergraph Vision Transformers: Images are More than Nodes, More than Edges0
FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations0
A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Medical Image Classification0
MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition0
Identifying regions of interest in whole slide images of renal cell carcinoma0
Memory-Modular Classification: Learning to Generalize with Memory ReplacementCode0
Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-GradientsCode0
Gaze-Guided Learning: Avoiding Shortcut Bias in Visual ClassificationCode0
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model0
Federated Learning for Medical Image Classification: A Comprehensive Benchmark0
Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability0
MASS: MoErging through Adaptive Subspace SelectionCode0
Spatial-Geometry Enhanced 3D Dynamic Snake Convolutional Neural Network for Hyperspectral Image ClassificationCode0
Scaling Federated Learning Solutions with Kubernetes for Synthesizing Histopathology ImagesCode0
Adaptive Classification of Interval-Valued Time Series0
LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection0
HQViT: Hybrid Quantum Vision Transformer for Image Classification0
A Randomized Zeroth-Order Hierarchical Framework for Heterogeneous Federated Learning0
Deep Ensembling of Multiband Images for Earth Remote Sensing and Foramnifera DataCode0
Neural Style Transfer for Synthesising a Dataset of Ancient Egyptian Hieroglyphs0
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning0
Impact of Data Duplication on Deep Neural Network-Based Image Classifiers: Robust vs. Standard Models0
Geometric Median Matching for Robust k-Subset Selection from Noisy Data0
PolygoNet: Leveraging Simplified Polygonal Representation for Effective Image ClassificationCode0
Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems0
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization0
Expanding-and-Shrinking Binary Neural NetworksCode0
Crossmodal Knowledge Distillation with WordNet-Relaxed Text Embeddings for Robust Image Classification0
Over-the-Air Edge Inference via End-to-End Metasurfaces-Integrated Artificial Neural Networks0
PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI LocalizationCode0
KernelDNA: Dynamic Kernel Sharing via Decoupled Naive AdaptersCode0
Efficient Dynamic Attention 3D Convolution for Hyperspectral Image ClassificationCode0
FairSAM: Fair Classification on Corrupted Data Through Sharpness-Aware Minimization0
DC-SGD: Differentially Private SGD with Dynamic Clipping through Gradient Norm Distribution Estimation0
Diffusion models applied to skin and oral cancer classification0
Show:102550
← PrevPage 50 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified