SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 601650 of 10419 papers

TitleStatusHype
Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K SamplingCode0
Identifying Bias in Deep Neural Networks Using Image TransformsCode0
ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries0
RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image ClassificationCode0
Real-valued continued fraction of straight linesCode0
Non-Convex Optimization in Federated Learning via Variance Reduction and Adaptive Learning0
Explicit and Implicit Graduated Optimization in Deep Neural NetworksCode0
CNNtention: Can CNNs do better with Attention?Code0
The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification0
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
LMM-Regularized CLIP Embeddings for Image Classification0
Does VLM Classification Benefit from LLM Description Semantics?Code1
Semi-Supervised Risk Control via Prediction-Powered Inference0
On the Generalizability of Iterative Patch Selection for Memory-Efficient High-Resolution Image ClassificationCode0
CATALOG: A Camera Trap Language-guided Contrastive Learning ModelCode0
Linked Adapters: Linking Past and Future to Present for Effective Continual Learning0
RapidNet: Multi-Level Dilated Convolution Based Mobile BackboneCode1
Err on the Side of Texture: Texture Bias on Real DataCode0
Robust image classification with multi-modal large language models0
MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization0
Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identificationCode0
DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations0
An Efficient Framework for Enhancing Discriminative Models via Diffusion TechniquesCode0
STEAM: Squeeze and Transform Enhanced Attention Module0
Stochastic Learning of Non-Conjugate Variational Posterior for Image Classification0
Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component EvaluationCode0
Learned Compression for Compressed LearningCode0
Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis0
Revisiting Weight Averaging for Model MergingCode1
Multimodal Approaches to Fair Image Classification: An Ethical Perspective0
ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts0
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge DistillationCode2
IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design PatentsCode1
Real-time Chest X-Ray Distributed Decision Support for Resource-constrained Clinics0
Leveraging Content and Context Cues for Low-Light Image EnhancementCode0
Image Classification Using Singular Value Decomposition and OptimizationCode0
An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications0
Post-Training Non-Uniform Quantization for Convolutional Neural Networks0
How transfer learning is used in generative models for image classification: improved accuracyCode0
Impact of Privacy Parameters on Deep Learning Models for Image Classification0
Convolution goes higher-order: a biologically inspired mechanism empowers image classification0
Vision Transformer-based Semantic Communications With Importance-Aware Quantization0
Hyperspectral Image Spectral-Spatial Feature Extraction via Tensor Principal Component Analysis0
Sparse autoencoders reveal selective remapping of visual concepts during adaptationCode1
MTSpark: Enabling Multi-Task Learning with Spiking Neural Networks for Generalist Agents0
Quantized and Interpretable Learning Scheme for Deep Neural Networks in Classification Task0
FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated LearningCode0
Enhancing Whole Slide Image Classification through Supervised Contrastive Domain Adaptation0
Grounding Descriptions in Images informs Zero-Shot Visual RecognitionCode1
Multisource Collaborative Domain Generalization for Cross-Scene Remote Sensing Image Classification0
Show:102550
← PrevPage 13 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified