SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 17011750 of 10419 papers

TitleStatusHype
Pretrained ViTs Yield Versatile Representations For Medical ImagesCode1
Function-Consistent Feature DistillationCode1
Collaborative Transformers for Grounded Situation RecognitionCode1
Emerging Properties in Self-Supervised Vision TransformersCode1
Probabilistic MIMO U-Net: Efficient and Accurate Uncertainty Estimation for Pixel-wise RegressionCode1
EmergencyNet: Efficient Aerial Image Classification for Drone-Based Emergency Monitoring Using Atrous Convolutional Feature FusionCode1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot InteractionCode1
CellGAN: Conditional Cervical Cell Synthesis for Augmenting Cytopathological Image ClassificationCode1
Derivative Manipulation for General Example WeightingCode1
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationCode1
Astroformer: More Data Might not be all you need for ClassificationCode1
Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt TuningCode1
Protect, Show, Attend and Tell: Empowering Image Captioning Models with Ownership ProtectionCode1
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy LabelsCode1
Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning ParadigmCode1
Encoder-Decoder with Atrous Separable Convolution for Semantic Image SegmentationCode1
CNN Filter DB: An Empirical Investigation of Trained Convolutional FiltersCode1
Proximal Mean-field for Neural Network QuantizationCode1
A Comprehensive Approach to Unsupervised Embedding Learning based on AND AlgorithmCode1
Energy-Latency Attacks via Sponge PoisoningCode1
Engineering flexible machine learning systems by traversing functionally-invariant pathsCode1
PSAQ-ViT V2: Towards Accurate and General Data-Free Quantization for Vision TransformersCode1
A Novel Approach for detecting Normal, COVID-19 and Pneumonia patient using only binary classifications from chest CT-ScansCode1
Cervical Cytology Classification Using PCA & GWO Enhanced Deep Features SelectionCode1
Enhance Image Classification via Inter-Class Image Mixup with Diffusion ModelCode1
PseudoSeg: Designing Pseudo Labels for Semantic SegmentationCode1
Fusion of Dual Spatial Information for Hyperspectral Image ClassificationCode1
Gated Attention Coding for Training High-performance and Efficient Spiking Neural NetworksCode1
Enhancing Few-shot Image Classification with Cosine TransformerCode1
Evaluation of Deep Neural Network Domain Adaptation Techniques for Image RecognitionCode1
Channel Importance Matters in Few-Shot Image ClassificationCode1
Pyramid Hierarchical Transformer for Hyperspectral Image ClassificationCode1
A Novel Convolutional Neural Network Architecture with a Continuous SymmetryCode1
Enhancing Sharpness-Aware Optimization Through Variance SuppressionCode1
A Comprehensive Empirical Evaluation on Online Continual LearningCode1
Ensembling with Deep Generative ViewsCode1
QReLU and m-QReLU: Two novel quantum activation functions to aid medical diagnosticsCode1
PAD-Net: An Efficient Framework for Dynamic NetworksCode1
EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image ClassificationCode1
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax DiseasesCode1
Entroformer: A Transformer-based Entropy Model for Learned Image CompressionCode1
Quantum-limited stochastic optical neural networks operating at a few quanta per activationCode1
Generalized Few-Shot Video Classification with Video Retrieval and Feature GenerationCode1
EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural NetworkCode1
CHEX: CHannel EXploration for CNN Model CompressionCode1
CheXFusion: Effective Fusion of Multi-View Features using Transformers for Long-Tailed Chest X-Ray ClassificationCode1
A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network CalibrationCode1
CHiLS: Zero-Shot Image Classification with Hierarchical Label SetsCode1
Age Estimation Using Expectation of Label Distribution LearningCode1
FocusNet: Classifying Better by Focusing on Confusing ClassesCode1
Show:102550
← PrevPage 35 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified