SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 901950 of 10419 papers

TitleStatusHype
Compressive Visual RepresentationsCode1
Compositional Explanations of NeuronsCode1
Complementary-Label Learning for Arbitrary Losses and ModelsCode1
Attentive Weights Generation for Few Shot Learning via Information MaximizationCode1
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel SizesCode1
A Partially Reversible U-Net for Memory-Efficient Volumetric Image SegmentationCode1
Attentive WaveBlock: Complementarity-enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-identification and BeyondCode1
FlowNAS: Neural Architecture Search for Optical Flow EstimationCode1
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural NetworkCode1
Focal Attention for Long-Range Interactions in Vision TransformersCode1
Concept Learners for Few-Shot LearningCode1
FoPro-KD: Fourier Prompted Effective Knowledge Distillation for Long-Tailed Medical Image RecognitionCode1
Adversarial Example Detection for DNN Models: A Review and Experimental ComparisonCode1
Forward Learning of Graph Neural NetworksCode1
Fourier Image TransformerCode1
FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image ClassificationCode1
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional ActivationsCode1
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge ComputingCode1
Communication-Efficient and Privacy-Preserving Feature-based Federated Transfer LearningCode1
From ImageNet to Image Classification: Contextualizing Progress on BenchmarksCode1
Adversarial Examples in Deep Learning for Multivariate Time Series RegressionCode1
From Pixel to Patch: Synthesize Context-aware Features for Zero-shot Semantic SegmentationCode1
Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image ClassificationCode1
Fruit Quality and Defect Image Classification with Conditional GAN Data AugmentationCode1
Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image ClassificationCode1
Function-Consistent Feature DistillationCode1
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path AggregationCode1
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic SegmentationCode1
GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical ImagingCode1
Gated Attention Coding for Training High-performance and Efficient Spiking Neural NetworksCode1
GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-ray ClassificationCode1
General E(2)-Equivariant Steerable CNNsCode1
Approaching Deep Learning through the Spectral Dynamics of WeightsCode1
Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy LabelsCode1
Augmentation Strategies for Learning with Noisy LabelsCode1
General Multi-label Image Classification with TransformersCode1
GIST: Generating Image-Specific Text for Fine-grained Object ClassificationCode1
Generative Adversarial Minority Oversampling for Spectral-Spatial Hyperspectral Image ClassificationCode1
Generative Hierarchical Features from Synthesizing ImagesCode1
Generative Interventions for Causal LearningCode1
Adversarially-Trained Deep Nets Transfer Better: Illustration on Image ClassificationCode1
Combining GANs and AutoEncoders for Efficient Anomaly DetectionCode1
Combining Human Predictions with Model Probabilities via Confusion Matrices and CalibrationCode1
GeoWINE: Geolocation based Wiki, Image,News and Event RetrievalCode1
GhostNet: More Features from Cheap OperationsCode1
Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge DistillationCode1
GLiT: Neural Architecture Search for Global and Local Image TransformerCode1
Learning Hierarchical Image Segmentation For Recognition and By RecognitionCode1
GlobalMamba: Global Image Serialization for Vision MambaCode1
Contextual Diversity for Active LearningCode1
Show:102550
← PrevPage 19 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified