SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 5651-5700 of 10420 papers

| Title | Status | Hype |
| --- | --- | --- |
| Learning Augmentation Network via Influence Functions |  | 0 |
| Learning-Based Data Storage [Vision] (Technical Report) |  | 0 |
| Learning Binary Codes and Binary Weights for Efficient Classification |  | 0 |
| Locally-Supervised Deep Hybrid Model for Scene Recognition |  | 0 |
| Hard-label Manifolds: Unexpected Advantages of Query Efficiency for Finding On-manifold Adversarial Examples |  | 0 |
| Critic Loss for Image Classification |  | 0 |
| Learning Class-to-Image Distance with Object Matchings |  | 0 |
| Learning CNN filters from user-drawn image markers for coconut-tree image classification |  | 0 |
| HAO: Hardware-aware neural Architecture Optimization for Efficient Inference |  | 0 |
| DefMamba: Deformable Visual State Space Model |  | 0 |
| FEATHERS: Federated Architecture and Hyperparameter Search |  | 0 |
| Learning Connectivity of Neural Networks from a Topological Perspective |  | 0 |
| Critical Hyper-Parameters: No Random, No Cry |  | 0 |
| Learning Consistent Deep Generative Models from Sparse Data via Prediction Constraints |  | 0 |
| Learning Contextual Dependencies with Convolutional Hierarchical Recurrent Neural Networks |  | 0 |
| Learning Continually from Low-shot Data Stream |  | 0 |
| Attention-based Natural Language Person Retrieval |  | 0 |
| Learning Cross-domain Generalizable Features by Representation Disentanglement |  | 0 |
| Handwritten digit and letter recognition using hybrid dwt-dct with knn and svm classifier |  | 0 |
| Learning cross space mapping via DNN using large scale click-through logs |  | 0 |
| CRISPnet: Color Rendition ISP Net |  | 0 |
| Benchmarking the Robustness of Semantic Segmentation Models |  | 0 |
| HAMIL: Hierarchical Aggregation-Based Multi-Instance Learning for Microscopy Image Classification |  | 0 |
| Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains |  | 0 |
| Attention-based Image Upsampling |  | 0 |
| A Hybrid Method for Training Convolutional Neural Networks |  | 0 |
| Delay Differential Neural Networks |  | 0 |
| Learning Deep Context-Network Architectures for Image Annotation |  | 0 |
| Local Binary Pattern(LBP) Optimization for Feature Extraction |  | 0 |
| Half-CNN: A General Framework for Whole-Image Regression |  | 0 |
| Cribriform pattern detection in prostate histopathological images using deep learning models |  | 0 |
| HAct: Out-of-Distribution Detection with Neural Net Activation Histograms |  | 0 |
| Learning degraded image classification with restoration data fidelity |  | 0 |
| Learning Dependency Structures for Weak Supervision Models |  | 0 |
| Learning Discriminative Features Via Weights-biased Softmax Loss |  | 0 |
| Learning Discriminative Multilevel Structured Dictionaries for Supervised Image Classification |  | 0 |
| Learning Discriminative Representation via Metric Learning for Imbalanced Medical Image Classification |  | 0 |
| Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework |  | 0 |
| HACT-Net: A Hierarchical Cell-to-Tissue Graph Neural Network for Histopathological Image Classification |  | 0 |
| CRAM: Clued Recurrent Attention Model |  | 0 |
| Diverse Knowledge Distillation (DKD): A Solution for Improving The Robustness of Ensemble Models Against Adversarial Attacks |  | 0 |
| GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets |  | 0 |
| A Hybrid Generative and Discriminative PointNet on Unordered Point Sets |  | 0 |
| Learning efficient structured dictionary for image classification |  | 0 |
| Guiding the retraining of convolutional neural networks against adversarial inputs |  | 0 |
| Creating Scalable AGI: the Open General Intelligence Framework |  | 0 |
| LMM-Regularized CLIP Embeddings for Image Classification |  | 0 |
| Learning Expressive Prompting With Residuals for Vision Transformers |  | 0 |
| LMSA: Low-relation Mutil-head Self-Attention Mechanism in Visual Transformer |  | 0 |
| Local Color Contrastive Descriptor for Image Classification |  | 0 |
Page 114 of 209

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 |  | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 |  | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 |  | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 |  | Unverified |
| 5 | DaViT-H | Top 1 Accuracy | 90.2 |  | Unverified |
| 6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 |  | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 |  | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 |  | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 |  | Unverified |
| 10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 |  | Unverified |
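
The Top 1 Accuracy metric listed above is conventionally the share of images whose single highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that computation, not the evaluation code behind any entry in the table. The function name `top1_accuracy` and the toy scores and labels are illustrative assumptions, and the result is expressed as a percentage on the assumption that the Claimed column uses that scale.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score in this row (the model's top-1 prediction).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 4 images, 3 classes (not taken from any benchmark).
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
toy_labels = [1, 0, 2, 2]
accuracy = top1_accuracy(toy_scores, toy_labels)
```

In this toy case the first three images are predicted correctly and the fourth is not, so `accuracy` is 75.0.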