SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically deals with single-object images. When the classification becomes very fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 7776-7800 of 10420 papers

| Title | Status | Hype |
|---|---|---|
| A Machine-Synesthetic Approach To DDoS Network Attack Detection | | 0 |
| Improving the Effectiveness of Deep Generative Data | | 0 |
| Rank4Class: Examining Multiclass Classification through the Lens of Learning to Rank | | 0 |
| Improving the Deployment of Recycling Classification through Efficient Hyper-Parameter Analysis | | 0 |
| Rank Selection of CP-decomposed Convolutional Layers with Variational Bayesian Matrix Factorization | | 0 |
| Deep Ensemble Bayesian Active Learning : Addressing the Mode Collapse issue in Monte Carlo dropout via Ensembles | | 0 |
| Improving the accuracy of neural networks in analog computing-in-memory systems by a generalized quantization method | | 0 |
| Deep Ensemble Bayesian Active Learning : Adressing the Mode Collapse issue in Monte Carlo dropout via Ensembles | | 0 |
| Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation | | 0 |
| Improving the Accuracy of Learning Example Weights for Imbalance Classification | | 0 |
| Improving Tail-Class Representation with Centroid Contrastive Learning | | 0 |
| Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism | | 0 |
| Improving STDP-based Visual Feature Learning with Whitening | | 0 |
| Raw Waveform-based Audio Classification Using Sample-level CNN Architectures | | 0 |
| RBFleX-NAS: Training-Free Neural Architecture Search Using Radial Basis Function Kernel and Hyperparameter Detection | | 0 |
| RC-DARTS: Resource Constrained Differentiable Architecture Search | | 0 |
| RCKD: Response-Based Cross-Task Knowledge Distillation for Pathological Image Analysis | | 0 |
| DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers | | 0 |
| R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut | | 0 |
| Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading | | 0 |
| Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification | | 0 |
| Altogether: Image Captioning via Re-aligning Alt-text | | 0 |
| Adaptive Data Augmentation with Deep Parallel Generative Models | | 0 |
| Accurate and Efficient Similarity Search for Large Scale Face Recognition | | 0 |
| Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 | | Unverified |
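
The benchmark entries above report Top 1 Accuracy. This page does not say how the listed numbers were computed, so the following is only a minimal sketch of the conventional definition: the percentage of evaluation images whose highest-scoring predicted class equals the ground-truth label. The function name `top1_accuracy` and the toy scores and labels are hypothetical and chosen purely for illustration.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Percentage of samples whose highest-scoring class matches the true label.

    Assumption: `scores[i][c]` is the model's score for class `c` on sample `i`,
    and `labels[i]` is the integer index of the correct class.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score in this row (the top-1 prediction).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 4 samples, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1
]
toy_labels = [1, 0, 2, 2]

print(top1_accuracy(toy_scores, toy_labels))  # 75.0 (3 of 4 correct)
```

Under this definition, a claimed value such as 90.98 would mean that about 91 of every 100 evaluation images received the correct label as the model's single best guess.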