SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 19512000 of 10419 papers

TitleStatusHype
Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone NetworksCode0
UV-SAM: Adapting Segment Anything Model for Urban Village IdentificationCode2
Learn What You Need in Personalized Federated LearningCode0
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness0
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image ModelsCode0
How does self-supervised pretraining improve robustness against noisy labels across various medical image classification datasets?0
Activations and Gradients Compression for Model-Parallel TrainingCode0
Knee or ROC0
Efficient approximation of Earth Mover's Distance Based on Nearest Neighbor SearchCode0
A Strong Inductive Bias: Gzip for binary image classification0
Exploring Adversarial Attacks against Latent Diffusion Model from the Perspective of Adversarial Transferability0
Image edge enhancement for effective image classification0
Evaluating Data Augmentation Techniques for Coffee Leaf Disease Classification0
Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image ClassificationCode2
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision ApplicationsCode4
Interpreting and Improving Attention From the Perspective of Large Kernel Convolution0
Scissorhands: Scrub Data Influence via Connection Sensitivity in NetworksCode0
Implications of Noise in Resistive Memory on Deep Neural Networks for Image Classification0
Brave: Byzantine-Resilient and Privacy-Preserving Peer-to-Peer Federated Learning0
D3GU: Multi-Target Active Domain Adaptation via Enhancing Domain AlignmentCode0
Do Vision and Language Encoders Represent the World Similarly?Code1
Efficient Fine-Tuning with Domain Adaptation for Privacy-Preserving Vision Transformer0
Setting the Record Straight on Transformer Oversmoothing0
Image classification network enhancement methods based on knowledge injection0
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding0
Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset0
Color-S^4L: Self-supervised Semi-supervised Learning with Image Colorization0
Dual-Channel Reliable Breast Ultrasound Image Classification Based on Explainable Attribution and Uncertainty Quantification0
A Large-Scale Empirical Study on Improving the Fairness of Image Classification ModelsCode0
conv_einsum: A Framework for Representation and Fast Evaluation of Multilinear Operations in Convolutional Tensorial Neural Networks0
Transferable Learned Image Compression-Resistant Adversarial Perturbations0
End-to-End Anti-Backdoor Learning on Images and Time Series0
Prompt-driven Latent Domain Generalization for Medical Image ClassificationCode1
On the Stability of a non-hyperbolic nonlinear map with non-bounded set of non-isolated fixed points with applications to Machine LearningCode0
CrisisViT: A Robust Vision Transformer for Crisis Image ClassificationCode0
Nonlinear functional regression by functional deep neural network with kernel embedding0
Object-oriented backdoor attack against image captioning0
Benchmarking PathCLIP for Pathology Image Analysis0
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes InteractivelyCode5
Improved Zero-Shot Classification by Adapting VLMs with Text DescriptionsCode1
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN TicketCode3
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment0
Lightweight Adaptive Feature De-drifting for Compressed Image Classification0
ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual ClassificationCode0
Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training0
Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control0
Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity TrainingCode1
Bayesian Exploration of Pre-trained Models for Low-shot Image Classification0
Fair-VPT: Fair Visual Prompt Tuning for Image Classification0
Transductive Zero-Shot and Few-Shot CLIPCode1
Show:102550
← PrevPage 40 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified