SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 34513500 of 10419 papers

TitleStatusHype
Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion ModelsCode0
Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification0
Kolmogorov-Arnold Network for Satellite Image Classification in Remote SensingCode0
An Effective Weight Initialization Method for Deep Learning: Application to Satellite Image ClassificationCode0
Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank0
From Seedling to Harvest: The GrowingSoy Dataset for Weed Detection in Soy Crops via Instance SegmentationCode0
CONFINE: Conformal Prediction for Interpretable Neural Networks0
Robust Stable Spiking Neural NetworksCode0
Non-Federated Multi-Task Split Learning for Heterogeneous Sources0
GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification0
You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet0
Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space0
Improving Generalization and Convergence by Enhancing Implicit RegularizationCode0
Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study0
Occam Gradient DescentCode0
Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation0
LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification0
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships0
I Bet You Did Not Mean That: Testing Semantic Importance via BettingCode0
Verifiably Robust Conformal PredictionCode0
GIST: Greedy Independent Set Thresholding for Diverse Data Summarization0
MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution0
It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap0
WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average0
Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators0
On Understanding Attention-Based In-Context Learning for Categorical Data0
Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image ClassificationCode0
Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack0
ModelLock: Locking Your Model With a Spell0
Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification0
Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables0
CLIP model is an Efficient Online Lifelong LearnerCode0
Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning0
A Neurosymbolic Framework for Bias Correction in Convolutional Neural Networks0
LLS: Local Learning Rule for Deep Neural Networks Inspired by Neural Activity SynchronizationCode0
What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language ModelsCode0
Transformer-based Federated Learning for Multi-Label Remote Sensing Image Classification0
Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene ImagesCode0
Harnessing Increased Client Participation with Cohort-Parallel Federated Learning0
Scalable Visual State Space Model with Fractal Scanning0
Pre-Trained Vision-Language Models as Partial Annotators0
Domain Wall Magnetic Tunnel Junction Reliable Integrate and Fire Neuron0
Adaptive Gradient Clipping for Robust Federated Learning0
Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property0
A Lost Opportunity for Vision-Language Models: A Comparative Study of Online Test-Time Adaptation for Vision-Language Models0
Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis0
Stochastic Online Conformal Prediction with Semi-Bandit Feedback0
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens0
Markerless retro-identification complements re-identification of individual insect subjects in archived image data of biological experiments0
FLARE up your data: Diffusion-based Augmentation Method in Astronomical ImagingCode0
Show:102550
← PrevPage 70 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5DaViT-HTop 1 Accuracy90.2Unverified
6Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10Meta Pseudo Labels (EfficientNet-B6-Wide)Top 1 Accuracy90Unverified