SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to assign a single label to an image as a whole. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When the categories become highly fine-grained or reach instance level, the task is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1401–1450 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| Identification of Stone Deterioration Patterns with Large Multimodal Models | Code | 0 |
| Exploring Effects of Hyperdimensional Vectors for Tsetlin Machines | | 0 |
| GrootVL: Tree Topology is All You Need in State Space Model | Code | 2 |
| UniUSNet: A Promptable Framework for Universal Ultrasound Disease Prediction and Tissue Segmentation | Code | 1 |
| Understanding the Cross-Domain Capabilities of Video-Based Few-Shot Action Recognition Models | | 0 |
| DDA: Dimensionality Driven Augmentation Search for Contrastive Learning in Laparoscopic Surgery | Code | 0 |
| Compute-Efficient Medical Image Classification with Softmax-Free Transformers and Sequence Normalization | | 0 |
| CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations | | 0 |
| Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients | | 0 |
| Visual Car Brand Classification by Implementing a Synthetic Image Dataset Creation Pipeline | | 0 |
| MultiMax: Sparse and Multi-Modal Attention Learning | Code | 1 |
| An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites | Code | 0 |
| Unsupervised Contrastive Analysis for Salient Pattern Detection using Conditional Diffusion Models | Code | 0 |
| Task-oriented Embedding Counts: Heuristic Clustering-driven Feature Fine-tuning for Whole Slide Image Classification | | 0 |
| Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing | Code | 0 |
| CONFINE: Conformal Prediction for Interpretable Neural Networks | | 0 |
| From Seedling to Harvest: The GrowingSoy Dataset for Weed Detection in Soy Crops via Instance Segmentation | Code | 0 |
| An Effective Weight Initialization Method for Deep Learning: Application to Satellite Image Classification | Code | 0 |
| Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank | | 0 |
| Non-Federated Multi-Task Split Learning for Heterogeneous Sources | | 0 |
| Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study | | 0 |
| Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space | | 0 |
| Robust Stable Spiking Neural Networks | Code | 0 |
| You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet | | 0 |
| Improving Generalization and Convergence by Enhancing Implicit Regularization | Code | 0 |
| GenMix: Combining Generative and Mixture Data Augmentation for Medical Image Classification | | 0 |
| Occam Gradient Descent | Code | 0 |
| Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation | | 0 |
| LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification | | 0 |
| GIST: Greedy Independent Set Thresholding for Diverse Data Summarization | | 0 |
| I Bet You Did Not Mean That: Testing Semantic Importance via Betting | Code | 0 |
| Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships | | 0 |
| Verifiably Robust Conformal Prediction | Code | 0 |
| Improved Generation of Adversarial Examples Against Safety-aligned LLMs | Code | 1 |
| It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap | | 0 |
| Confidence-aware multi-modality learning for eye disease screening | Code | 1 |
| 4-bit Shampoo for Memory-Efficient Network Training | Code | 1 |
| MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution | | 0 |
| DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Code | 1 |
| Why are Visually-Grounded Language Models Bad at Image Classification? | Code | 2 |
| WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average | | 0 |
| Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators | | 0 |
| On Understanding Attention-Based In-Context Learning for Categorical Data | | 0 |
| Superpixelwise Low-rank Approximation based Partial Label Learning for Hyperspectral Image Classification | Code | 0 |
| AdaFisher: Adaptive Second Order Optimization via Fisher Information | Code | 2 |
| Demystify Mamba in Vision: A Linear Attention Perspective | Code | 3 |
| Accelerating Transformers with Spectrum-Preserving Token Merging | Code | 2 |
| Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | | 0 |
| ModelLock: Locking Your Model With a Spell | | 0 |
| A Neurosymbolic Framework for Bias Correction in Convolutional Neural Networks | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
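
The Top 1 Accuracy figures above are presumably percentages of test images whose highest-scoring predicted class matches the ground-truth label, which is the usual meaning of the metric. The page does not say how the values were computed, so the following is only a minimal sketch under that assumption. The function name `top1_accuracy` and the example scores and labels are hypothetical and do not come from any listed model.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the true label.

    Assumption: scores[i][c] is the model's score for class c on sample i,
    and labels[i] is the integer index of the true class.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # argmax over class scores; on ties the lowest class index wins
        predicted = max(range(len(row)), key=row.__getitem__)
        correct += int(predicted == label)
    return 100.0 * correct / len(labels)


# Hypothetical scores for 4 images over 3 classes (illustration only)
example_scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
example_labels = [1, 0, 2, 0]

print(top1_accuracy(example_scores, example_labels))  # prints 75.0
```

Three of the four hypothetical predictions match their labels, giving 75.0. A claimed value of 91 in the table would, under the same assumption, mean that 91 of every 100 evaluation images were assigned the correct label.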