SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which both classifies and localizes multiple objects within an image, image classification typically deals with single-object images. When the categories become very fine-grained or reach the instance level, the task is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1301–1350 of 10419 papers

| Title | Status | Hype |
|---|---|---|
| BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Code | 0 |
| Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Code | 0 |
| TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | | 0 |
| Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation | Code | 0 |
| Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Code | 0 |
| Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Code | 1 |
| Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | | 0 |
| Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks | | 0 |
| Speeding Up Image Classifiers with Little Companions | | 0 |
| Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | | 0 |
| Improving robustness to corruptions with multiplicative weight perturbations | Code | 0 |
| UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | | 0 |
| Learning in Wilson-Cowan model for metapopulation | Code | 0 |
| Improving Quaternion Neural Networks with Quaternionic Activation Functions | | 0 |
| Jacobian Descent for Multi-Objective Optimization | | 0 |
| Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | | 0 |
| How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification | Code | 1 |
| Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification | | 0 |
| PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | | 0 |
| TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning | Code | 2 |
| Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Code | 1 |
| Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN | Code | 1 |
| This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Code | 0 |
| DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Code | 0 |
| rKAN: Rational Kolmogorov-Arnold Networks | Code | 1 |
| Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | | 0 |
| Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization | Code | 0 |
| Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | | 0 |
| Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images | Code | 1 |
| Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency | | 0 |
| Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects | | 0 |
| Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens | Code | 0 |
| WATT: Weight Average Test-Time Adaptation of CLIP | Code | 2 |
| Certification for Differentially Private Prediction in Gradient-Based Training | Code | 0 |
| LightGBM robust optimization algorithm based on topological data analysis | | 0 |
| CNN Based Flank Predictor for Quadruped Animal Species | | 0 |
| AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification | Code | 2 |
| MixDiff: Mixing Natural and Synthetic Images for Robust Self-Supervised Representations | Code | 0 |
| LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Code | 1 |
| Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics | | 0 |
| Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Code | 0 |
| MiSuRe is all you need to explain your image segmentation | | 0 |
| Online Anchor-based Training for Image Classification Tasks | | 0 |
| Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification | | 0 |
| BSRBF-KAN: A combination of B-splines and Radial Basis Functions in Kolmogorov-Arnold Networks | Code | 1 |
| Visually Consistent Hierarchical Image Classification | | 0 |
| Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Code | 2 |
| BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models | | 0 |
| Cross-domain Open-world Discovery | Code | 0 |
| Boosting Medical Image Classification with Segmentation Foundation Model | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |
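
Top 1 Accuracy is the percentage of images for which the model's highest-scoring class matches the ground-truth label. The snippet below is a minimal sketch of that computation in plain Python. The function name `top1_accuracy`, the score matrix, and the labels are all hypothetical and chosen only for illustration; they are not taken from any model listed here.

```python
def top1_accuracy(scores, labels):
    """Return the percentage of samples whose highest-scoring class equals the label.

    scores: list of per-class score lists, one per image (hypothetical data).
    labels: list of integer ground-truth class indices, one per image.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # The predicted class is the index of the largest score (argmax).
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Hypothetical class scores for 4 images over 3 classes.
scores = [
    [0.1, 0.7, 0.2],  # predicted class 1
    [0.8, 0.1, 0.1],  # predicted class 0
    [0.3, 0.3, 0.4],  # predicted class 2
    [0.2, 0.5, 0.3],  # predicted class 1
]
labels = [1, 0, 2, 2]  # the last image is misclassified

accuracy = top1_accuracy(scores, labels)
print(f"Top 1 Accuracy: {accuracy:.2f}")  # prints "Top 1 Accuracy: 75.00"
```

Here three of the four hypothetical predictions match their labels, so the result is 75.00, on the same 0 to 100 scale as the Claimed column.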