SOTAVerified

Image Classification

Image Classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a specific label. Unlike object detection, which involves both classifying and localizing multiple objects within an image, image classification typically applies to single-object images. When the classification becomes highly fine-grained or reaches the instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 1301–1325 of 10419 papers

Title | Status | Hype
TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification | - | 0
Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases Detection | Code | 0
Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning | - | 0
Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning | Code | 0
BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging | Code | 0
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2
Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Code | 1
Speeding Up Image Classifiers with Little Companions | - | 0
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks | - | 0
Learning in Wilson-Cowan model for metapopulation | Code | 0
UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification | - | 0
Improving robustness to corruptions with multiplicative weight perturbations | Code | 0
Improving Quaternion Neural Networks with Quaternionic Activation Functions | - | 0
Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | - | 0
Jacobian Descent for Multi-Objective Optimization | - | 0
Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | - | 0
How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification | Code | 1
Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification | - | 0
PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection | - | 0
TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning | Code | 2
Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN | Code | 1
DiffExplainer: Unveiling Black Box Models Via Counterfactual Generation | Code | 0
This actually looks like that: Proto-BagNets for local and global interpretability-by-design | Code | 0
Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks | Code | 1
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CoCa (finetuned) | Top 1 Accuracy | 91 | - | Unverified
2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | - | Unverified
3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | - | Unverified
4 | DaViT-G | Top 1 Accuracy | 90.4 | - | Unverified
5 | DaViT-H | Top 1 Accuracy | 90.2 | - | Unverified
6 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | - | Unverified
7 | SwinV2-G | Top 1 Accuracy | 90.17 | - | Unverified
8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | - | Unverified
9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | - | Unverified
10 | Meta Pseudo Labels (EfficientNet-B6-Wide) | Top 1 Accuracy | 90 | - | Unverified
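
The Top 1 Accuracy metric in the table above is conventionally the percentage of test images whose highest-scoring predicted class matches the ground-truth label. The following is a minimal sketch of that computation in plain Python. The function name `top1_accuracy` and the toy scores and labels are illustrative assumptions, not taken from any listed model or from the benchmark's evaluation code.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose top-scoring class equals the true label.

    Hypothetical helper for illustration only.
    scores[i][c] is the model's score for class c on sample i.
    labels[i] is the ground-truth class index for sample i.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data (made up): 4 samples, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # top-1 prediction: class 1
    [0.8, 0.1, 0.1],  # top-1 prediction: class 0
    [0.3, 0.3, 0.4],  # top-1 prediction: class 2
    [0.2, 0.5, 0.3],  # top-1 prediction: class 1
]
toy_labels = [1, 0, 1, 1]

print(top1_accuracy(toy_scores, toy_labels))  # 75.0 (3 of 4 correct)
```

Under this definition, a claimed value of 91 would mean that 91% of evaluation images received the correct label as the single top prediction.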