SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 33513400 of 10419 papers

TitleStatusHype
Multi-branch CNN and grouping cascade attention for medical image classification0
Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision0
Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks0
Semantic Compositions Enhance Vision-Language Contrastive Learning0
ModelVerification.jl: a Comprehensive Toolbox for Formally Verifying Deep Neural Networks0
Image Classification for Snow Detection to Improve Pedestrian Safety0
RepAct: The Re-parameterizable Adaptive Activation FunctionCode0
Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data RegimesCode0
Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review0
Instance Temperature Knowledge DistillationCode0
Learning Visual Conditioning Tokens to Correct Domain Shift for Fully Test-time Adaptation0
Adaptive Stochastic Weight AveragingCode0
Dynamic Scheduling for Vehicle-to-Vehicle Communications Enhanced Federated Learning0
TSynD: Targeted Synthetic Data Generation for Enhanced Medical Image Classification0
Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention0
Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical InvestigationCode0
BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averagingCode0
Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuningCode0
Robustly Optimized Deep Feature Decoupling Network for Fatty Liver Diseases DetectionCode0
Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels0
Improving Quaternion Neural Networks with Quaternionic Activation Functions0
Learning in Wilson-Cowan model for metapopulationCode0
UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification0
Speeding Up Image Classifiers with Little Companions0
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks0
Improving robustness to corruptions with multiplicative weight perturbationsCode0
Jacobian Descent for Multi-Objective Optimization0
Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction0
PUDD: Towards Robust Multi-modal Prototype-based Deepfake Detection0
Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification0
DiffExplainer: Unveiling Black Box Models Via Counterfactual GenerationCode0
This actually looks like that: Proto-BagNets for local and global interpretability-by-designCode0
Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods0
Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach0
Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware MinimizationCode0
CNN Based Flank Predictor for Quadruped Animal Species0
Certification for Differentially Private Prediction in Gradient-Based TrainingCode0
Modeling & Evaluating the Performance of Convolutional Neural Networks for Classifying Steel Surface Defects0
Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency0
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target TokensCode0
LightGBM robust optimization algorithm based on topological data analysis0
MiSuRe is all you need to explain your image segmentation0
Advancing Cross-Domain Generalizability in Face Anti-Spoofing: Insights, Design, and Metrics0
Online Anchor-based Training for Image Classification Tasks0
Unleashing the Potential of Open-set Noisy Samples Against Label Noise for Medical Image Classification0
MixDiff: Mixing Natural and Synthetic Images for Robust Self-Supervised RepresentationsCode0
Privacy Preserving Federated Learning in Medical Imaging with Uncertainty EstimationCode0
Visually Consistent Hierarchical Image Classification0
BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models0
Cross-domain Open-world DiscoveryCode0
Show:102550
← PrevPage 68 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified