SOTAVerified

Image Classification

Image Classification is a fundamental task in vision recognition that aims to understand and categorize an image as a whole under a specific label. Unlike object detection, which involves classification and location of multiple objects within an image, image classification typically pertains to single-object images. When the classification becomes highly detailed or reaches instance-level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems

Papers

Showing 32513300 of 10419 papers

TitleStatusHype
Improving GAN Training via Feature Space ShrinkageCode1
Cluster-Guided Semi-Supervised Domain Adaptation for Imbalanced Medical Image Classification0
Multi-Head Multi-Loss Model CalibrationCode0
Predicting Stock Price Movement as an Image Classification Problem0
BEL: A Bag Embedding Loss for Transformer enhances Multiple Instance Whole Slide Image Classification0
Evidence-empowered Transfer Learning for Alzheimer's Disease0
Time Series as Images: Vision Transformer for Irregularly Sampled Time SeriesCode1
Can representation learning for multimodal image registration be improved by supervision of intermediate layers?0
Empowering Networks With Scale and Rotation Equivariance Using A Similarity Convolution0
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-trainingCode0
Generic-to-Specific Distillation of Masked AutoencodersCode1
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking0
GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic ForgettingCode1
Efficient Masked Autoencoders with Self-Consistency0
Deep Learning for Identifying Iran's Cultural Heritage Buildings in Need of Conservation Using Image Classification and Grad-CAMCode0
RoPAWS: Robust Semi-supervised Representation Learning from Uncurated DataCode2
Explanations for Automatic Speech Recognition0
Dirichlet-based Uncertainty Calibration for Active Domain AdaptationCode1
Learning cross space mapping via DNN using large scale click-through logs0
A Surrogate-Assisted Highly Cooperative Coevolutionary Algorithm for Hyperparameter Optimization in Deep Convolutional Neural Network0
SATBA: An Invisible Backdoor Attack Based On Spatial Attention0
A Light-weight Deep Learning Model for Remote Sensing Image Classification0
Agile Modeling: From Concept to Classifier in Minutes0
Small Sample Hyperspectral Image Classification Based on the Random Patches Network and Recursive FilteringCode0
Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures0
A2S-NAS: Asymmetric Spectral-Spatial Neural Architecture Search For Hyperspectral Image Classification0
StudyFormer : Attention-Based and Dynamic Multi View Classifier for X-ray images0
Real-Time Damage Detection in Fiber Lifting Ropes Using Lightweight Convolutional Neural NetworksCode0
A Gradient Boosting Approach for Training Convolutional and Deep Neural NetworksCode0
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia EntitiesCode1
DISCO: Distributed Inference with Sparse Communications0
Stress and Adaptation: Applying Anna Karenina Principle in Deep Learning for Image Classification0
Deep Active Learning in the Presence of Label Noise: A Survey0
Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers0
Analysis of Real-Time Hostile Activitiy Detection from Spatiotemporal Features Using Time Distributed Deep CNNs, RNNs and Attention-Based Mechanisms0
FrankenSplit: Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mobile Edge ComputingCode1
Model-based feature selection for neural networks: A mixed-integer programming approach0
CMVAE: Causal Meta VAE for Unsupervised Meta-LearningCode0
Domain-Specific Pre-training Improves Confidence in Whole Slide Image ClassificationCode0
mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization0
MedViT: A Robust Vision Transformer for Generalized Medical Image ClassificationCode2
Deep Selector-JPEG: Adaptive JPEG Image Compression for Computer Vision in Image classification with Human Vision Criteria0
Gradient-based Wang-Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space0
Random Padding Data Augmentation0
GPT4MIA: Utilizing Generative Pre-trained Transformer (GPT-3) as A Plug-and-Play Transductive Model for Medical Image Analysis0
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers0
Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image ClassificationCode1
THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic CompressionCode1
Fossil Image Identification using Deep Learning Ensembles of Data Augmented MultiviewsCode0
Efficiency 360: Efficient Vision TransformersCode1
Show:102550
← PrevPage 66 of 209Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CoCa (finetuned)Top 1 Accuracy91Unverified
2Model soups (BASIC-L)Top 1 Accuracy90.98Unverified
3Model soups (ViT-G/14)Top 1 Accuracy90.94Unverified
4DaViT-GTop 1 Accuracy90.4Unverified
5Meta Pseudo Labels (EfficientNet-L2)Top 1 Accuracy90.2Unverified
6DaViT-HTop 1 Accuracy90.2Unverified
7SwinV2-GTop 1 Accuracy90.17Unverified
8MAWS (ViT-6.5B)Top 1 Accuracy90.1Unverified
9Florence-CoSwin-HTop 1 Accuracy90.05Unverified
10RevCol-HTop 1 Accuracy90Unverified