SOTAVerified

Image Classification

Image classification is a fundamental task in visual recognition that aims to understand an image as a whole and assign it a single label. Unlike object detection, which classifies and localizes multiple objects within an image, image classification typically applies to images that contain a single object. When the classification becomes highly fine-grained or reaches instance level, it is often referred to as image retrieval, which also involves finding similar images in a large database.

Source: Metamorphic Testing for Object Detection Systems
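As a rough illustration of the description above, the sketch below treats classification as picking the single highest-scoring label for a whole image, then computes a top-1 accuracy of the kind reported in leaderboards. The label set, the score vectors, and the helper names `classify` and `top1_accuracy` are hypothetical and exist only for this example; a real system would obtain the scores from a trained model.

```python
from typing import Sequence


def classify(scores: Sequence[float], labels: Sequence[str]) -> str:
    """Return the one label with the highest score for the whole image."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    best_index = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best_index]


def top1_accuracy(predicted: Sequence[str], expected: Sequence[str]) -> float:
    """Percentage of images whose top-scoring label equals the true label."""
    if len(predicted) != len(expected) or len(expected) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(p == e for p, e in zip(predicted, expected))
    return 100.0 * correct / len(expected)


# Hypothetical data: one score vector per image, one score per label.
LABELS = ["cat", "dog", "bird"]
SCORES = [
    [0.7, 0.2, 0.1],
    [0.1, 0.3, 0.6],
    [0.5, 0.4, 0.1],
]
TRUTH = ["cat", "bird", "dog"]

predictions = [classify(image_scores, LABELS) for image_scores in SCORES]
accuracy = top1_accuracy(predictions, TRUTH)
print(predictions, round(accuracy, 2))
```

An object detector would instead return several (label, bounding box) pairs per image, which is why the single-label form shown here only fits the whole-image setting.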

Papers

Showing 5551-5600 of 10420 papers

| Title | Status | Hype |
|---|---|---|
| HDF: Hybrid Deep Features for Scene Image Representation | | 0 |
| HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition | | 0 |
| Ladder Networks for Semi-Supervised Hyperspectral Image Classification | | 0 |
| Have Large Vision-Language Models Mastered Art History? | | 0 |
| LANCE: Efficient Low-Precision Quantized Winograd Convolution for Neural Networks Based on Graphics Processing Units | | 0 |
| Land Cover Classification from Multi-temporal, Multi-spectral Remotely Sensed Imagery using Patch-Based Recurrent Neural Networks | | 0 |
| Deep Scene Image Classification With the MFAFVNet | | 0 |
| Landscape of Neural Architecture Search across sensors: how much do they differ ? | | 0 |
| Landslide4Sense: Reference Benchmark Data and Deep Learning Models for Landslide Detection | | 0 |
| Bayesian Test-Time Adaptation for Vision-Language Models | | 0 |
| Language-aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification | | 0 |
| AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | | 0 |
| Characterizing and Improving the Robustness of Self-Supervised Learning through Background Augmentations | | 0 |
| Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings | | 0 |
| Language-Informed Hyperspectral Image Synthesis for Imbalanced-Small Sample Classification via Semi-Supervised Conditional Diffusion Model | | 0 |
| Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images | | 0 |
| Attention Feature Fusion Network via Knowledge Propagation for Automated Respiratory Sound Classification | | 0 |
| Hashing on Nonlinear Manifolds | | 0 |
| Attention-free Spikformer: Mixing Spike Sequences with Simple Linear Transforms | | 0 |
| Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding | | 0 |
| HASeparator: Hyperplane-Assisted Softmax | | 0 |
| Large e-retailer image dataset for visual search and product classification | | 0 |
| HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images | | 0 |
| Large Language Models Implicitly Learn to See and Hear Just By Reading | | 0 |
| Cross Domain Ensemble Distillation for Domain Generalization | | 0 |
| Large Margin Multi-modal Multi-task Feature Extraction for Image Classification | | 0 |
| Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization | | 0 |
| Large-Scale 3D Scene Classification With Multi-View Volumetric CNN | | 0 |
| Harvesting Mid-level Visual Concepts from Large-Scale Internet Images | | 0 |
| Cross-domain Deep Feature Combination for Bird Species Classification with Audio-visual Data | | 0 |
| Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification | | 0 |
| Attention Enables Zero Approximation Error | | 0 |
| Harnessing The Power of Attention For Patch-Based Biomedical Image Classification | | 0 |
| Harnessing Increased Client Participation with Cohort-Parallel Federated Learning | | 0 |
| Large Scale Neural Architecture Search with Polyharmonic Splines | | 0 |
| Large-scale spatiotemporal photonic reservoir computer for image classification | | 0 |
| Cross-Domain Collaborative Learning via Cluster Canonical Correlation Analysis and Random Walker for Hyperspectral Image Classification | | 0 |
| Large-Scale Unsupervised Person Re-Identification with Contrastive Learning | | 0 |
| Large-scale Video Classification guided by Batch Normalized LSTM Translator | | 0 |
| Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions | | 0 |
| deepTerra -- AI Land Classification Made Easy | | 0 |
| Accelerating CNN inference on FPGAs: A Survey | | 0 |
| Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification | | 0 |
| Cross-domain CNN for Hyperspectral Image Classification | | 0 |
| Latent Domain Learning with Dynamic Residual Adapters | | 0 |
| Latent Enhancing AutoEncoder for Occluded Image Classification | | 0 |
| LatentGAN Autoencoder: Learning Disentangled Latent Distribution | | 0 |
| Latent Model Ensemble with Auto-localization | | 0 |
| AI-Based Copyright Detection Of An Image In a Video Using Degree Of Similarity And Image Hashing | | 0 |
| Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation | | 0 |
Page 112 of 209

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CoCa (finetuned) | Top 1 Accuracy | 91 | | Unverified |
| 2 | Model soups (BASIC-L) | Top 1 Accuracy | 90.98 | | Unverified |
| 3 | Model soups (ViT-G/14) | Top 1 Accuracy | 90.94 | | Unverified |
| 4 | DaViT-G | Top 1 Accuracy | 90.4 | | Unverified |
| 5 | Meta Pseudo Labels (EfficientNet-L2) | Top 1 Accuracy | 90.2 | | Unverified |
| 6 | DaViT-H | Top 1 Accuracy | 90.2 | | Unverified |
| 7 | SwinV2-G | Top 1 Accuracy | 90.17 | | Unverified |
| 8 | MAWS (ViT-6.5B) | Top 1 Accuracy | 90.1 | | Unverified |
| 9 | Florence-CoSwin-H | Top 1 Accuracy | 90.05 | | Unverified |
| 10 | RevCol-H | Top 1 Accuracy | 90 | | Unverified |