SOTAVerified

Fine-Grained Image Classification

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Papers

Showing 226250 of 353 papers

TitleStatusHype
Modelling Local Deep Convolutional Neural Network Features to Improve Fine-Grained Image Classification0
Fine-graind Image Classification via Combining Vision and Language0
Few-shot Learning for Domain-specific Fine-grained Image Classification0
Multimodal Semantic Transfer from Text to Image. Fine-Grained Image Classification by Distributional Semantics0
Exploring Localization for Self-supervised Fine-grained Contrastive Learning0
Unsupervised Learning using Pretrained CNN and Associative Memory Bank0
Unsupervised Part Mining for Fine-grained Image Classification0
Natural World Distribution via Adaptive Confusion Energy Regularization0
NDPNet: A novel non-linear data projection network for few-shot fine-grained image classification0
Feature Channel Adaptive Enhancement for Fine-Grained Visual Classification0
Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization0
Nonparametric Part Transfer for Fine-grained Recognition0
Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification0
Object-centric Sampling for Fine-grained Image Classification0
Alignment Enhancement Network for Fine-grained Visual Categorization0
OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning0
OmniVec: Learning robust representations with cross modal sharing0
On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual Recognition0
On the Ideal Number of Groups for Isometric Gradient Propagation0
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision0
Aligned to the Object, not to the Image: A Unified Pose-aligned Representation for Fine-grained Recognition0
Exploring Target Driven Image Classification0
Part-based R-CNNs for Fine-grained Category Detection0
Semantically-Prompted Language Models Improve Visual Descriptions0
Enhancing Multimodal In-Context Learning for Image Classification through Coreset Optimization0
Show:102550
← PrevPage 10 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TResnet-L + PMDAccuracy97.3Unverified
2CMAL-NetAccuracy97.1Unverified
3I2-HOFIAccuracy96.92Unverified
4TResNet-L + ML-DecoderAccuracy96.41Unverified
5DATAccuracy96.2Unverified
6ALIGNAccuracy96.13Unverified
7SR-GNNAccuracy96.1Unverified
8EffNet-L2 (SAM)Accuracy95.96Unverified
9SaSPA + CALAccuracy95.72Unverified
10CAPAccuracy95.7Unverified
#ModelMetricClaimedVerifiedStatus
1I2-HOFIAccuracy96.42Unverified
2SR-GNNAccuracy95.4Unverified
3Inceptionv4Accuracy95.11Unverified
4CAPAccuracy94.9Unverified
5CSQA-NetAccuracy94.7Unverified
6CMAL-NetAccuracy94.7Unverified
7TBMSL-NetAccuracy94.7Unverified
8PARTAccuracy94.6Unverified
9SaSPA + CALAccuracy94.5Unverified
10AENetAccuracy94.5Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93.1Unverified
2PIMAccuracy92.8Unverified
3MDCMAccuracy92.5Unverified
4IELTAccuracy91.8Unverified
5CAPAccuracy91.8Unverified
6SFETransAccuracy91.8Unverified
7SWAG (ViT H/14)Accuracy91.7Unverified
8ViT-NeTAccuracy91.7Unverified
9TransFGAccuracy91.7Unverified
10I2-HOFIAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93Unverified
2MetaFormer (MetaFormer-2,384)Accuracy93Unverified
3PIMAccuracy92.8Unverified
4MPSAAccuracy92.5Unverified
5ViT-NeT (SwinV2-B)Accuracy92.5Unverified
6CSQA-NetAccuracy92.3Unverified
7I2-HOFIAccuracy92.12Unverified
8MDCMAccuracy92Unverified
9CGLAccuracy91.7Unverified
10SR-GNNAccuracy91.2Unverified