SOTAVerified

Fine-Grained Image Classification

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Papers

Showing 101125 of 353 papers

TitleStatusHype
Concept Learners for Few-Shot LearningCode1
Escaping the Big Data Paradigm with Compact TransformersCode1
ImageNet-21K Pretraining for the MassesCode1
Exploration of Class Center for Fine-Grained Visual ClassificationCode1
Knowledge Mining with Scene Text for Fine-Grained RecognitionCode1
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed ManipulationCode1
Exploring Vision Transformers for Fine-grained ClassificationCode1
Large Scale Fine-Grained Categorization and Domain-Specific Transfer LearningCode1
Contrastive Deep SupervisionCode1
Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity ClassificationCode1
ML-Decoder: Scalable and Versatile Classification HeadCode1
Learning Partial Correlation based Deep Visual Representation for Image ClassificationCode1
Feature Fusion Vision Transformer for Fine-Grained Visual CategorizationCode1
Transformer in TransformerCode1
Long-Tail Learning with Foundation Model: Heavy Fine-Tuning HurtsCode1
Learning Attentive Pairwise Interaction for Fine-Grained ClassificationCode1
Convolutional Fine-Grained Classification with Self-Supervised Target Relation RegularizationCode1
Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image ClassificationCode1
Progressive Co-Attention Network for Fine-grained Visual ClassificationCode1
Self-Supervised Learning by Estimating Twin Class DistributionsCode1
An Attention-Locating Algorithm for Eliminating Background Effects in Fine-grained Visual ClassificationCode0
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image RecognitionCode0
LiT Tuned Models for Efficient Species DetectionCode0
Correcting the Triplet Selection Bias for Triplet LossCode0
Morphing Tokens Draw Strong Masked Image ModelsCode0
Show:102550
← PrevPage 5 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TResnet-L + PMDAccuracy97.3Unverified
2CMAL-NetAccuracy97.1Unverified
3I2-HOFIAccuracy96.92Unverified
4TResNet-L + ML-DecoderAccuracy96.41Unverified
5DATAccuracy96.2Unverified
6ALIGNAccuracy96.13Unverified
7SR-GNNAccuracy96.1Unverified
8EffNet-L2 (SAM)Accuracy95.96Unverified
9SaSPA + CALAccuracy95.72Unverified
10CAPAccuracy95.7Unverified
#ModelMetricClaimedVerifiedStatus
1I2-HOFIAccuracy96.42Unverified
2SR-GNNAccuracy95.4Unverified
3Inceptionv4Accuracy95.11Unverified
4CAPAccuracy94.9Unverified
5CSQA-NetAccuracy94.7Unverified
6CMAL-NetAccuracy94.7Unverified
7TBMSL-NetAccuracy94.7Unverified
8PARTAccuracy94.6Unverified
9SaSPA + CALAccuracy94.5Unverified
10AENetAccuracy94.5Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93.1Unverified
2PIMAccuracy92.8Unverified
3MDCMAccuracy92.5Unverified
4IELTAccuracy91.8Unverified
5CAPAccuracy91.8Unverified
6SFETransAccuracy91.8Unverified
7SWAG (ViT H/14)Accuracy91.7Unverified
8ViT-NeTAccuracy91.7Unverified
9TransFGAccuracy91.7Unverified
10I2-HOFIAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93Unverified
2MetaFormer (MetaFormer-2,384)Accuracy93Unverified
3PIMAccuracy92.8Unverified
4MPSAAccuracy92.5Unverified
5ViT-NeT (SwinV2-B)Accuracy92.5Unverified
6CSQA-NetAccuracy92.3Unverified
7I2-HOFIAccuracy92.12Unverified
8MDCMAccuracy92Unverified
9CGLAccuracy91.7Unverified
10SR-GNNAccuracy91.2Unverified