SOTAVerified

Fine-Grained Image Classification

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Papers

Showing 251300 of 353 papers

TitleStatusHype
TDLS: A Top-Down Layer Searching Algorithm for Generating Counterfactual Visual Explanation0
The 2021 Hotel-ID to Combat Human Trafficking Competition Dataset0
The Application of Two-level Attention Models in Deep Convolutional Neural Network for Fine-grained Image Classification0
Learning Class Unique Features in Fine-Grained Visual Classification0
Towards Privacy-Preserving Fine-Grained Visual Classification via Hierarchical Learning from Label Proportions0
Transformer with Peak Suppression and Knowledge Guidance for Fine-grained Image Recognition0
TransIFC: Invariant Cues-aware Feature Concentration Learning for Efficient Fine-grained Bird Image Classification0
Unsupervised Learning using Pretrained CNN and Associative Memory Bank0
Unsupervised Part Mining for Fine-grained Image Classification0
Semantically-Prompted Language Models Improve Visual Descriptions0
ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator0
Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification0
Weakly Supervised Bilinear Attention Network for Fine-Grained Visual Classification0
Weakly Supervised Fine-Grained Image Categorization0
Weakly Supervised Fine-Grained Image Classification via Guassian Mixture Model Oriented Discriminative Learning0
Webly Supervised Learning Meets Zero-Shot Learning: A Hybrid Approach for Fine-Grained Classification0
When Does Contrastive Visual Representation Learning Work?0
Where to Focus: Deep Attention-based Spatially Recurrent Bilinear Networks for Fine-Grained Visual Recognition0
Zero-Shot Fine-Grained Classification by Deep Feature Learning with Semantics0
Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic EmbeddingCode0
A Continual Development Methodology for Large-scale Multitask Dynamic ML SystemsCode0
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashingCode0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross ModulationCode0
Bilinear CNNs for Fine-grained Visual RecognitionCode0
Bag of Tricks and a Strong Baseline for FGVCCode0
Rethinking Softmax with Cross-Entropy: Neural Network Classifier as Mutual Information EstimatorCode0
Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)Code0
Attribute Mix: Semantic Data Augmentation for Fine Grained RecognitionCode0
Extremely Fine-Grained Visual Classification over Resembling Glyphs in the WildCode0
A Large-Scale Car Dataset for Fine-Grained Categorization and VerificationCode0
Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data RegimesCode0
Explored An Effective Methodology for Fine-Grained Snake RecognitionCode0
Evaluation of Output Embeddings for Fine-Grained Image ClassificationCode0
Ensembles of Vision Transformers as a New Paradigm for Automated Classification in EcologyCode0
Selective Sparse Sampling for Fine-Grained Image RecognitionCode0
EnGraf-Net: Multiple Granularity Branch Network with Fine-Coarse Graft Grained for Classification TaskCode0
End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained RecognitionCode0
With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual RepresentationsCode0
Attribute-Aware Attention Model for Fine-grained Representation LearningCode0
ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and EmbeddingCode0
Attention Convolutional Binary Neural Tree for Fine-Grained Visual CategorizationCode0
Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive FieldsCode0
DS_FusionNet: Dynamic Dual-Stream Fusion with Bidirectional Knowledge Distillation for Plant Disease RecognitionCode0
DivShift: Exploring Domain-Specific Distribution Shifts in Large-Scale, Volunteer-Collected Biodiversity DatasetsCode0
Universal Fine-grained Visual Categorization by Concept Guided LearningCode0
Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car RecognitionCode0
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image RecognitionCode0
Morphing Tokens Draw Strong Masked Image ModelsCode0
LiT Tuned Models for Efficient Species DetectionCode0
Show:102550
← PrevPage 6 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TResnet-L + PMDAccuracy97.3Unverified
2CMAL-NetAccuracy97.1Unverified
3I2-HOFIAccuracy96.92Unverified
4TResNet-L + ML-DecoderAccuracy96.41Unverified
5DATAccuracy96.2Unverified
6ALIGNAccuracy96.13Unverified
7SR-GNNAccuracy96.1Unverified
8EffNet-L2 (SAM)Accuracy95.96Unverified
9SaSPA + CALAccuracy95.72Unverified
10CAPAccuracy95.7Unverified
#ModelMetricClaimedVerifiedStatus
1I2-HOFIAccuracy96.42Unverified
2SR-GNNAccuracy95.4Unverified
3Inceptionv4Accuracy95.11Unverified
4CAPAccuracy94.9Unverified
5CMAL-NetAccuracy94.7Unverified
6TBMSL-NetAccuracy94.7Unverified
7CSQA-NetAccuracy94.7Unverified
8PARTAccuracy94.6Unverified
9AENetAccuracy94.5Unverified
10SaSPA + CALAccuracy94.5Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93.1Unverified
2PIMAccuracy92.8Unverified
3MDCMAccuracy92.5Unverified
4SFETransAccuracy91.8Unverified
5CAPAccuracy91.8Unverified
6IELTAccuracy91.8Unverified
7TransFGAccuracy91.7Unverified
8SWAG (ViT H/14)Accuracy91.7Unverified
9ViT-NeTAccuracy91.7Unverified
10FFVTAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93Unverified
2MetaFormer (MetaFormer-2,384)Accuracy93Unverified
3PIMAccuracy92.8Unverified
4ViT-NeT (SwinV2-B)Accuracy92.5Unverified
5MPSAAccuracy92.5Unverified
6CSQA-NetAccuracy92.3Unverified
7I2-HOFIAccuracy92.12Unverified
8MDCMAccuracy92Unverified
9CGLAccuracy91.7Unverified
10SR-GNNAccuracy91.2Unverified