SOTAVerified

Fine-Grained Image Classification

Fine-Grained Image Classification is a task in computer vision where the goal is to classify images into subcategories within a larger category. For example, classifying different species of birds or different types of flowers. This task is considered to be fine-grained because it requires the model to distinguish between subtle differences in visual appearance and patterns, making it more challenging than regular image classification tasks.

( Image credit: Looking for the Devil in the Details )

Papers

Showing 101150 of 353 papers

TitleStatusHype
MergedNET: A simple approach for one-shot learning in siamese networks based on similarity layersCode0
Learning Multi-Subset of Classes for Fine-Grained Food RecognitionCode0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Fine-grained Classification of Solder Joints with α-skew Jensen-Shannon Divergence0
A Continual Development Methodology for Large-scale Multitask Dynamic ML SystemsCode0
Bag of Tricks and a Strong Baseline for FGVCCode0
SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image CategorizationCode1
SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual CategorizationCode1
Conviformers: Convolutionally guided Vision TransformerCode0
Preserving Fine-Grain Feature Information in Classification via Entropic RegularizationCode0
Convolutional Fine-Grained Classification with Self-Supervised Target Relation RegularizationCode1
Visual correspondence-based explanations improve AI robustness and human-AI team accuracyCode1
Explored An Effective Methodology for Fine-Grained Snake RecognitionCode0
Few-shot Fine-grained Image Classification via Multi-Frequency Neighborhood and Double-cross ModulationCode0
ViT-NeT: Interpretable Vision Transformers with Neural Tree DecoderCode1
Contrastive Deep SupervisionCode1
Adaptive Fine-Grained Predicates Learning for Scene Graph Generation0
Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image ClassificationCode1
0/1 Deep Neural Networks via Block Coordinate Descent0
Multi-View Active Fine-Grained RecognitionCode0
On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual RecognitionCode0
An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale Multitask Learning SystemsCode0
Fine-Grained Visual Classification using Self Assessment ClassifierCode1
Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization0
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification0
Reinforcing Generated Images via Meta-learning for One-Shot Fine-Grained Visual Recognition0
Fine-Grained Predicates Learning for Scene Graph GenerationCode1
Knowledge Mining with Scene Text for Fine-Grained RecognitionCode1
ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator0
Three things everyone should know about Vision TransformersCode0
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine SynergyCode1
Automatic Fine-grained Glomerular Lesion Recognition in Kidney Pathology0
Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal InformationCode1
MetaFormer: A Unified Meta Framework for Fine-Grained RecognitionCode2
Ensembles of Vision Transformers as a New Paradigm for Automated Classification in EcologyCode0
Bridge the Gap between Supervised and Unsupervised Learning for Fine-Grained Classification0
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without SupervisionCode0
Image Difference Captioning with Pre-training and Contrastive LearningCode1
A Novel Plug-in Module for Fine-Grained Visual ClassificationCode2
Dataset Condensation with Contrastive SignalsCode1
Revisiting Weakly Supervised Pre-Training of Visual Perception ModelsCode1
Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity ClassificationCode1
A Simple Episodic Linear Probe Improves Visual Recognition in the WildCode2
Cross-Part Learning for Fine-Grained Image ClassificationCode0
Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition0
Making a Bird AI Expert Work for You and MeCode1
Clue Me In: Semi-Supervised FGVC with Out-of-Distribution DataCode1
ML-Decoder: Scalable and Versatile Classification HeadCode1
Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding0
High-Order-Interaction for weakly supervised Fine-Grained Visual CategorizationCode0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TResnet-L + PMDAccuracy97.3Unverified
2CMAL-NetAccuracy97.1Unverified
3I2-HOFIAccuracy96.92Unverified
4TResNet-L + ML-DecoderAccuracy96.41Unverified
5DATAccuracy96.2Unverified
6ALIGNAccuracy96.13Unverified
7SR-GNNAccuracy96.1Unverified
8EffNet-L2 (SAM)Accuracy95.96Unverified
9SaSPA + CALAccuracy95.72Unverified
10CAPAccuracy95.7Unverified
#ModelMetricClaimedVerifiedStatus
1I2-HOFIAccuracy96.42Unverified
2SR-GNNAccuracy95.4Unverified
3Inceptionv4Accuracy95.11Unverified
4CAPAccuracy94.9Unverified
5CSQA-NetAccuracy94.7Unverified
6CMAL-NetAccuracy94.7Unverified
7TBMSL-NetAccuracy94.7Unverified
8PARTAccuracy94.6Unverified
9SaSPA + CALAccuracy94.5Unverified
10AENetAccuracy94.5Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93.1Unverified
2PIMAccuracy92.8Unverified
3MDCMAccuracy92.5Unverified
4IELTAccuracy91.8Unverified
5CAPAccuracy91.8Unverified
6SFETransAccuracy91.8Unverified
7SWAG (ViT H/14)Accuracy91.7Unverified
8ViT-NeTAccuracy91.7Unverified
9TransFGAccuracy91.7Unverified
10I2-HOFIAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1HERBSAccuracy93Unverified
2MetaFormer (MetaFormer-2,384)Accuracy93Unverified
3PIMAccuracy92.8Unverified
4MPSAAccuracy92.5Unverified
5ViT-NeT (SwinV2-B)Accuracy92.5Unverified
6CSQA-NetAccuracy92.3Unverified
7I2-HOFIAccuracy92.12Unverified
8MDCMAccuracy92Unverified
9CGLAccuracy91.7Unverified
10SR-GNNAccuracy91.2Unverified