Learning a Discriminative Filter Bank within a CNN for Fine-grained Recognition

2016-11-29CVPR 2018Code Available0· sign in to hype

Yaming Wang, Vlad I. Morariu, Larry S. Davis

Code Available — Be the first to reproduce this paper.

Code

github.com/jobinkv/Ongoing
none★ 0

Abstract

Compared to earlier multistage frameworks using CNN features, recent end-to-end deep approaches for fine-grained recognition essentially enhance the mid-level learning capability of CNNs. Previous approaches achieve this by introducing an auxiliary network to infuse localization information into the main classification network, or a sophisticated feature encoding method to capture higher order feature statistics. We show that mid-level representation learning can be enhanced within the CNN framework, by learning a bank of convolutional filters that capture class-specific discriminative patches without extra part or bounding box annotations. Such a filter bank is well structured, properly initialized and discriminatively learned through a novel asymmetric multi-stream architecture with convolutional filter supervision and a non-random layer initialization. Experimental results show that our approach achieves state-of-the-art on three publicly available fine-grained recognition datasets (CUB-200-2011, Stanford Cars and FGVC-Aircraft). Ablation studies and visualizations are provided to understand our approach.

Tasks

Representation Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CUB-200-2011	DFL-CNN	Accuracy	87.4	—	Unverified
FGVC-Aircraft	DFB-CNN	Accuracy	92	—	Unverified
Stanford Cars	DFL-CNN	Accuracy	93.8	—	Unverified

Learning a Discriminative Filter Bank within a CNN for Fine-grained Recognition

Code

Abstract

Tasks

Benchmark Results

Reproductions