Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

2018-01-29Code Available0· sign in to hype

Arindam Das, Saikat Roy, Ujjwal Bhattacharya, Swapan Kumar Parui

Code Available — Be the first to reproduce this paper.

Code

github.com/hiarindam/document-image-classification-TL-SG
tf★ 0
github.com/iamarjunchandra/LayoutLM-Form-Understanding---Sequence-Labeling
pytorch★ 0
github.com/microsoft/unilm/tree/master/layoutlm
pytorch★ 0
github.com/BordiaS/layoutlm
pytorch★ 0

Abstract

In this work, a region-based Deep Convolutional Neural Network framework is proposed for document structure learning. The contribution of this work involves efficient training of region based classifiers and effective ensembling for document image classification. A primary level of `inter-domain' transfer learning is used by exporting weights from a pre-trained VGG16 architecture on the ImageNet dataset to train a document classifier on whole document images. Exploiting the nature of region based influence modelling, a secondary level of `intra-domain' transfer learning is used for rapid training of deep learning models for image segments. Finally, stacked generalization based ensembling is utilized for combining the predictions of the base deep neural network models. The proposed method achieves state-of-the-art accuracy of 92.2% on the popular RVL-CDIP document image dataset, exceeding benchmarks set by existing algorithms.

Tasks

document-image-classification Document Image Classification General Classification image-classification Image Classification Transfer Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
RVL-CDIP	Transfer Learning from VGG16 trained on Imagenet	Accuracy	92.21	—	Unverified

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Code

Abstract

Tasks

Benchmark Results

Reproductions