Induction Networks for Few-Shot Text Classification

2019-02-27IJCNLP 2019Code Available1· sign in to hype

Ruiying Geng, Binhua Li, Yongbin Li, Xiaodan Zhu, Ping Jian, Jian Sun

Code Available — Be the first to reproduce this paper.

Code

github.com/mhw32/prototransformer-public
pytorch★ 16
github.com/laohur/RelationNet
pytorch★ 0
github.com/zhongyuchen/few-shot-text-classification
pytorch★ 0
github.com/laohur/LearnToCompareText
pytorch★ 0
github.com/hongshengxin/Induction_network
pytorch★ 0

Abstract

Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation, by innovatively leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experiment results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, proving the effectiveness of class-wise generalization in few-shot text classification.

Tasks

Few-Shot Text Classification General Classification intent-classification Intent Classification Meta-Learning Sentiment Analysis Sentiment Classification Text Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ODIC 10-way (10-shot)	Induction Networks	Accuracy	81.64	—	Unverified
ODIC 10-way (5-shot)	Induction Networks	Accuracy	78.27	—	Unverified
ODIC 5-way (10-shot)	Induction Networks	Accuracy	88.49	—	Unverified
ODIC 5-way (5-shot)	Induction Networks	Accuracy	87.16	—	Unverified

Induction Networks for Few-Shot Text Classification

Code

Abstract

Tasks

Benchmark Results

Reproductions