Induction Networks for Few-Shot Text Classification
Ruiying Geng, Binhua Li, Yongbin Li, Xiaodan Zhu, Ping Jian, Jian Sun
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/mhw32/prototransformer-publicpytorch★ 16
- github.com/laohur/RelationNetpytorch★ 0
- github.com/zhongyuchen/few-shot-text-classificationpytorch★ 0
- github.com/laohur/LearnToCompareTextpytorch★ 0
- github.com/hongshengxin/Induction_networkpytorch★ 0
Abstract
Text classification tends to struggle when data is deficient or when it needs to adapt to unseen classes. In such challenging scenarios, recent studies have used meta-learning to simulate the few-shot task, in which new queries are compared to a small support set at the sample-wise level. However, this sample-wise comparison may be severely disturbed by the various expressions in the same class. Therefore, we should be able to learn a general representation of each class in the support set and then compare it to new queries. In this paper, we propose a novel Induction Network to learn such a generalized class-wise representation, by innovatively leveraging the dynamic routing algorithm in meta-learning. In this way, we find the model is able to induce and generalize better. We evaluate the proposed model on a well-studied sentiment classification dataset (English) and a real-world dialogue intent classification dataset (Chinese). Experiment results show that on both datasets, the proposed model significantly outperforms the existing state-of-the-art approaches, proving the effectiveness of class-wise generalization in few-shot text classification.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| ODIC 10-way (10-shot) | Induction Networks | Accuracy | 81.64 | — | Unverified |
| ODIC 10-way (5-shot) | Induction Networks | Accuracy | 78.27 | — | Unverified |
| ODIC 5-way (10-shot) | Induction Networks | Accuracy | 88.49 | — | Unverified |
| ODIC 5-way (5-shot) | Induction Networks | Accuracy | 87.16 | — | Unverified |