SOTAVerified

Exploiting Word Internal Structures for Generic Chinese Sentence Representation

2017-09-01EMNLP 2017Unverified0· sign in to hype

Shaonan Wang, Jiajun Zhang, Cheng-qing Zong

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We introduce a novel mixed characterword architecture to improve Chinese sentence representations, by utilizing rich semantic information of word internal structures. Our architecture uses two key strategies. The first is a mask gate on characters, learning the relation among characters in a word. The second is a maxpooling operation on words, adaptively finding the optimal mixture of the atomic and compositional word representations. Finally, the proposed architecture is applied to various sentence composition models, which achieves substantial performance gains over baseline models on sentence similarity task.

Tasks

Reproductions