Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
2016-05-01LREC 2016Unverified0· sign in to hype
Chenhui Chu, Raj Dabre, Sadao Kurohashi
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Parallel corpora are crucial for machine translation (MT), however they are quite scarce for most language pairs and domains. As comparable corpora are far more available, many studies have been conducted to extract parallel sentences from them for MT. In this paper, we exploit the neural network features acquired from neural MT for parallel sentence extraction. We observe significant improvements for both accuracy in sentence extraction and MT performance.