Neural Models for Detecting Binary Semantic Textual Similarity for Algerian and MSA
2019-08-01 · WS 2019
Wafia Adouane, Jean-Philippe Bernardy, Simon Dobnik
Abstract
We explore the extent to which neural networks can learn to identify semantically equivalent sentences from a small, variable dataset using end-to-end training. We collect a new, noisy, non-standardised user-generated Algerian (ALG) dataset and translate it into Modern Standard Arabic (MSA), which serves as its regularised counterpart. We compare the performance of various models on both datasets and report the best-performing configurations. The results show that relatively simple models composed of 2 LSTM layers far outperform other, more sophisticated attention-based architectures on both the ALG and MSA datasets.