An English-Swahili parallel corpus and its use for neural machine translation in the news domain

2020-11-01, EAMT 2020

Felipe Sánchez-Martínez, Víctor M. Sánchez-Cartagena, Juan Antonio Pérez-Ortiz, Mikel L. Forcada, Miquel Esplà-Gomis, Andrew Secker, Susie Coleman, Julie Wall


Abstract

This paper describes our approach to creating a neural machine translation system that translates between English and Swahili (in both directions) in the news domain, as well as the process we followed to crawl the necessary parallel corpora from the Internet. We report the results of a pilot human evaluation performed by the news media organisations participating in the H2020 EU-funded project GoURMET.
