SOTAVerified

Parallel Data, Tools and Interfaces in OPUS

2012-05-01LREC 2012Unverified0· sign in to hype

J{\"o}rg Tiedemann

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper presents the current status of OPUS, a growing language resource of parallel corpora and related tools. The focus in OPUS is to provide freely available data sets in various formats together with basic annotation to be useful for applications in computational linguistics, translation studies and cross-linguistic corpus studies. In this paper, we report about new data sets and their features, additional annotation tools and models provided from the website and essential interfaces and on-line services included in the project.

Tasks

Reproductions