Inforex --- a Collaborative System for Text Corpora Annotation and Analysis Goes Open

2019-09-01 · RANLP 2019 · Code Available

Michał Marcińczuk, Marcin Oleksy

Abstract

In this paper we present the latest changes introduced to Inforex, a web-based system for qualitative and collaborative text corpora annotation and analysis. One of the most important changes is the release of the source code: the system is now available in a GitHub repository (https://github.com/CLARIN-PL/Inforex) as an open-source project. The system can be easily set up and run in a Docker container, which simplifies the installation process. The major improvements include semi-automatic text annotation, multilingual text preprocessing using CLARIN-PL web services, morphological tagging of XML documents, an improved editor for annotation attributes, a batch annotation attribute editor, morphological disambiguation, and extended word sense annotation. This paper briefly describes these improvements. We also present two use cases in which various Inforex features were used and tested in real-life projects.
