SOTAVerified

CAS: French Corpus with Clinical Cases

2018-10-01WS 2018Unverified0· sign in to hype

Natalia Grabar, Vincent Claveau, Cl{\'e}ment Dalloux

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Textual corpora are extremely important for various NLP applications as they provide information necessary for creating, setting and testing these applications and the corresponding tools. They are also crucial for designing reliable methods and reproducible results. Yet, in some areas, such as the medical area, due to confidentiality or to ethical reasons, it is complicated and even impossible to access textual data representative of those produced in these areas. We propose the CAS corpus built with clinical cases, such as they are reported in the published scientific literature in French. We describe this corpus, currently containing over 397,000 word occurrences, and the existing linguistic and semantic annotations.

Tasks

Reproductions