SOTAVerified

E:Calm Resource: a Resource for Studying Texts Produced by French Pupils and Students

2020-05-01LREC 2020Unverified0· sign in to hype

Lydia-Mai Ho-Dac, Serge Fleury, Claude Ponton

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The E:Calm resource is constructed from French student texts produced in a variety of usual contexts of teaching. The distinction of the E:Calm resource is to provide an ecological data set that gives a broad overview of texts written at elementary school, high school and university. This paper describes the whole data processing: encoding of the main graphical aspects of the handwritten primary sources according to the TEI-P5 norm; spelling standardizing; POS tagging and syntactic parsing evaluation.

Tasks

Reproductions