E:Calm Resource: a Resource for Studying Texts Produced by French Pupils and Students
2020-05-01, LREC 2020
Lydia-Mai Ho-Dac, Serge Fleury, Claude Ponton
Abstract
The E:Calm resource is built from texts written by French pupils and students in a variety of ordinary teaching contexts. What distinguishes the E:Calm resource is that it provides an ecological data set giving a broad overview of texts written at elementary school, high school and university. This paper describes the full data processing pipeline: encoding of the main graphical aspects of the handwritten primary sources according to the TEI-P5 standard; spelling standardization; and the evaluation of POS tagging and syntactic parsing.