The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts
2017-04-01 · EACL 2017
Rachele Sprugnoli, Tommaso Caselli, Sara Tonelli, Giovanni Moretti
Abstract
This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.