SOTAVerified

Modeling the Complexity and Descriptive Adequacy of Construction Grammars

2019-04-11WS 2018Code Available1· sign in to hype

Jonathan Dunn

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This paper uses the Minimum Description Length paradigm to model the complexity of CxGs (operationalized as the encoding size of a grammar) alongside their descriptive adequacy (operationalized as the encoding size of a corpus given a grammar). These two quantities are combined to measure the quality of potential CxGs against unannotated corpora, supporting discovery-device CxGs for English, Spanish, French, German, and Italian. The results show (i) that these grammars provide significant generalizations as measured using compression and (ii) that more complex CxGs with access to multiple levels of representation provide greater generalizations than single-representation CxGs.

Tasks

Reproductions