SOTAVerified

Towards a General Abstract Meaning Representation Corpus for Brazilian Portuguese

2019-08-01WS 2019Unverified0· sign in to hype

Marco Antonio Sobrevilla Cabezudo, Thiago Pardo

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Abstract Meaning Representation (AMR) is a recent and prominent semantic representation with good acceptance and several applications in the Natural Language Processing area. For English, there is a large annotated corpus (with approximately 39K sentences) that supports the research with the representation. However, to the best of our knowledge, there is only one restricted corpus for Portuguese, which contains 1,527 sentences. In this context, this paper presents an effort to build a general purpose AMR-annotated corpus for Brazilian Portuguese by translating and adapting AMR English guidelines. Our results show that such approach is feasible, but there are some challenging phenomena to solve. More than this, efforts are necessary to increase the coverage of the corresponding lexical resource that supports the annotation.

Tasks

Reproductions