Graph Pre-training for AMR Parsing and Generation

2022-03-15ACL 2022Code Available1· sign in to hype

Xuefeng Bai, Yulong Chen, Yue Zhang

Code Available — Be the first to reproduce this paper.

Code

github.com/muyeby/amrbart
OfficialIn paperpytorch★ 105
github.com/goodbai-nlp/amrbart
pytorch★ 105

Abstract

Abstract meaning representation (AMR) highlights the core semantic information of text in a graph structure. Recently, pre-trained language models (PLMs) have advanced tasks of AMR parsing and AMR-to-text generation, respectively. However, PLMs are typically pre-trained on textual data, thus are sub-optimal for modeling structural knowledge. To this end, we investigate graph self-supervised training to improve the structure awareness of PLMs over AMR graphs. In particular, we introduce two graph auto-encoding strategies for graph-to-graph pre-training and four tasks to integrate text and graph information during pre-training. We further design a unified framework to bridge the gap between pre-training and fine-tuning tasks. Experiments on both AMR parsing and AMR-to-text generation show the superiority of our model. To our knowledge, we are the first to consider pre-training on semantic graphs.

Tasks

Abstract Meaning Representation AMR Parsing AMR-to-Text Generation Text Generation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Bio	AMRBART large	Smatch	63.2	—	Unverified
LDC2017T10	AMRBART large	Smatch	85.4	—	Unverified
LDC2020T02	AMRBART large	Smatch	84.2	—	Unverified
New3	AMRBART large	Smatch	76.9	—	Unverified
The Little Prince	AMRBART large	Smatch	79.8	—	Unverified

Graph Pre-training for AMR Parsing and Generation

Code

Abstract

Tasks

Benchmark Results

Reproductions