Graph Pre-training for AMR Parsing and Generation
Xuefeng Bai, Yulong Chen, Yue Zhang
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/muyeby/amrbartOfficialIn paperpytorch★ 105
- github.com/goodbai-nlp/amrbartpytorch★ 105
Abstract
Abstract meaning representation (AMR) highlights the core semantic information of text in a graph structure. Recently, pre-trained language models (PLMs) have advanced tasks of AMR parsing and AMR-to-text generation, respectively. However, PLMs are typically pre-trained on textual data, thus are sub-optimal for modeling structural knowledge. To this end, we investigate graph self-supervised training to improve the structure awareness of PLMs over AMR graphs. In particular, we introduce two graph auto-encoding strategies for graph-to-graph pre-training and four tasks to integrate text and graph information during pre-training. We further design a unified framework to bridge the gap between pre-training and fine-tuning tasks. Experiments on both AMR parsing and AMR-to-text generation show the superiority of our model. To our knowledge, we are the first to consider pre-training on semantic graphs.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Bio | AMRBART large | Smatch | 63.2 | — | Unverified |
| LDC2017T10 | AMRBART large | Smatch | 85.4 | — | Unverified |
| LDC2020T02 | AMRBART large | Smatch | 84.2 | — | Unverified |
| New3 | AMRBART large | Smatch | 76.9 | — | Unverified |
| The Little Prince | AMRBART large | Smatch | 79.8 | — | Unverified |