SOTAVerified

A Simple and Strong Baseline for End-to-End Neural RST-style Discourse Parsing

2022-10-15Code Available1· sign in to hype

Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

To promote and further develop RST-style discourse parsing models, we need a strong baseline that can be regarded as a reference for reporting reliable experimental results. This paper explores a strong baseline by integrating existing simple parsing strategies, top-down and bottom-up, with various transformer-based pre-trained language models. The experimental results obtained from two benchmark datasets demonstrate that the parsing performance strongly relies on the pretrained language models rather than the parsing strategies. In particular, the bottom-up parser achieves large performance gains compared to the current best parser when employing DeBERTa. We further reveal that language models with a span-masking scheme especially boost the parsing performance through our analysis within intra- and multi-sentential parsing, and nuclearity prediction.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Instructional-DT (Instr-DT)Top-down (BERT)Standard Parseval (Nuclearity)44.6Unverified
Instructional-DT (Instr-DT)Bottom-up (DeBERTa)Standard Parseval (Nuclearity)60Unverified
Instructional-DT (Instr-DT)Top-down (DeBERTa)Standard Parseval (Nuclearity)57.9Unverified
Instructional-DT (Instr-DT)Bottom-up (XLNet)Standard Parseval (Nuclearity)56.4Unverified
Instructional-DT (Instr-DT)Top-down (RoBERTa)Standard Parseval (Nuclearity)56.1Unverified
Instructional-DT (Instr-DT)Bottom-up (RoBERTa)Standard Parseval (Nuclearity)55.5Unverified
Instructional-DT (Instr-DT)Top-down (XLNet)Standard Parseval (Nuclearity)55.2Unverified
Instructional-DT (Instr-DT)Top-down (SpanBERT)Standard Parseval (Nuclearity)54.5Unverified
Instructional-DT (Instr-DT)Bottom-up (SpanBERT)Standard Parseval (Nuclearity)53.8Unverified
Instructional-DT (Instr-DT)Bottom-up (BERT)Standard Parseval (Nuclearity)46.3Unverified
RST-DTTop-down (DeBERTa)Standard Parseval (Full)54.4Unverified
RST-DTBottom-up (XLNet)Standard Parseval (Full)54.2Unverified
RST-DTTop-down (RoBERTa)Standard Parseval (Full)53.8Unverified
RST-DTBottom-up (RoBERTa)Standard Parseval (Full)53.7Unverified
RST-DTBottom-up (SpanBERT)Standard Parseval (Full)52.7Unverified
RST-DTTop-down (SpanBERT)Standard Parseval (Full)52.2Unverified
RST-DTTop-down (BERT)Standard Parseval (Full)46.6Unverified
RST-DTBottom-up (BERT)Standard Parseval (Full)46Unverified
RST-DTTop-down (XLNet)Standard Parseval (Full)54.8Unverified

Reproductions