Pretrained Language Models for Sequential Sentence Classification

2019-09-09IJCNLP 2019Code Available0· sign in to hype

Arman Cohan, Iz Beltagy, Daniel King, Bhavana Dalvi, Daniel S. Weld

Code Available — Be the first to reproduce this paper.

Code

github.com/allenai/sequential_sentence_classification
OfficialIn paperpytorch★ 0

Abstract

As a step toward better document-level understanding, we explore classification of a sequence of sentences into their corresponding categories, a task that requires understanding sentences in context of the document. Recent successful models for this task have used hierarchical models to contextualize sentence representations, and Conditional Random Fields (CRFs) to incorporate dependencies between subsequent labels. In this work, we show that pretrained language models, BERT (Devlin et al., 2018) in particular, can be used for this task to capture contextual dependencies without the need for hierarchical encoding nor a CRF. Specifically, we construct a joint sentence representation that allows BERT Transformer layers to directly utilize contextual information from all words in all sentences. Our approach achieves state-of-the-art results on four datasets, including a new dataset of structured scientific abstracts.

Tasks

Classification General Classification Sentence Sentence Classification

Pretrained Language Models for Sequential Sentence Classification

Code

Abstract

Tasks

Reproductions