Do sentence embeddings capture discourse properties of sentences from Scientific Abstracts ?

2020-11-01EMNLP (CODI) 2020Unverified0· sign in to hype

Laurine Huber, Chaker Memmadi, Mathilde Dargnat, Yannick Toussaint

Unverified — Be the first to reproduce this paper.

Abstract

We introduce four tasks designed to determine which sentence encoders best capture discourse properties of sentences from scientific abstracts, namely coherence and cohesion between clauses of a sentence, and discourse relations within sentences. We show that even if contextual encoders such as BERT or SciBERT encodes the coherence in discourse units, they do not help to predict three discourse relations commonly used in scientific abstracts. We discuss what these results underline, namely that these discourse relations are based on particular phrasing that allow non-contextual encoders to perform well.

Tasks

Sentence Sentence Embeddings

Do sentence embeddings capture discourse properties of sentences from Scientific Abstracts ?

Abstract

Tasks

Reproductions