Temporally-Informed Analysis of Named Entity Recognition

2020-07-01 · ACL 2020

Shruti Rijhwani, Daniel Preotiuc-Pietro

Abstract

Natural language processing models often have to make predictions on text data that evolves over time as a result of changes in language use or the information described in the text. However, evaluation results on existing data sets are seldom reported by taking the timestamp of the document into account. We analyze and propose methods that make better use of temporally-diverse training data, with a focus on the task of named entity recognition. To support these experiments, we introduce a novel data set of English tweets annotated with named entities. We empirically demonstrate the effect of temporal drift on performance, and how the temporal information of documents can be used to obtain better models compared to those that disregard temporal information. Our analysis gives insights into why this information is useful, in the hope of informing potential avenues of improvement for named entity recognition as well as other NLP tasks under similar experimental setups.
