Automatic Data Acquisition for Event Coreference Resolution
Prafulla Kumar Choubey, Ruihong Huang
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/prafulla77/event-coref-eacl-2021OfficialIn papernone★ 0
Abstract
We propose to leverage lexical paraphrases and high precision rules informed by news discourse structure to automatically collect coreferential and non-coreferential event pairs from unlabeled English news articles. We perform both manual validation and empirical evaluation on multiple evaluation datasets with different event domains and text genres to assess the quality of our acquired event pairs. We found that a model trained on our acquired event pairs performs comparably as the supervised model when applied to new data out of the training data domains. Further, augmenting human-annotated data with the acquired event pairs provides empirical performance gains on both in-domain and out-of-domain evaluation datasets.