SOTAVerified

Lexical Semantic Recognition

2020-04-30ACL (MWE) 2021Code Available1· sign in to hype

Nelson F. Liu, Daniel Hershcovich, Michael Kranzlein, Nathan Schneider

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In lexical semantics, full-sentence segmentation and segment labeling of various phenomena are generally treated separately, despite their interdependence. We hypothesize that a unified lexical semantic recognition task is an effective way to encapsulate previously disparate styles of annotation, including multiword expression identification / classification and supersense tagging. Using the STREUSLE corpus, we train a neural CRF sequence tagger and evaluate its performance along various axes of annotation. As the label set generalizes that of previous tasks (PARSEME, DiMSUM), we additionally evaluate how well the model generalizes to those test sets, finding that it approaches or surpasses existing models despite training only on STREUSLE. Our work also establishes baseline models and evaluation metrics for integrated and accurate modeling of lexical semantics, facilitating future work in this area.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
STREUSLEBERT (pred POS/lemmas)Tags (Full) Acc82.5Unverified
STREUSLEBERT (none)Tags (Full) Acc82Unverified
STREUSLEBERT (gold POS/lemmas)Tags (Full) Acc81Unverified
STREUSLEGloVe (gold POS/lemmas)Tags (Full) Acc79.3Unverified
STREUSLEGloVe (none)Tags (Full) Acc77.5Unverified
STREUSLEGloVe (pred POS/lemmas)Tags (Full) Acc77.1Unverified

Reproductions