Semi-supervised Word Sense Disambiguation with Neural Models
Dayu Yuan, Julian Richardson, Ryan Doherty, Colin Evans, Eric Altendorf
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Determining the intended sense of words in text - word sense disambiguation (WSD) - is a long standing problem in natural language processing. Recently, researchers have shown promising results using word vectors extracted from a neural network language model as features in WSD algorithms. However, a simple average or concatenation of word vectors for each word in a text loses the sequential and syntactic information of the text. In this paper, we study WSD with a sequence learning neural net, LSTM, to better capture the sequential and syntactic patterns of the text. To alleviate the lack of training data in all-words WSD, we employ the same LSTM in a semi-supervised label propagation classifier. We demonstrate state-of-the-art results, especially on verbs.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SemEval 2007 Task 17 | LSTM (T:OMSTI) | F1 | 60.7 | — | Unverified |
| SemEval 2007 Task 17 | LSTMLP (T:OMSTI, U:1K) | F1 | 63.3 | — | Unverified |
| SemEval 2007 Task 17 | LSTMLP (T:SemCor, U:1K) | F1 | 63.5 | — | Unverified |
| SemEval 2007 Task 17 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 63.7 | — | Unverified |
| SemEval 2007 Task 17 | LSTM (T:SemCor) | F1 | 64.2 | — | Unverified |
| SemEval 2007 Task 7 | LSTMLP (T:SemCor, U:1K) | F1 | 83.6 | — | Unverified |
| SemEval 2007 Task 7 | LSTM (T:OMSTI) | F1 | 81.1 | — | Unverified |
| SemEval 2007 Task 7 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 84.3 | — | Unverified |
| SemEval 2007 Task 7 | LSTM (T:SemCor) | F1 | 82.8 | — | Unverified |
| SemEval 2007 Task 7 | LSTMLP (T:OMSTI, U:1K) | F1 | 83.3 | — | Unverified |
| SemEval 2013 Task 12 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 67.9 | — | Unverified |
| SemEval 2013 Task 12 | LSTMLP (T:SemCor, U:1K) | F1 | 69.5 | — | Unverified |
| SemEval 2013 Task 12 | LSTMLP (T:OMSTI, U:1K) | F1 | 68.1 | — | Unverified |
| SemEval 2013 Task 12 | LSTM (T:OMSTI) | F1 | 67.3 | — | Unverified |
| SemEval 2013 Task 12 | LSTM (T:SemCor) | F1 | 67 | — | Unverified |
| Senseval-2 | LSTMLP (T:OMSTI, U:1K) | F1 | 74.4 | — | Unverified |
| Senseval-2 | LSTM (T:OMSTI) | F1 | 72.4 | — | Unverified |
| Senseval-2 | LSTM (T:SemCor) | F1 | 73.6 | — | Unverified |
| Senseval-2 | LSTMLP (T:SemCor, U:1K) | F1 | 73.8 | — | Unverified |
| Senseval-2 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 73.9 | — | Unverified |
| SensEval 3 Task 1 | LSTMLP (T:SemCor, U:1K) | F1 | 71.8 | — | Unverified |
| SensEval 3 Task 1 | LSTM (T:OMSTI) | F1 | 64.3 | — | Unverified |
| SensEval 3 Task 1 | LSTM (T:SemCor) | F1 | 69.2 | — | Unverified |
| SensEval 3 Task 1 | LSTMLP (T:OMSTI, U:1K) | F1 | 71 | — | Unverified |
| SensEval 3 Task 1 | LSTMLP (T:SemCor, U:OMSTI) | F1 | 71.1 | — | Unverified |