Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

2015-08-09EMNLP 2015Code Available0· sign in to hype

Wang Ling, Tiago Luís, Luís Marujo, Ramón Fernandez Astudillo, Silvio Amir, Chris Dyer, Alan W. black, Isabel Trancoso

Code Available — Be the first to reproduce this paper.

Code

github.com/wlin12/JNN
OfficialIn papernone★ 0

Abstract

We introduce a model for constructing vector representations of words by composing characters using bidirectional LSTMs. Relative to traditional word representation models that have independent vectors for each word type, our model requires only a single vector per character type and a fixed set of parameters for the compositional model. Despite the compactness of this model and, more importantly, the arbitrary nature of the form-function relationship in language, our "composed" word representations yield state-of-the-art results in language modeling and part-of-speech tagging. Benefits over traditional baselines are particularly pronounced in morphologically rich languages (e.g., Turkish).

Tasks

Form Language Modeling Language Modelling Part-Of-Speech Tagging

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Penn Treebank	Char Bi-LSTM	Accuracy	97.78	—	Unverified

Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

Code

Abstract

Tasks

Benchmark Results

Reproductions