Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

2018-11-01WS 2018Unverified0· sign in to hype

Valentin Malykh, Varvara Logacheva, Taras Khakhulin

Unverified — Be the first to reproduce this paper.

Abstract

We suggest a new language-independent architecture of robust word vectors (RoVe). It is designed to alleviate the issue of typos, which are common in almost any user-generated content, and hinder automatic text processing. Our model is morphologically motivated, which allows it to deal with unseen word forms in morphologically rich languages. We present the results on a number of Natural Language Processing (NLP) tasks and languages for the variety of related architectures and show that proposed architecture is typo-proof.

Tasks

Morphological Analysis Word Embeddings

Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

Abstract

Tasks

Reproductions