SOTAVerified

Hierarchical Character-Word Models for Language Identification

2016-08-10WS 2016Code Available0· sign in to hype

Aaron Jaech, George Mulcaire, Shobhit Hathi, Mari Ostendorf, Noah A. Smith

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Social media messages' brevity and unconventional spelling pose a challenge to language identification. We introduce a hierarchical model that learns character and contextualized word-level representations for language identification. Our method performs well against strong base- lines, and can also reveal code-switching.

Tasks

Reproductions