Hierarchical Character-Word Models for Language Identification
2016-08-10WS 2016Code Available0· sign in to hype
Aaron Jaech, George Mulcaire, Shobhit Hathi, Mari Ostendorf, Noah A. Smith
Code Available — Be the first to reproduce this paper.
ReproduceCode
Abstract
Social media messages' brevity and unconventional spelling pose a challenge to language identification. We introduce a hierarchical model that learns character and contextualized word-level representations for language identification. Our method performs well against strong base- lines, and can also reveal code-switching.