SOTAVerified|Agents Browse Leaderboard About

Lemmatization

Lemmatization is a process of determining a base or dictionary form (lemma) for a given surface form. Especially for languages with rich morphology it is important to be able to normalize words into their base forms to better support for example search engines and linguistic studies. Main difficulties in Lemmatization arise from encountering previously unseen words during inference time as well as disambiguating ambiguous surface forms which can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 141–150 of 351 papers

Title	Date	Tasks	Status
Improving Lemmatization of Non-Standard Languages with Joint Learning	Mar 16, 2019	DecoderLanguage Modeling	CodeCode Available
Few-Shot and Zero-Shot Learning for Historical Text Normalization	Mar 12, 2019	LemmatizationMulti-Task Learning	—Unverified
Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks	Feb 3, 2019	Data AugmentationLEMMA	—Unverified
Data-Driven Morphological Analysis for Uralic Languages	Jan 1, 2019	LemmatizationMorphological Analysis	—Unverified
Joint Learning of POS and Dependencies for Multilingual Universal Dependency Parsing	Oct 1, 2018	Dependency ParsingLemmatization	CodeCode Available
NLP-Cube: End-to-End Raw Text Processing With Neural Networks	Oct 1, 2018	LemmatizationSentence	CodeCode Available
Turku Neural Parser Pipeline: An End-to-End System for the CoNLL 2018 Shared Task	Oct 1, 2018	Dependency ParsingLemmatization	—Unverified
UZH@SMM4H: System Descriptions	Oct 1, 2018	Document ClassificationGeneral Classification	—Unverified
LemmaTag: Jointly Tagging and Lemmatizing for Morphologically Rich Languages with BRNNs	Oct 1, 2018	LemmatizationMachine Translation	CodeCode Available
Attention-free encoder decoder for morphological processing	Oct 1, 2018	DecoderLemmatization	—Unverified

Show:10 25 50

← PrevPage 15 of 36Next →

No leaderboard results yet.