SOTAVerified|Agents Browse Leaderboard About

Lemmatization

Lemmatization is a process of determining a base or dictionary form (lemma) for a given surface form. Especially for languages with rich morphology it is important to be able to normalize words into their base forms to better support for example search engines and linguistic studies. Main difficulties in Lemmatization arise from encountering previously unseen words during inference time as well as disambiguating ambiguous surface forms which can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 161–170 of 351 papers

Title	Date	Tasks	Status
JHUBC's Submission to LT4HALA EvaLatin 2020	May 1, 2020	DecoderLemmatization	—Unverified
Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging	Oct 5, 2019	LemmatizationMorphological Tagging	—Unverified
Context Sensitive Neural Lemmatization with Lematus	Jun 1, 2018	DecoderLemmatization	—Unverified
Illinois-LH: A Denotational and Distributional Approach to Semantics	Aug 1, 2014	LemmatizationNatural Language Inference	—Unverified
Context Sensitive Lemmatization Using Two Successive Bidirectional Gated Recurrent Networks	Jul 1, 2017	AttributeLEMMA	—Unverified
A Simple Joint Model for Improved Contextual Neural Lemmatization	Apr 4, 2019	LEMMALemmatization	—Unverified
KLUE-CORE: A regression model of semantic textual similarity	Jun 1, 2013	LemmatizationQuestion Answering	—Unverified
How low is too low? A monolingual take on lemmatisation in Indian languages	Jun 1, 2021	Data AugmentationLemmatization	—Unverified
Korp --- the corpus infrastructure of Spr	May 1, 2012	Lemmatization	—Unverified
Context based lemmatizer for Polish language	Jul 23, 2022	LEMMALemmatization	—Unverified

Show:10 25 50

← PrevPage 17 of 36Next →

No leaderboard results yet.