Lemmatization

Lemmatization is the process of determining the base or dictionary form (lemma) of a given surface form. For languages with rich morphology in particular, normalizing words to their base forms is important for applications such as search engines and linguistic studies. The main difficulties arise from encountering previously unseen words at inference time and from disambiguating surface forms that can be inflected variants of several different base forms, depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks
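
The two difficulties named above, ambiguous surface forms and previously unseen words, can be illustrated with a minimal sketch. This is not the sequence-to-sequence model from the cited paper. It is a toy lookup-plus-rules lemmatizer, and the lexicon entries, the suffix rules, and the `lemmatize` function name are all hypothetical and chosen only for illustration. A part-of-speech tag stands in for context.

```python
# Toy lexicon mapping (surface form, POS tag) -> lemma.
# Entries are illustrative assumptions, not taken from any real resource.
LEXICON = {
    ("saw", "VERB"): "see",
    ("saw", "NOUN"): "saw",
    ("leaves", "VERB"): "leave",
    ("leaves", "NOUN"): "leaf",
    ("mice", "NOUN"): "mouse",
}

# Hypothetical fallback rules for unseen words: (suffix, replacement),
# tried in order. Real systems learn such transformations from data.
SUFFIX_RULES = [("ies", "y"), ("ing", ""), ("ed", ""), ("s", "")]


def lemmatize(form: str, pos: str) -> str:
    """Return a lemma for `form`, using `pos` as a stand-in for context."""
    word = form.lower()

    # Known form: the POS tag disambiguates between candidate lemmas.
    if (word, pos) in LEXICON:
        return LEXICON[(word, pos)]

    # Unseen form: fall back to the first matching suffix rule, but only
    # if at least three characters of the stem would remain.
    for suffix, replacement in SUFFIX_RULES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)] + replacement

    # No rule applies: return the lowercased form unchanged.
    return word


if __name__ == "__main__":
    for form, pos in [("saw", "VERB"), ("saw", "NOUN"), ("parties", "NOUN")]:
        print(form, pos, "->", lemmatize(form, pos))
```

The same surface form "saw" maps to different lemmas depending on its tag, while "parties" is absent from the lexicon and is handled only by the fallback rule. Hand-written rules like these break quickly on irregular or morphologically rich forms, which is the gap that learned lemmatizers aim to close.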

Papers

Showing 211–220 of 351 papers

Title	Date	Tasks	Status
The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages	Apr 1, 2017	Entity Linking, Lemmatization	Unverified
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text	Apr 1, 2017	Domain Adaptation, Lemmatization	Unverified
Spelling Correction for Morphologically Rich Language: a Case Study of Russian	Apr 1, 2017	Language Modeling, Language Modelling	Unverified
Distributional regularities of verbs and verbal adjectives: Treebank evidence and broader implications	Jan 1, 2017	Lemmatization, Word Embeddings	Unverified
Acquisition of semantic relations between terms: how far can we get with standard NLP tools?	Dec 1, 2016	Coreference Resolution, Lemmatization	Unverified
YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer	Dec 1, 2016	Lemmatization, Morphological Analysis	Unverified
Improving the Morphological Analysis of Classical Sanskrit	Dec 1, 2016	BIG-bench Machine Learning, Lemmatization	Unverified
Improving Neural Translation Models with Linguistic Factors	Dec 1, 2016	Constituency Parsing, Dependency Parsing	Unverified
The Power of Language Music: Arabic Lemmatization through Patterns	Dec 1, 2016	Information Retrieval, LEMMA	Unverified
ENIAM: Categorial Syntactic-Semantic Parser for Polish	Dec 1, 2016	Information Retrieval, Lemmatization	Unverified

No leaderboard results yet.