
Lemmatization

Lemmatization is the process of determining the base or dictionary form (lemma) of a given surface form. For languages with rich morphology in particular, normalizing words to their base forms is important for applications such as search engines and linguistic studies. The main difficulties in lemmatization arise from encountering previously unseen words at inference time and from disambiguating surface forms that can be inflected variants of several different base forms, depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks
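
The two difficulties named in the description, ambiguous surface forms and unseen words, can be illustrated with a toy lookup-plus-rules lemmatizer. This is only a minimal sketch and not the sequence-to-sequence approach of the cited paper: the lexicon entries, the suffix rules, the part-of-speech labels, and the function name `lemmatize` are all assumptions made up for illustration.

```python
# Toy lexicon mapping (lowercased surface form, coarse POS tag) -> lemma.
# The entries are illustrative assumptions, not taken from any real resource.
LEXICON = {
    ("saw", "VERB"): "see",    # "I saw the dog"
    ("saw", "NOUN"): "saw",    # "a rusty saw"
    ("mice", "NOUN"): "mouse",
    ("better", "ADJ"): "good",
}

# Hypothetical fallback rules for forms missing from the lexicon:
# (suffix to strip, replacement), tried in order for each POS tag.
SUFFIX_RULES = {
    "VERB": [("ies", "y"), ("ing", ""), ("ed", ""), ("s", "")],
    "NOUN": [("ies", "y"), ("s", "")],
}


def lemmatize(form, pos):
    """Return a lowercased lemma guess for a surface form with a given POS tag."""
    word = form.lower()

    # 1. Ambiguity: the POS tag stands in for context and selects the reading.
    if (word, pos) in LEXICON:
        return LEXICON[(word, pos)]

    # 2. Unseen word: fall back to the first matching suffix rule,
    #    requiring that at least three characters of stem remain.
    for suffix, replacement in SUFFIX_RULES.get(pos, []):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: len(word) - len(suffix)] + replacement

    # 3. No rule applies: return the form unchanged.
    return word
```

Here the tag passed as `pos` is a crude stand-in for sentence context: `lemmatize("saw", "VERB")` and `lemmatize("saw", "NOUN")` resolve the same surface form to different lemmas. The suffix rules only gesture at how a system might generalize to unseen words, and they fail on many real forms.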

Papers


Showing 171–180 of 351 papers

Title	Date	Tasks	Status
Word-Formation Network for Czech	May 1, 2014	Lemmatization, Machine Translation	Unverified
WSD for n-best reranking and local language modeling in SMT	Jul 1, 2012	Language Modeling, Language Modelling	Unverified
YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer	Dec 1, 2016	Lemmatization, Morphological Analysis	Unverified
ZAEBUC: An Annotated Arabic-English Bilingual Writer Corpus	Jun 1, 2022	Lemmatization, Part-Of-Speech Tagging	Unverified
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages	Apr 11, 2023	Lemmatization, named-entity-recognition	Unverified
Facilitating Multi-Lingual Sense Annotation: Human Mediated Lemmatizer	Jan 1, 2014	Lemmatization, Word Sense Disambiguation	Unverified
Factored Machine Translation Systems for Russian-English	Aug 1, 2013	Lemmatization, Machine Translation	Unverified
Fast and Accurate Decision Trees for Natural Language Processing Tasks	Sep 1, 2017	Attribute, BIG-bench Machine Learning	Unverified
Fast Query Expansion on an Accounting Corpus using Sub-Word Embeddings	Jun 1, 2018	Information Retrieval, Lemmatization	Unverified
Few-Shot and Zero-Shot Learning for Historical Text Normalization	Mar 12, 2019	Lemmatization, Multi-Task Learning	Unverified



No leaderboard results yet.