
Lemmatization

Lemmatization is the process of determining the base or dictionary form (lemma) of a given surface form. For languages with rich morphology in particular, it is important to be able to normalize words into their base forms to better support, for example, search engines and linguistic studies. The main difficulties in lemmatization arise from encountering previously unseen words at inference time and from disambiguating surface forms that can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks
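
The two difficulties named in the description (ambiguous surface forms and previously unseen words) can be illustrated with a toy sketch. This is not the method of the cited paper: the lexicon entries, the part-of-speech tags used as context, the suffix rules, and the function name `lemmatize` are all hypothetical and chosen only for illustration.

```python
# Toy illustration only: the lexicon, tags, and suffix rules are made-up assumptions.

# Lookup keyed by (surface form, part of speech): the tag stands in for "context"
# and lets one surface form map to different lemmas.
LEXICON = {
    ("saw", "VERB"): "see",
    ("saw", "NOUN"): "saw",
    ("leaves", "VERB"): "leave",
    ("leaves", "NOUN"): "leaf",
    ("mice", "NOUN"): "mouse",
}

# Ordered (suffix, replacement) rules, applied only to forms missing from the lexicon.
SUFFIX_RULES = [("ies", "y"), ("ing", ""), ("ed", ""), ("s", "")]


def lemmatize(form: str, pos: str) -> str:
    """Return a lemma for `form`, using `pos` to disambiguate known forms."""
    word = form.lower()
    if (word, pos) in LEXICON:
        return LEXICON[(word, pos)]
    # Unseen form: fall back to crude suffix stripping, keeping at least
    # three characters of the stem so short words are left alone.
    for suffix, replacement in SUFFIX_RULES:
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)] + replacement
    return word


print(lemmatize("saw", "VERB"))        # known form, verb reading
print(lemmatize("saw", "NOUN"))        # same form, noun reading
print(lemmatize("treebanks", "NOUN"))  # unseen form, handled by the fallback
```

A hand-written fallback like this breaks down quickly on irregular or richly inflected forms, which is the kind of generalization problem the description points to.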

Papers


Showing 91–100 of 351 papers

Title	Date	Tasks	Status
BabyFST - Towards a Finite-State Based Computational Model of Ancient Babylonian	May 1, 2020	Lemmatization, POS	Unverified
Development of email classifier in Brazilian Portuguese using feature selection for automatic response	Jul 8, 2019	Classification, feature selection	Unverified
Development of a rule-based lemmatization algorithm through Finite State Machine for Uzbek language	Oct 28, 2022	LEMMA, Lemmatization	Unverified
Automatic Translation of English Text to Indian Sign Language Synthetic Animations	Dec 1, 2016	Lemmatization, Translation	Unverified
An efficient language independent toolkit for complete morphological disambiguation	May 1, 2014	Language Modelling, Lemmatization	Unverified
Acquisition of semantic relations between terms: how far can we get with standard NLP tools?	Dec 1, 2016	Coreference Resolution, Lemmatization	Unverified
Diachronic Parsing of Pre-Standard Irish	Jun 1, 2022	Dependency Parsing, Lemmatization	Unverified
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection	Jun 1, 2022	Lemmatization, named-entity-recognition	Unverified
A Case Study of Spanish Text Transformations for Twitter Sentiment Analysis	Jun 3, 2021	Lemmatization, Opinion Mining	Unverified
Developing New Linguistic Resources and Tools for the Galician Language	May 1, 2018	Lemmatization, Named Entity Recognition (NER)	Unverified


No leaderboard results yet.