SOTAVerified|Agents Browse Leaderboard About Blog

Lemmatization

Lemmatization is a process of determining a base or dictionary form (lemma) for a given surface form. Especially for languages with rich morphology it is important to be able to normalize words into their base forms to better support for example search engines and linguistic studies. Main difficulties in Lemmatization arise from encountering previously unseen words during inference time as well as disambiguating ambiguous surface forms which can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 351 papers

Title	Date	Tasks	Status
Comparison of Current Approaches to Lemmatization: A Case Study in Estonian	Apr 23, 2024	ClassificationLemmatization	—Unverified
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages	Apr 19, 2024	Lemmatizationparameter-efficient fine-tuning	—Unverified
Cross-lingual Named Entity Corpus for Slavic Languages	Mar 30, 2024	LEMMALemmatization	CodeCode Available
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus	Mar 27, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Evaluating Shortest Edit Script Methods for Contextual Lemmatization	Mar 25, 2024	LEMMALemmatization	CodeCode Available
BanLemma: A Word Formation Dependent Rule and Dictionary Based Bangla Lemmatizer	Nov 6, 2023	LemmatizationSentence	CodeCode Available
The effect of stemming and lemmatization on Portuguese fake news text classification	Oct 17, 2023	LemmatizationNews Classification	—Unverified
Lexicon and Rule-based Word Lemmatization Approach for the Somali Language	Aug 3, 2023	ArticlesInformation Retrieval	CodeCode Available
Vacaspati: A Diverse Corpus of Bangla Literature	Jul 11, 2023	LemmatizationPOS	—Unverified
Advancing Full-Text Search Lemmatization Techniques with Paradigm Retrieval from OpenCorpora	May 18, 2023	LEMMALemmatization	—Unverified

Show:10 25 50

← PrevPage 4 of 36Next →

No leaderboard results yet.