SOTAVerified|Agents Browse Leaderboard About

Lemmatization

Lemmatization is a process of determining a base or dictionary form (lemma) for a given surface form. Especially for languages with rich morphology it is important to be able to normalize words into their base forms to better support for example search engines and linguistic studies. Main difficulties in Lemmatization arise from encountering previously unseen words during inference time as well as disambiguating ambiguous surface forms which can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 291–300 of 351 papers

Title	Date	Tasks	Status
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text	Apr 1, 2017	Domain AdaptationLemmatization	—Unverified
Adapting the TTL Romanian POS Tagger to the Biomedical Domain	Sep 1, 2017	ChunkingDomain Adaptation	—Unverified
A data-driven approach to verbal multiword expression detection. PARSEME Shared Task system description paper	Apr 1, 2017	feature selectionLemmatization	—Unverified
Advancing Full-Text Search Lemmatization Techniques with Paradigm Retrieval from OpenCorpora	May 18, 2023	LEMMALemmatization	—Unverified
AGILe: The First Lemmatizer for Ancient Greek Inscriptions	Jun 1, 2022	Lemmatization	—Unverified
A Gradient Boosting-Seq2Seq System for Latin POS Tagging and Lemmatization	May 1, 2020	LemmatizationPOS	—Unverified
AI-KU: Using Co-Occurrence Modeling for Semantic Similarity	Aug 1, 2014	Information RetrievalLanguage Modelling	—Unverified
A Morphological Analyzer for Shipibo-Konibo	Oct 1, 2018	LemmatizationMachine Translation	—Unverified
A Morphologically Annotated Corpus of Emirati Arabic	May 1, 2018	LemmatizationMachine Translation	—Unverified
Analyse Automatique de l’Ancien Arménien. Évaluation d’une méthode hybride « dictionnaire » et « réseau de neurones » sur un Extrait de l’Adversus Haereses d’Irénée de Lyon	Jun 1, 2022	LemmatizationLexical Analysis	—Unverified

Show:10 25 50

← PrevPage 30 of 36Next →

No leaderboard results yet.