Lemmatization

Lemmatization is a process of determining a base or dictionary form (lemma) for a given surface form. Especially for languages with rich morphology it is important to be able to normalize words into their base forms to better support for example search engines and linguistic studies. Main difficulties in Lemmatization arise from encountering previously unseen words during inference time as well as disambiguating ambiguous surface forms which can be inflected variants of several different base forms depending on the context.

Source: Universal Lemmatizer: A Sequence to Sequence Model for Lemmatizing Universal Dependencies Treebanks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 176–200 of 351 papers

Title	Date	Tasks	Status
Facilitating Multi-Lingual Sense Annotation: Human Mediated Lemmatizer	Jan 1, 2014	LemmatizationWord Sense Disambiguation	—Unverified
Factored Machine Translation Systems for Russian-English	Aug 1, 2013	LemmatizationMachine Translation	—Unverified
Fast and Accurate Decision Trees for Natural Language Processing Tasks	Sep 1, 2017	AttributeBIG-bench Machine Learning	—Unverified
Fast Query Expansion on an Accounting Corpus using Sub-Word Embeddings	Jun 1, 2018	Information RetrievalLemmatization	—Unverified
Few-Shot and Zero-Shot Learning for Historical Text Normalization	Mar 12, 2019	LemmatizationMulti-Task Learning	—Unverified
First Steps towards the Semi-automatic Development of a Wordformation-based Lexicon of Latin	May 1, 2012	Information RetrievalLemmatization	—Unverified
FOLK-Gold ― A Gold Standard for Part-of-Speech-Tagging of Spoken German	May 1, 2016	LemmatizationPart-Of-Speech Tagging	—Unverified
Gender Profiling for Slovene Twitter communication: the Influence of Gender Marking, Content and Style	Apr 1, 2017	Gender ClassificationGeneral Classification	—Unverified
Generating a Gold Standard for a Swedish Sentiment Lexicon	May 1, 2018	LemmatizationMachine Translation	—Unverified
GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian	Dec 29, 2024	Information RetrievalLEMMA	—Unverified
H2-Golden-Retriever: Methodology and Tool for an Evidence-Based Hydrogen Research Grantsmanship	Nov 16, 2022	Lemmatizationnamed-entity-recognition	—Unverified
Handling Unknown Words in Arabic FST Morphology	Jul 1, 2012	Lemmatization	—Unverified
Harmonizing Different Lemmatization Strategies for Building a Knowledge Base of Linguistic Resources for Latin	Aug 1, 2019	LEMMALemmatization	—Unverified
HHU at SemEval-2016 Task 1: Multiple Approaches to Measuring Semantic Textual Similarity	Jun 1, 2016	LemmatizationNamed Entity Recognition (NER)	—Unverified
Holaaa!! writin like u talk is kewl but kinda hard 4 NLP	May 1, 2012	Domain AdaptationLanguage Modelling	—Unverified
How low is too low? A monolingual take on lemmatisation in Indian languages	Jun 1, 2021	Data AugmentationLemmatization	—Unverified
Illinois-LH: A Denotational and Distributional Approach to Semantics	Aug 1, 2014	LemmatizationNatural Language Inference	—Unverified
Impact of Feature Selection on Micro-Text Classification	Aug 27, 2017	ClassificationClustering	—Unverified
Improving Neural Translation Models with Linguistic Factors	Dec 1, 2016	Constituency ParsingDependency Parsing	—Unverified
Improving the Morphological Analysis of Classical Sanskrit	Dec 1, 2016	BIG-bench Machine LearningLemmatization	—Unverified
Indexation libre et contr\^ol\'ee d'articles scientifiques. Pr\'esentation et r\'esultats du d\'efi fouille de textes DEFT2012 (Controlled and free indexing of scientific papers. Presentation and results of the DEFT2012 text-mining challenge) [in French]	Jun 1, 2012	Lemmatization	—Unverified
Investigating Sub-Word Embedding Strategies for the Morphologically Rich and Free Phrase-Order Hungarian	Aug 1, 2019	LemmatizationMorphological Analysis	—Unverified
Iula2Standoff: a tool for creating standoff documents for the IULACT	May 1, 2012	LemmatizationPOS	—Unverified
IWNLP: Inverse Wiktionary for Natural Language Processing	Jul 1, 2015	LemmatizationPart-Of-Speech Tagging	—Unverified
JAIST: Combining multiple features for Answer Selection in Community Question Answering	Jun 1, 2015	Answer SelectionCommunity Question Answering	—Unverified

Show:10 25 50

← PrevPage 8 of 15Next →

No leaderboard results yet.