| CorefUD 1.0: Coreference Meets Universal Dependencies | Jun 1, 2022 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back) | Jun 1, 2022 | Entity Linkingnamed-entity-recognition | —Unverified | 0 |
| AsNER - Annotated Dataset and Baseline for Assamese Named Entity recognition | Jun 1, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Named Entity Recognition to Detect Criminal Texts on the Web | Jun 1, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| ChiMST: A Chinese Medical Corpus for Word Segmentation and Medical Term Recognition | Jun 1, 2022 | Chinese Word Segmentationnamed-entity-recognition | CodeCode Available | 0 |
| IgboBERT Models: Building and Training Transformer Models for the Igbo Language | Jun 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Enhanced Entity Annotations for Multilingual Corpora | Jun 1, 2022 | ArticlesEntity Linking | CodeCode Available | 1 |
| GGPONC 2.0 - The German Clinical Guideline Corpus for Oncology: Curation Workflow, Annotation Policy, Baseline NER Taggers | Jun 1, 2022 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection | Jun 1, 2022 | Lemmatizationnamed-entity-recognition | —Unverified | 0 |
| A Warm Start and a Clean Crawled Corpus - A Recipe for Good Language Models | Jun 1, 2022 | Constituency ParsingGrammatical Error Detection | —Unverified | 0 |