| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 |
| SciCap: Generating Captions for Scientific Figures | Oct 22, 2021 | ArticlesImage Captioning | CodeCode Available | 1 |
| Inducing Language-Agnostic Multilingual Representations | Aug 20, 2020 | Cross-Lingual TransferSentence | CodeCode Available | 1 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| hinglishNorm -- A Corpus of Hindi-English Code Mixed Sentences for Text Normalization | Oct 18, 2020 | SentenceText Normalization | CodeCode Available | 1 |
| HUI-Audio-Corpus-German: A high quality TTS dataset | Jun 11, 2021 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Applying the Transformer to Character-level Transduction | May 20, 2020 | Grapheme-to-Phoneme ConversionMorphological Inflection | CodeCode Available | 1 |
| An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice Transformer | Mar 31, 2022 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems | Apr 15, 2021 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| Dialect Text Normalization to Normative Standard Finnish | Nov 1, 2019 | Text Normalization | CodeCode Available | 1 |
| Analogy-based Text Normalization : the case of unknowns words (Normalisation de textes par analogie: le cas des mots inconnus) [in French] | Jul 1, 2014 | Spelling CorrectionText Normalization | —Unverified | 0 |
| Amharic Text Normalization with Sequence-to-Sequence Models | Sep 25, 2019 | DecoderText Normalization | —Unverified | 0 |
| Adversarial Text Normalization | Jun 8, 2022 | Adversarial TextNatural Language Inference | —Unverified | 0 |
| A Log-Linear Model for Unsupervised Text Normalization | Oct 1, 2013 | Language ModellingLexical Normalization | —Unverified | 0 |
| A Spelling Correction Corpus for Multiple Arabic Dialects | May 1, 2020 | Spelling CorrectionText Normalization | —Unverified | 0 |
| Adaptive Parser-Centric Text Normalization | Aug 1, 2013 | Machine TranslationSpeech Recognition | —Unverified | 0 |
| A Basic Language Resource Kit for Persian | May 1, 2012 | Part-Of-Speech TaggingPOS | —Unverified | 0 |
| Few-Shot and Zero-Shot Learning for Historical Text Normalization | Mar 12, 2019 | LemmatizationMulti-Task Learning | —Unverified | 0 |
| A Cascaded Approach for Social Media Text Normalization of Turkish | Apr 1, 2014 | Opinion MiningSpelling Correction | —Unverified | 0 |
| Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploiting Dialect Identification in Automatic Dialectal Text Normalization | Jul 3, 2024 | Dialect IdentificationText Normalization | —Unverified | 0 |
| A unified front-end framework for English text-to-speech synthesis | May 18, 2023 | Speech SynthesisText Normalization | —Unverified | 0 |
| A Unified Transformer-based Framework for Duplex Text Normalization | Aug 23, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatically Extracting Variant-Normalization Pairs for Japanese Text Normalization | Nov 1, 2017 | Machine TranslationMorphological Analysis | —Unverified | 0 |
| BDKG at MEDIQA 2021: System Report for the Radiology Report Summarization Task | Jun 1, 2021 | DecoderDomain Adaptation | —Unverified | 0 |
| Bekli:A Simple Approach to Twitter Text Normalization. | Jul 1, 2015 | EpidemiologySentiment Analysis | —Unverified | 0 |
| Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | Nov 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Benefits of Data Augmentation for NMT-based Text Normalization of User-Generated Content | Nov 1, 2019 | Data AugmentationDecoder | —Unverified | 0 |
| A Graph-based Approach for Contextual Text Normalization | Oct 1, 2014 | Text Normalization | —Unverified | 0 |
| An In-depth Analysis of the Effect of Text Normalization in Social Media | May 1, 2015 | Dependency Parsingnamed-entity-recognition | —Unverified | 0 |
| Fast and Accurate Reordering with ITG Transition RNN | Aug 1, 2018 | DecoderFeature Engineering | —Unverified | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | Sep 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DeepNNNER: Applying BLSTM-CNNs and Extended Lexicons to Named Entity Recognition in Tweets | Dec 1, 2016 | DiversityFeature Engineering | —Unverified | 0 |
| Data-Driven Parametric Text Normalization: Rapidly Scaling Finite-State Transduction Verbalizers to New Languages | May 1, 2020 | Text Normalization | —Unverified | 0 |
| An improved Bayesian TRIE based model for SMS text normalization | Aug 4, 2020 | Text Normalization | —Unverified | 0 |
| Creating Data in Icelandic for Text Normalization | May 1, 2021 | Text Normalization | —Unverified | 0 |
| DeepNorm-A Deep Learning Approach to Text Normalization | Dec 17, 2017 | Deep LearningGeneral Classification | —Unverified | 0 |
| Developing Resources for Automated Speech Processing of Quebec French | May 1, 2020 | SegmentationText Normalization | —Unverified | 0 |
| Context Tailoring for Text Normalization | Jun 1, 2016 | Text Normalization | —Unverified | 0 |
| An Out-of-Domain Test Suite for Dependency Parsing of German | May 1, 2014 | Dependency ParsingDomain Adaptation | —Unverified | 0 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A hybrid text normalization system using multi-head self-attention for mandarin | Nov 11, 2019 | SentenceText Normalization | —Unverified | 0 |
| Aggressive Language Detection with Joint Text Normalization via Adversarial Multi-task Learning | Sep 19, 2020 | Multi-Task LearningText Normalization | —Unverified | 0 |
| Evaluating historical text normalization systems: How well do they generalize? | Apr 7, 2018 | POSPOS Tagging | —Unverified | 0 |
| Comparing MT Approaches for Text Normalization | Sep 1, 2019 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Exploring Word Embeddings for Unsupervised Textual User-Generated Content Normalization | Apr 10, 2017 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |
| Chain of Correction for Full-text Speech Recognition with Large Language Models | Apr 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An analysis of full-size Russian complexly NER labelled corpus of Internet user reviews on the drugs based on deep learning and language neural nets | Apr 30, 2021 | Language ModellingNER | —Unverified | 0 |