| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysis, named-entity-recognition | Code Available | 2 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice Transformer | Mar 31, 2022 | Text Normalization, text-to-speech | Code Available | 1 |
| indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages | Mar 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| SciCap: Generating Captions for Scientific Figures | Oct 22, 2021 | Articles, Image Captioning | Code Available | 1 |
| HUI-Audio-Corpus-German: A high quality TTS dataset | Jun 11, 2021 | Text Normalization, text-to-speech | Code Available | 1 |
| Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems | Apr 15, 2021 | Text Normalization, text-to-speech | Code Available | 1 |
| hinglishNorm -- A Corpus of Hindi-English Code Mixed Sentences for Text Normalization | Oct 18, 2020 | Sentence, Text Normalization | Code Available | 1 |
| Inducing Language-Agnostic Multilingual Representations | Aug 20, 2020 | Cross-Lingual Transfer, Sentence | Code Available | 1 |
| Applying the Transformer to Character-level Transduction | May 20, 2020 | Grapheme-to-Phoneme Conversion, Morphological Inflection | Code Available | 1 |
| Dialect Text Normalization to Normative Standard Finnish | Nov 1, 2019 | Text Normalization | Code Available | 1 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Visualizing Public Opinion on X: A Real-Time Sentiment Dashboard Using VADER and DistilBERT | Apr 21, 2025 | Sentiment Analysis, Sentiment Classification | Unverified | 0 |
| Chain of Correction for Full-text Speech Recognition with Large Language Models | Apr 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Misspellings in Natural Language Processing: A survey | Jan 28, 2025 | Data Augmentation, Machine Translation | Unverified | 0 |
| Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | All, Automatic Speech Recognition | Unverified | 0 |
| Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment | Dec 18, 2024 | Lightweight Deployment, Semantic Parsing | Code Available | 0 |
| Neural Text Normalization for Luxembourgish using Real-Life Variation Data | Dec 12, 2024 | Text Normalization | Unverified | 0 |
| Machine Learning Driven Smishing Detection Framework for Mobile Security | Dec 9, 2024 | Management, Mobile Security | Unverified | 0 |
| Hybrid Deep Learning for Legal Text Analysis: Predicting Punishment Durations in Indonesian Court Rulings | Oct 26, 2024 | Computational Efficiency, Document Summarization | Unverified | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | Sep 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Historical German Text Normalization Using Type- and Token-Based Language Modeling | Sep 4, 2024 | Decoder, Language Modeling | Unverified | 0 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Sep 4, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |