| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 |
| Multi-Task Learning for Front-End Text Processing in TTS | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice Transformer | Mar 31, 2022 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| SciCap: Generating Captions for Scientific Figures | Oct 22, 2021 | ArticlesImage Captioning | CodeCode Available | 1 |
| HUI-Audio-Corpus-German: A high quality TTS dataset | Jun 11, 2021 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems | Apr 15, 2021 | Text Normalizationtext-to-speech | CodeCode Available | 1 |
| hinglishNorm -- A Corpus of Hindi-English Code Mixed Sentences for Text Normalization | Oct 18, 2020 | SentenceText Normalization | CodeCode Available | 1 |
| Inducing Language-Agnostic Multilingual Representations | Aug 20, 2020 | Cross-Lingual TransferSentence | CodeCode Available | 1 |
| Applying the Transformer to Character-level Transduction | May 20, 2020 | Grapheme-to-Phoneme ConversionMorphological Inflection | CodeCode Available | 1 |
| Dialect Text Normalization to Normative Standard Finnish | Nov 1, 2019 | Text Normalization | CodeCode Available | 1 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Visualizing Public Opinion on X: A Real-Time Sentiment Dashboard Using VADER and DistilBERT | Apr 21, 2025 | Sentiment AnalysisSentiment Classification | —Unverified | 0 |
| Chain of Correction for Full-text Speech Recognition with Large Language Models | Apr 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Misspellings in Natural Language Processing: A survey | Jan 28, 2025 | Data AugmentationMachine Translation | —Unverified | 0 |
| Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment | Dec 18, 2024 | Lightweight DeploymentSemantic Parsing | CodeCode Available | 0 |
| Neural Text Normalization for Luxembourgish using Real-Life Variation Data | Dec 12, 2024 | Text Normalization | —Unverified | 0 |
| Machine Learning Driven Smishing Detection Framework for Mobile Security | Dec 9, 2024 | ManagementMobile Security | —Unverified | 0 |
| Hybrid Deep Learning for Legal Text Analysis: Predicting Punishment Durations in Indonesian Court Rulings | Oct 26, 2024 | Computational EfficiencyDocument Summarization | —Unverified | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | Sep 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Historical German Text Normalization Using Type- and Token-Based Language Modeling | Sep 4, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Sep 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Is text normalization relevant for classifying medieval charters? | Aug 29, 2024 | Document DatingText Normalization | —Unverified | 0 |
| Positional Description for Numerical Normalization | Aug 22, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization | Jul 23, 2024 | Automatic Speech RecognitionDistant Speech Recognition | —Unverified | 0 |
| Exploiting Dialect Identification in Automatic Dialectal Text Normalization | Jul 3, 2024 | Dialect IdentificationText Normalization | —Unverified | 0 |
| Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling | Apr 14, 2024 | Polyphone disambiguationText Normalization | —Unverified | 0 |
| Normalization of Lithuanian Text Using Regular Expressions | Dec 29, 2023 | Speech SynthesisText Normalization | —Unverified | 0 |
| A Chat About Boring Problems: Studying GPT-based text normalization | Sep 23, 2023 | Prompt EngineeringText Normalization | —Unverified | 0 |
| Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method | Sep 12, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues | Jun 3, 2023 | BinarizationEmotion Recognition | —Unverified | 0 |
| A unified front-end framework for English text-to-speech synthesis | May 18, 2023 | Speech SynthesisText Normalization | —Unverified | 0 |
| Language Agnostic Data-Driven Inverse Text Normalization | Jan 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | Nov 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition | Nov 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection | Oct 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition | Oct 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Singlish Message Paraphrasing: A Joint Task of Creole Translation and Text Normalization | Oct 1, 2022 | Stance DetectionText Normalization | —Unverified | 0 |
| Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech | Sep 7, 2022 | ArticlesSentence | —Unverified | 0 |
| Thutmose Tagger: Single-pass neural model for Inverse Text Normalization | Jul 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Data Driven Inverse Text Normalization using Data Augmentation | Jul 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Text normalization for low-resource languages: the case of Ligurian | Jun 16, 2022 | Text Normalization | CodeCode Available | 0 |
| Adversarial Text Normalization | Jun 8, 2022 | Adversarial TextNatural Language Inference | —Unverified | 0 |
| Boring Problems Are Sometimes the Most Interesting | Jun 1, 2022 | Text Normalization | —Unverified | 0 |
| JHU IWSLT 2022 Dialect Speech Translation System Description | May 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization | Mar 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture | Mar 29, 2022 | Text Normalization | —Unverified | 0 |