| Granary: Speech Recognition and Translation Dataset in 25 European Languages | May 19, 2025 | Hallucination, Punctuation Restoration | Unverified | 0 |
| Chain of Correction for Full-text Speech Recognition with Large Language Models | Apr 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja | Jan 21, 2025 | document understanding, Machine Translation | Code Available | 0 |
| Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | All, Automatic Speech Recognition | Unverified | 0 |
| Fotheidil: an Automatic Transcription System for the Irish Language | Dec 31, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun | Nov 7, 2024 | Cross-Lingual Transfer, Language Modeling | Code Available | 0 |
| Spontaneous Informal Speech Dataset for Punctuation Restoration | Sep 17, 2024 | Punctuation Restoration | Code Available | 0 |
| Full-text Error Correction for Chinese Speech Recognition with Large Language Model | Sep 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LLaMA based Punctuation Restoration With Forward Pass Only Decoding | Aug 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Punctuation Restoration Improves Structure Understanding Without Supervision | Feb 13, 2024 | Chunking, Language Modeling | Code Available | 0 |