SOTAVerified

Transliteration

Transliteration is the process of converting a word from a source (foreign) language into a target language, and it often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure of the target language. In transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city name “Manchester” has become well known to speakers of languages other than English. Such new words are often named entities that are important in cross-lingual information retrieval, information extraction, and machine translation, and they often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources
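
The idea of carrying a word's pronunciation, not its meaning, into another script can be illustrated with a toy rule-based sketch. This is not the statistical framework from the source paper: the `RULES` table and the `transliterate` function are hypothetical names introduced here, and the Latin-to-Cyrillic mapping covers only the letters needed for the "Manchester" example, not a complete or standard scheme.

```python
# Toy Latin -> Cyrillic grapheme table. Illustrative assumption only:
# it covers just the letters needed for the example below.
RULES = {
    "ch": "ч",
    "a": "а",
    "e": "е",
    "m": "м",
    "n": "н",
    "r": "р",
    "s": "с",
    "t": "т",
}


def transliterate(word, rules=RULES):
    """Greedy longest-match transliteration; unknown characters pass through."""
    max_len = max(len(key) for key in rules)
    text = word.lower()
    out = []
    i = 0
    while i < len(text):
        # Try the longest source chunk first so "ch" wins over "c" + "h".
        for size in range(min(max_len, len(text) - i), 0, -1):
            chunk = text[i:i + size]
            if chunk in rules:
                out.append(rules[chunk])
                i += size
                break
        else:
            # No rule matched at this position: copy the character unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


print(transliterate("Manchester"))  # манчестер
```

Matching the longest chunk first lets the digraph "ch" map to a single target letter, which is a small instance of following the target language's sound inventory instead of copying letters one by one. Real systems learn such correspondences from data instead of relying on a hand-written table.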

Papers

Showing 11-20 of 435 papers

Title | Status | Hype
Linguistic Analysis of Sinhala YouTube Comments on Sinhala Music Videos: A Dataset Study | - | 0
IndoNLP 2025: Shared Task on Real-Time Reverse Transliteration for Romanized Indo-Aryan languages | - | 0
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages | - | 0
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches | Code | 0
LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration | - | 0
Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition | - | 0
Romanized to Native Malayalam Script Transliteration Using an Encoder-Decoder Framework | Code | 0
PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model | - | 0
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek | Code | 2
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer | - | 0

No leaderboard results yet.