SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 361370 of 435 papers

TitleStatusHype
Urdu - Roman Transliteration via Finite State Transducers0
Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore0
Using Transliteration of Proper Names from Arabic to Latin Script to Improve English-Arabic Word Alignment0
Utilisation de la translitt\'eration arabe pour l'am\'elioration de l'alignement de mots \`a partir de corpus parall\`eles fran -arabe (Using Arabic Transliteration to Improve Word Alignment from French-Arabic Parallel Corpora) [in French]0
Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration0
Vocabulary-Based Language Similarity using Web Corpora0
Web-sentiment analysis of public comments (public reviews) for languages with limited resources such as the Kazakh language0
Weighting Finite-State Transductions With Neural Context0
What Matters Most in Morphologically Segmented SMT Models?0
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages0
Show:102550
← PrevPage 37 of 44Next →

No leaderboard results yet.