SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 76100 of 435 papers

TitleStatusHype
TArC: Tunisian Arabish Corpus, First complete release0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore0
A machine transliteration tool between Uzbek alphabetsCode1
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion UsersCode2
DE-ABUSE@TamilNLP-ACL 2022: Transliteration as Data Augmentation for Abuse Detection in Tamil0
UMUTeam@LT-EDI-ACL2022: Detecting homophobic and transphobic comments in Tamil0
Joint Transformer/RNN Architecture for Gesture Typing in Indic Languages0
Multilingual Abusiveness Identification on Code-Mixed Social Media Text0
ParaNames: A Massively Multilingual Entity Name CorpusCode1
Learning to pronounce as measuring cross-lingual joint orthography-phonology complexity0
Does Transliteration Help Multilingual Language Modeling?Code0
Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages0
DEEP: DEnoising Entity Pre-training for Neural Machine Translation0
English-to-Chinese Transliteration with Phonetic Back-transliteration0
ARGUABLY at ComMA@ICON: Detection of Multilingual Aggressive, Gender Biased, and Communally Charged Tweets Using Ensemble and Fine-Tuned IndicBERT0
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning0
IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languagesCode0
DEEP: DEnoising Entity Pre-training for Neural Machine Translation0
Orthographic Transliteration for Kabyle Speech RecognitionCode0
Transliterating Kurdish texts in Latin into Persian-Arabic script0
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language0
Transliteration of Foreign Words in Burmese0
Transliteration: A Simple Technique For Improving Multilingual Language Modeling0
Context based Roman-Urdu to Urdu Script Transliteration System0
Show:102550
← PrevPage 4 of 18Next →

No leaderboard results yet.