SOTAVerified

Transliteration

Transliteration converts a word from a source (foreign) language into a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure of the target language. In transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city name “Manchester” has become well known to speakers of languages other than English. Such borrowed words are often named entities that matter in cross-lingual information retrieval, information extraction, and machine translation, and they often pose out-of-vocabulary challenges for spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources
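
As a rough illustration of the idea, the sketch below rewrites “Manchester” into Cyrillic with a greedy longest-match grapheme table. It is a toy under stated assumptions, not the method of the cited paper: the `RULES` table and the `transliterate` function are hypothetical, cover only the letters of the demo word, and map spelling rather than pronunciation, whereas real systems model the phonology of both languages.

```python
# Toy English-to-Russian grapheme table (hypothetical; covers only the demo word).
# Multi-letter graphemes such as "ch" must be matched before single letters.
RULES = {
    "ch": "ч",
    "a": "а", "e": "е", "m": "м", "n": "н",
    "r": "р", "s": "с", "t": "т",
}


def transliterate(word: str, rules: dict[str, str] = RULES) -> str:
    """Greedy longest-match rewrite of Latin graphemes into Cyrillic."""
    text = word.lower()
    max_len = max(len(key) for key in rules)
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate grapheme first, then shorter ones.
        for size in range(max_len, 0, -1):
            chunk = text[i:i + size]
            if chunk in rules:
                out.append(rules[chunk])
                i += len(chunk)
                break
        else:
            # No rule applies: copy the symbol through unchanged.
            out.append(text[i])
            i += 1
    result = "".join(out)
    # Restore the initial capital of a proper name.
    return result.capitalize() if word[:1].isupper() else result


print(transliterate("Manchester"))  # → Манчестер
```

Longest-match lookup is what lets the digraph “ch” map to the single letter “ч” instead of being split into two unrelated symbols; a learned model would replace the hand-written table.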

Papers

Showing 301–350 of 435 papers

Title | Status | Hype
Normalization of Transliterated Words in Code-Mixed Data Using Seq2Seq Model & Levenshtein Distance | | 0
Normalization of Transliterated Words in Code-Mixed Data Using Seq2Seq Model & Levenshtein Distance | | 0
NRC Russian-English Machine Translation System for WMT 2016 | | 0
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts | | 0
OFFLangOne@DravidianLangTech-EACL2021: Transformers with the Class Balanced Loss for Offensive Language Identification in Dravidian Code-Mixed text. | | 0
Opinion Mining in a Code-Mixed Environment: A Case Study with Government Portals | | 0
Optimizing Multilingual Text-To-Speech with Accents & Emotions | | 0
Optimizing Transliteration for Hindi/Marathi to English Using only Two Weights | | 0
Orthographic and Morphological Processing for Persian-to-English Statistical Machine Translation | | 0
Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages | | 0
Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification | | 0
Phonologically Aware Neural Model for Named Entity Recognition in Low Resource Transfer Settings | | 0
Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources | | 0
PJAIT Systems for the IWSLT 2015 Evaluation Campaign Enhanced by Comparable Corpora | | 0
PJAIT Systems for the WMT 2016 | | 0
PJIIT's systems for WMT 2017 Conference | | 0
PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model | | 0
Portable Spelling Corrector for a Less-Resourced Language: Amharic | | 0
POS Tagging of English-Hindi Code-Mixed Social Media Content | | 0
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments | | 0
Processing Informal, Romanized Pakistani Text Messages | | 0
Proper Name Diacritization for Arabic Wikipedia: A Benchmark Dataset | | 0
Proper Name Machine Translation from Japanese to Japanese Sign Language | | 0
Putting Figures on Influences on Moroccan Darija from Arabic, French and Spanish using the WordNet | | 0
QCRI-MES Submission at WMT13: Using Transliteration Mining to Improve Statistical Machine Translation | | 0
Query Translation for Cross-Language Information Retrieval using Multilingual Word Clusters | | 0
Quillpad Multilingual Predictive Transliteration System | | 0
QuranTree.jl: A Julia Package for Quranic Arabic Corpus | | 0
Recovering Missing Characters in Old Hawaiian Writing | | 0
Regularity and Flexibility in English-Chinese Name Transliteration | | 0
Regularized Interlingual Projections: Evaluation on Multilingual Transliteration | | 0
Regulating Orthography-Phonology Relationship for English to Thai Transliteration | | 0
Report of NEWS 2012 Machine Transliteration Shared Task | | 0
Report of NEWS 2015 Machine Transliteration Shared Task | | 0
Report of NEWS 2016 Machine Transliteration Shared Task | | 0
Report of NEWS 2018 Named Entity Transliteration Shared Task | | 0
Rescoring a Phrase-based Machine Transliteration System with Recurrent Neural Network Language Models | | 0
Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models? | | 0
Review of Computational Epigraphy | | 0
Robust Dictionary Lookup in Multiple Noisy Orthographies | | 0
Robust Transliteration Mining from Comparable Corpora with Bilingual Topic Models | | 0
Romanization-based Approach to Morphological Analysis in Korean SMS Text Processing | | 0
Romanization-based Large-scale Adaptation of Multilingual Language Models | | 0
Romanized Arabic Transliteration | | 0
Romanized Berber and Romanized Arabic Automatic Language Identification Using Machine Learning | | 0
Rule based Approach for Word Normalization by resolving Transcription Ambiguity in Transliterated Search Queries | | 0
Rule Based Transliteration Scheme for English to Punjabi | | 0
Russian Stress Prediction using Maximum Entropy Ranking | | 0
Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task | | 0
Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model | | 0

No leaderboard results yet.