SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word from a source (foreign) language into a target language, and it often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure of the target language. In transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city name “Manchester” is now well known to speakers of languages other than English. Such borrowed words are often named entities that are important in cross-lingual information retrieval, information extraction, and machine translation, and they often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources
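
To make the definition concrete, the following is a minimal sketch of a rule-based transliterator, assuming a tiny hand-written Latin-to-Cyrillic rule table that covers only the letters in “Manchester”. The table `RULES` and the function `transliterate` are hypothetical names introduced for illustration. This is not the statistical framework of the source paper; it only shows the idea of mapping source spelling units to target-script units that approximate the pronunciation.

```python
# Toy rule table (hypothetical, illustrative only): maps Latin spelling
# units to Cyrillic letters with a similar sound. Multi-letter units such
# as "ch" must be matched before single letters.
RULES = {
    "ch": "ч",
    "m": "м", "a": "а", "n": "н", "e": "е",
    "s": "с", "t": "т", "r": "р",
}
MAX_KEY_LEN = max(len(key) for key in RULES)


def transliterate(word: str) -> str:
    """Greedy longest-match transliteration using RULES.

    Characters with no matching rule are copied through unchanged.
    """
    word = word.lower()
    out = []
    i = 0
    while i < len(word):
        # Try the longest possible unit first, then shorter ones.
        for size in range(min(MAX_KEY_LEN, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in RULES:
                out.append(RULES[chunk])
                i += size
                break
        else:
            # No rule matched at this position: keep the character as-is.
            out.append(word[i])
            i += 1
    return "".join(out)


print(transliterate("Manchester"))  # манчестер
```

A fixed rule table like this ignores context and the target language's phonological constraints, which is the gap that statistical and neural transliteration models aim to close.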

Papers

Showing 251–300 of 435 papers

| Title | Status | Hype |
| --- | --- | --- |
| OFFLangOne@DravidianLangTech-EACL2021: Transformers with the Class Balanced Loss for Offensive Language Identification in Dravidian Code-Mixed text. | | 0 |
| Opinion Mining in a Code-Mixed Environment: A Case Study with Government Portals | | 0 |
| Optimizing Multilingual Text-To-Speech with Accents & Emotions | | 0 |
| Optimizing Transliteration for Hindi/Marathi to English Using only Two Weights | | 0 |
| Orthographic and Morphological Processing for Persian-to-English Statistical Machine Translation | | 0 |
| Palmyra: A Platform Independent Dependency Annotation Tool for Morphologically Rich Languages | | 0 |
| Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification | | 0 |
| Phonologically Aware Neural Model for Named Entity Recognition in Low Resource Transfer Settings | | 0 |
| Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources | | 0 |
| PJAIT Systems for the IWSLT 2015 Evaluation Campaign Enhanced by Comparable Corpora | | 0 |
| PJAIT Systems for the WMT 2016 | | 0 |
| PJIIT's systems for WMT 2017 Conference | | 0 |
| PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model | | 0 |
| Portable Spelling Corrector for a Less-Resourced Language: Amharic | | 0 |
| POS Tagging of English-Hindi Code-Mixed Social Media Content | | 0 |
| POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments | | 0 |
| Processing Informal, Romanized Pakistani Text Messages | | 0 |
| Proper Name Diacritization for Arabic Wikipedia: A Benchmark Dataset | | 0 |
| Proper Name Machine Translation from Japanese to Japanese Sign Language | | 0 |
| Putting Figures on Influences on Moroccan Darija from Arabic, French and Spanish using the WordNet | | 0 |
| QCRI-MES Submission at WMT13: Using Transliteration Mining to Improve Statistical Machine Translation | | 0 |
| Query Translation for Cross-Language Information Retrieval using Multilingual Word Clusters | | 0 |
| Quillpad Multilingual Predictive Transliteration System | | 0 |
| QuranTree.jl: A Julia Package for Quranic Arabic Corpus | | 0 |
| Recovering Missing Characters in Old Hawaiian Writing | | 0 |
| Regularity and Flexibility in English-Chinese Name Transliteration | | 0 |
| Regularized Interlingual Projections: Evaluation on Multilingual Transliteration | | 0 |
| Regulating Orthography-Phonology Relationship for English to Thai Transliteration | | 0 |
| Report of NEWS 2012 Machine Transliteration Shared Task | | 0 |
| Report of NEWS 2015 Machine Transliteration Shared Task | | 0 |
| Report of NEWS 2016 Machine Transliteration Shared Task | | 0 |
| Report of NEWS 2018 Named Entity Transliteration Shared Task | | 0 |
| Rescoring a Phrase-based Machine Transliteration System with Recurrent Neural Network Language Models | | 0 |
| Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models? | | 0 |
| Review of Computational Epigraphy | | 0 |
| Robust Dictionary Lookup in Multiple Noisy Orthographies | | 0 |
| Robust Transliteration Mining from Comparable Corpora with Bilingual Topic Models | | 0 |
| Romanization-based Approach to Morphological Analysis in Korean SMS Text Processing | | 0 |
| Romanization-based Large-scale Adaptation of Multilingual Language Models | | 0 |
| Romanized Arabic Transliteration | | 0 |
| Romanized Berber and Romanized Arabic Automatic Language Identification Using Machine Learning | | 0 |
| Rule based Approach for Word Normalization by resolving Transcription Ambiguity in Transliterated Search Queries | | 0 |
| Rule Based Transliteration Scheme for English to Punjabi | | 0 |
| Russian Stress Prediction using Maximum Entropy Ranking | | 0 |
| Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task | | 0 |
| Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model | | 0 |
| Scalable Large-Margin Structured Learning: Theory and Algorithms | | 0 |
| Semi-supervised Chinese Word Segmentation based on Bilingual Information | | 0 |
| SentiALG: Automated Corpus Annotation for Algerian Sentiment Analysis | | 0 |
| Sequence to Sequence Networks for Roman-Urdu to Urdu Transliteration | | 0 |

No leaderboard results yet.