SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 5175 of 435 papers

TitleStatusHype
Creating a Translation Matrix of the Bible's Names Across 591 LanguagesCode0
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages StudyCode0
On Biasing Transformer Attention Towards MonotonicityCode0
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and PostsCode0
A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models0
A two-stage transliteration approach to improve performance of a multilingual ASR0
Analyzing Urdu Social Media for Sentiments using Transfer Learning with Controlled Translations0
Agreement on Target-bidirectional Neural Machine Translation0
A Tightly-coupled Unsupervised Clustering and Bilingual Alignment Model for Transliteration0
A Unified Approach to Transliteration-based Text Input with Online Spelling Correction0
A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages0
A Universal Dependencies Treebank for Marathi0
Analyzing English-Spanish Named-Entity enhanced Machine Translation0
A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining0
Assamese WordNet based Quality Enhancement of Bilingual Machine Translation System0
A Myanmar (Burmese)-English Named Entity Transliteration Dictionary0
A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic0
A Comparison of Entity Matching Methods between English and Japanese Katakana0
Assamese-English Bilingual Machine Translation0
ASMA: A System for Automatic Segmentation and Morpho-Syntactic Disambiguation of Modern Standard Arabic0
A Multi-Orthography Parallel Corpus of Yiddish Nouns0
A Simple but Effective Approach to Improve Arabizi-to-English Statistical Machine Translation0
A Multilinear Approach to the Unsupervised Learning of Morphology0
A Digital Swedish-Yiddish/Yiddish-Swedish Dictionary: A Web-Based Dictionary that is also Available Offline0
ARGUABLY at ComMA@ICON: Detection of Multilingual Aggressive, Gender Biased, and Communally Charged Tweets Using Ensemble and Fine-Tuned IndicBERT0
Show:102550
← PrevPage 3 of 18Next →

No leaderboard results yet.