SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 51100 of 435 papers

TitleStatusHype
A Multilinear Approach to the Unsupervised Learning of Morphology0
A Simple but Effective Approach to Improve Arabizi-to-English Statistical Machine Translation0
ASMA: A System for Automatic Segmentation and Morpho-Syntactic Disambiguation of Modern Standard Arabic0
Assamese-English Bilingual Machine Translation0
Assamese WordNet based Quality Enhancement of Bilingual Machine Translation System0
A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining0
A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages0
A Tightly-coupled Unsupervised Clustering and Bilingual Alignment Model for Transliteration0
A two-stage transliteration approach to improve performance of a multilingual ASR0
A Unified Approach to Transliteration-based Text Input with Online Spelling Correction0
A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models0
A Universal Dependencies Treebank for Marathi0
A Classical Chinese Corpus with Nested Part-of-Speech Tags0
Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Related Languages0
Composing RNNs and FSTs for Small Data: Recovering Missing Characters in Old Hawaiian Text0
AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
A Conventional Orthography for Tunisian Arabic0
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP0
An unsupervised EM method to infer time variation in sense probabilities0
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition0
Bangla Phonetic Input Method with Foreign Words Handling0
An Omni-Font Gurmukhi to Shahmukhi Transliteration System0
A Hybrid Word Alignment Model for Phrase-Based Statistical Machine Translation0
Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages0
Bidirectional Bengali Script and Meetei Mayek Transliteration of Web Based Manipuri News Corpus0
Applying mpaligner to Machine Transliteration with Japanese-Specific Heuristics0
Bilingual Dictionary Construction with Transliteration Filtering0
Accurate Word Segmentation using Transliteration and Language Model Projection0
Bootstrapping Entity Translation on Weakly Comparable Corpora0
Bootstrapping Method for Chunk Alignment in Phrase Based SMT0
Applying Sanskrit Concepts for Reordering in MT0
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer0
Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation0
Building an Arabic Multiword Expressions Repository0
CamelParser: A system for Arabic Syntactic Analysis and Morphological Disambiguation0
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi0
Arabic Diacritization with Recurrent Neural Networks0
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language0
Automatic word stress annotation of Russian unrestricted text0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment0
Charles Translator: A Machine Translation System between Ukrainian and Czech0
Classifying Arab Names Geographically0
CLeLfPC: a Large Open Multi-Speaker Corpus of French Cued Speech0
Code-Mixed Text to Speech Synthesis under Low-Resource Constraints0
Code-Mixed to Monolingual Translation Framework0
Combining, Adapting and Reusing Bi-texts between Related Languages: Application to Statistical Machine Translation (invited talk)0
Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction0
A Neural Network Transliteration Model in Low Resource Settings0
Show:102550
← PrevPage 2 of 9Next →

No leaderboard results yet.