SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 376400 of 435 papers

TitleStatusHype
Word-level Language Identification in Bi-lingual Code-switched Texts0
Words.hk: A Comprehensive Cantonese Dictionary Dataset with Definitions, Translations and Transliterated Examples0
YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer0
Yandex School of Data Analysis Machine Translation Systems for WMT130
Yandex School of Data Analysis Russian-English Machine Translation System for WMT140
Algorithms for certain classes of Tamil Spelling correction0
``ye word kis lang ka hai bhai?'' Testing the Limits of Word level Language Identification0
3arif: A Corpus of Modern Standard and Egyptian Arabic Tweets Annotated for Epistemic Modality Using Interactive Crowdsourcing0
A Bird's-eye View of Language Processing Projects at the Romanian Academy0
Accurate Word Segmentation using Transliteration and Language Model Projection0
A Classical Chinese Corpus with Nested Part-of-Speech Tags0
A Comparative Study of Extremely Low-Resource Transliteration of the World's Languages0
A Comparison of Entity Matching Methods between English and Japanese Katakana0
A complete character recognition and transliteration technique for Devanagari script0
A Conventional Orthography for Tunisian Arabic0
A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation0
Addressing Noise in Multidialectal Word Embeddings0
A Deep Learning Based Approach to Transliteration0
A Digital Swedish-Yiddish/Yiddish-Swedish Dictionary: A Web-Based Dictionary that is also Available Offline0
A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic0
Agreement on Target-bidirectional Neural Machine Translation0
A House United: Bridging the Script and Lexical Barrier between Hindi and Urdu0
A Hybrid Transliteration Model for Chinese/English Named Entities ---BJTU-NLP Report for the 5th Named Entities Workshop0
A Hybrid Word Alignment Model for Phrase-Based Statistical Machine Translation0
AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic0
Show:102550
← PrevPage 16 of 18Next →

No leaderboard results yet.