SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure of the target language. In transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city name “Manchester” has become well known to speakers of languages other than English. Such new words are often named entities that are important in cross-lingual information retrieval, information extraction, and machine translation, and they often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources
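
As a rough illustration of the idea described above, the sketch below maps a Russian spelling of "Manchester" back to Latin script, one letter at a time. It is only a toy rule-based example, not the statistical framework of the cited source: the `CYRILLIC_TO_LATIN` table and the `transliterate` function are hypothetical names introduced here, and the table covers only the letters this one word needs.

```python
# Toy grapheme-level transliteration sketch (an assumption for illustration,
# not the phonology-augmented statistical method named in the source line).

# Hypothetical mapping: only the Cyrillic letters needed for the example word.
CYRILLIC_TO_LATIN = {
    "м": "m", "а": "a", "н": "n", "ч": "ch",
    "е": "e", "с": "s", "т": "t", "р": "r",
}


def transliterate(word: str) -> str:
    """Map each character through the table; unknown characters pass through."""
    out = []
    for ch in word:
        latin = CYRILLIC_TO_LATIN.get(ch.lower(), ch)
        # Preserve capitalization of the source letter.
        if ch.isupper():
            latin = latin.capitalize()
        out.append(latin)
    return "".join(out)


print(transliterate("Манчестер"))
```

A fixed letter table like this ignores context and pronunciation, which is why practical systems tend to model the phonology of the target language rather than rely on one-to-one character rules.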

Papers

Showing 151-175 of 435 papers

Title | Status | Hype
Entity Clustering Across Languages | - | 0
EPIK: Eliminating multi-model Pipelines with Knowledge-distillation | - | 0
Bidirectional Bengali Script and Meetei Mayek Transliteration of Web Based Manipuri News Corpus | - | 0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task | - | 0
Exploiting Parallel Corpus for Handling Out-of-Vocabulary Words | - | 0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning | - | 0
Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages | - | 0
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts | - | 0
Automatic Correction of Arabic Text: a Cascaded Approach | - | 0
Factored Machine Translation Systems for Russian-English | - | 0
False-Friend Detection and Entity Matching via Unsupervised Transliteration | - | 0
Finite State Approach to the Kazakh Nominal Paradigm | - | 0
Finite-state script normalization and processing utilities: The Nisaba Brahmic library | - | 0
Foreign Words and the Automatic Processing of Arabic Social Media Text Written in Roman Script | - | 0
Forward Transliteration of Dzongkha Text to Braille | - | 0
Fourteen Light Tasks for comparing Analogical and Phrase-based Machine Translation | - | 0
Digraph of Senegal's local languages: issues, challenges and prospects of their transliteration | - | 0
Further Developments in Treebank Error Detection Using Derivation Trees | - | 0
G2P Conversion of Proper Names Using Word Origin Information | - | 0
Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System | - | 0
Graphonological Levenshtein Edit Distance: Application for Automated Cognate Identification | - | 0
Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique | - | 0
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data | - | 0
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity | - | 0
A House United: Bridging the Script and Lexical Barrier between Hindi and Urdu | - | 0

No leaderboard results yet.