SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 151200 of 435 papers

TitleStatusHype
Entity Clustering Across Languages0
EPIK: Eliminating multi-model Pipelines with Knowledge-distillation0
Bidirectional Bengali Script and Meetei Mayek Transliteration of Web Based Manipuri News Corpus0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
Exploiting Parallel Corpus for Handling Out-of-Vocabulary Words0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages0
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts0
Automatic Correction of Arabic Text: a Cascaded Approach0
Factored Machine Translation Systems for Russian-English0
False-Friend Detection and Entity Matching via Unsupervised Transliteration0
Finite State Approach to the Kazakh Nominal Paradigm0
Finite-state script normalization and processing utilities: The Nisaba Brahmic library0
Foreign Words and the Automatic Processing of Arabic Social Media Text Written in Roman Script0
Forward Transliteration of Dzongkha Text to Braille0
Fourteen Light Tasks for comparing Analogical and Phrase-based Machine Translation0
Digraph of Senegal s local languages: issues, challenges and prospects of their transliteration0
Further Developments in Treebank Error Detection Using Derivation Trees0
G2P Conversion of Proper Names Using Word Origin Information0
Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System0
Graphonological Levenshtein Edit Distance: Application for Automated Cognate Identification0
Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique0
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data0
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity0
Hindi and Marathi to English NE Transliteration Tool using Phonology and Stress Analysis0
HindiWSD: A package for word sense disambiguation in Hinglish & Hindi0
HinMA: Distributed Morphology based Hindi Morphological Analyzer0
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints0
Homonym normalisation by word sense clustering: a case in Japanese0
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
Hopeful Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
How Are Spelling Errors Generated and Corrected? A Study of Corrected and Uncorrected Spelling Errors Using Keystroke Logs0
A House United: Bridging the Script and Lexical Barrier between Hindi and Urdu0
How do you spell that? A journey through word representations0
How to Speak a Language without Knowing It0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment0
Hybrid approach for transliteration of Algerian arabizi: a primary study0
Hybrid Approach to English-Hindi Name Entity Transliteration0
Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-20210
``I am borrowing ya mixing ?'' An Analysis of English-Hindi Code Mixing in Facebook0
Identifying Aggression and Toxicity in Comments using Capsule Network0
CLeLfPC: a Large Open Multi-Speaker Corpus of French Cued Speech0
Improving Arabic Diacritization through Syntactic Analysis0
Improving Document Ranking using Query Expansion and Classification Techniques for Mixed Script Information Retrieval0
Improving Performance Of English-Hindi Cross Language Information Retrieval Using Transliteration Of Query Terms0
Improving Statistical Machine Translation with a Multilingual Paraphrase Database0
Improving Statistical Machine Translation for a Resource-Poor Language Using Related Resource-Rich Languages0
Improving the neural network-based machine transliteration for low-resourced language pair0
Improving the quality of Gujarati-Hindi Machine Translation through part-of-speech tagging and stemmer-assisted transliteration0
Dictionary Look-up with Katakana Variant Recognition0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.