SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 151200 of 435 papers

TitleStatusHype
Entity Clustering Across Languages0
EPIK: Eliminating multi-model Pipelines with Knowledge-distillation0
English-to-Chinese Transliteration with Phonetic Back-transliteration0
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP0
An unsupervised EM method to infer time variation in sense probabilities0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages0
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts0
English-to-Chinese Transliteration with Phonetic Auxiliary Task0
Factored Machine Translation Systems for Russian-English0
False-Friend Detection and Entity Matching via Unsupervised Transliteration0
Finite State Approach to the Kazakh Nominal Paradigm0
Finite-state script normalization and processing utilities: The Nisaba Brahmic library0
Foreign Words and the Automatic Processing of Arabic Social Media Text Written in Roman Script0
English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation0
Fourteen Light Tasks for comparing Analogical and Phrase-based Machine Translation0
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition0
Further Developments in Treebank Error Detection Using Derivation Trees0
English-Korean Named Entity Transliteration Using Substring Alignment and Re-ranking Methods0
Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System0
End-to-End Natural Language Understanding Pipeline for Bangla Conversational Agents0
Bangla Phonetic Input Method with Foreign Words Handling0
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data0
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity0
Hindi and Marathi to English NE Transliteration Tool using Phonology and Stress Analysis0
HindiWSD: A package for word sense disambiguation in Hinglish & Hindi0
HinMA: Distributed Morphology based Hindi Morphological Analyzer0
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints0
An Omni-Font Gurmukhi to Shahmukhi Transliteration System0
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
Hopeful Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
How Are Spelling Errors Generated and Corrected? A Study of Corrected and Uncorrected Spelling Errors Using Keystroke Logs0
A Hybrid Word Alignment Model for Phrase-Based Statistical Machine Translation0
A Conventional Orthography for Tunisian Arabic0
How to Speak a Language without Knowing It0
Egyptian Arabic to English Statistical Machine Translation System for NIST OpenMT'20150
Hybrid approach for transliteration of Algerian arabizi: a primary study0
Hybrid Approach to English-Hindi Name Entity Transliteration0
Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-20210
``I am borrowing ya mixing ?'' An Analysis of English-Hindi Code Mixing in Facebook0
Identifying Aggression and Toxicity in Comments using Capsule Network0
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer0
Improving Arabic Diacritization through Syntactic Analysis0
Improving Document Ranking using Query Expansion and Classification Techniques for Mixed Script Information Retrieval0
Improving Performance Of English-Hindi Cross Language Information Retrieval Using Transliteration Of Query Terms0
Improving Statistical Machine Translation with a Multilingual Paraphrase Database0
Improving Statistical Machine Translation for a Resource-Poor Language Using Related Resource-Rich Languages0
Improving the neural network-based machine transliteration for low-resourced language pair0
Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages0
Effective Architectures for Low Resource Multilingual Named Entity Transliteration0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.