SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 101150 of 435 papers

TitleStatusHype
Comparison of Assorted Models for Transliteration0
Composing RNNs and FSTs for Small Data: Recovering Missing Characters in Old Hawaiian Text0
Arabic Diacritization: Stats, Rules, and Hacks0
Confusion Network for Arabic Name Disambiguation and Transliteration in Statistical Machine Translation0
Connecting the Persian-speaking World through Transliteration0
Constraint-Based Models of Lexical Borrowing0
Context based Roman-Urdu to Urdu Script Transliteration System0
A Layered Language Model based Hybrid Approach to Automatic Full Diacritization of Arabic0
Conversion between Scripts of Punjabi: Beyond Simple Transliteration0
Cost-benefit Analysis of Two-Stage Conditional Random Fields based English-to-Chinese Machine Transliteration0
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs0
Addressing Noise in Multidialectal Word Embeddings0
Building an Arabic Multiword Expressions Repository0
Cross-lingual Named Entity List Search via Transliteration0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation0
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching0
Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection0
Data representation methods and use of mined corpora for Indian language transliteration0
DCU Terminology Translation System for Medical Query Subtask at WMT140
DE-ABUSE@TamilNLP-ACL 2022: Transliteration as Data Augmentation for Abuse Detection in Tamil0
Dealing with unknown words in statistical machine translation0
Decoding Anagrammed Texts Written in an Unknown Language and Script0
Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent0
DEEP: DEnoising Entity Pre-training for Neural Machine Translation0
Approche Hybride pour la translit\'eration de l'Arabizi Alg\'erien : une \'etude pr\'eliminaire (A hybrid approach for the transliteration of Algerian Arabizi: A primary study)0
Bootstrapping Method for Chunk Alignment in Phrase Based SMT0
Applying Sanskrit Concepts for Reordering in MT0
Bootstrapping Entity Translation on Weakly Comparable Corpora0
Boosting English-Chinese Machine Transliteration via High Quality Alignment and Multilingual Resources0
Applying Neural Networks to English-Chinese Named Entity Transliteration0
A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation0
A Classical Chinese Corpus with Nested Part-of-Speech Tags0
Bilingual Dictionary Construction with Transliteration Filtering0
Applying mpaligner to Machine Transliteration with Japanese-Specific Heuristics0
Bidirectional Bengali Script and Meetei Mayek Transliteration of Web Based Manipuri News Corpus0
Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic0
English to Punjabi Transliteration using Orthographic and Phonetic Information0
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP0
An unsupervised EM method to infer time variation in sense probabilities0
English-to-Chinese Transliteration with Phonetic Auxiliary Task0
English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation0
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition0
English-Korean Named Entity Transliteration Using Substring Alignment and Re-ranking Methods0
End-to-End Natural Language Understanding Pipeline for Bangla Conversational Agents0
English-to-Chinese Transliteration with Phonetic Back-transliteration0
English to Indonesian Transliteration to Support English Pronunciation Practice0
Bangla Phonetic Input Method with Foreign Words Handling0
An Omni-Font Gurmukhi to Shahmukhi Transliteration System0
Show:102550
← PrevPage 3 of 9Next →

No leaderboard results yet.