SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 151200 of 435 papers

TitleStatusHype
Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks0
Language-agnostic Multilingual Modeling0
Machine Translation Pre-training for Data-to-Text Generation -- A Case Study in Czech0
Urdu-English Machine Transliteration using Neural Networks0
A Multi-cascaded Deep Model for Bilingual SMS ClassificationCode0
Sideways Transliteration: How to Transliterate Multicultural Person Names?0
Code-Mixed to Monolingual Translation Framework0
Rule based Approach for Word Normalization by resolving Transcription Ambiguity in Transliterated Search Queries0
Algorithms for certain classes of Tamil Spelling correction0
A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages0
Latin script keyboards for South Asian languages with finite-state normalization0
ANETAC: Arabic Named Entity Transliteration and Classification Dataset0
Joint Approach to Deromanization of Code-mixed Texts0
Event detection in Twitter: A keyword volume approachCode0
Improving the neural network-based machine transliteration for low-resourced language pair0
A Rule-based Kurdish Text Transliteration SystemCode0
Normalization of Transliterated Words in Code-Mixed Data Using Seq2Seq Model \& Levenshtein Distance0
Composing RNNs and FSTs for Small Data: Recovering Missing Characters in Old Hawaiian Text0
Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources0
Tilde's Parallel Corpus Filtering Methods for WMT 20180
A Comparison of Entity Matching Methods between English and Japanese Katakana0
Arabizi sentiment analysis based on transliteration and automatic corpus annotation0
The SLT-Interactions Parsing System at the CoNLL 2018 Shared Task0
Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification0
Recovering Missing Characters in Old Hawaiian Writing0
Efficient Sequence Labeling with Actor-Critic TrainingCode0
Bootstrapping Transliteration with Constrained Discovery for Low-Resource LanguagesCode0
SentiALG: Automated Corpus Annotation for Algerian Sentiment Analysis0
Hybrid approach for transliteration of Algerian arabizi: a primary study0
Design Challenges in Named Entity TransliterationCode0
Indigenous language technologies in Canada: Assessment, challenges, and successes0
Identifying Aggression and Toxicity in Comments using Capsule Network0
Transliteration Better than Translation? Answering Code-mixed Questions over a Knowledge Base0
Low-Resource Machine Transliteration Using Recurrent Neural Networks of Asian Languages0
NEWS 2018 Whitepaper0
Comparison of Assorted Models for Transliteration0
A Deep Learning Based Approach to Transliteration0
Report of NEWS 2018 Named Entity Transliteration Shared Task0
Statistical Machine Transliteration Baselines for NEWS 20180
Simple Features for Strong Performance on Named Entity Recognition in Code-Switched Twitter Data0
Neural Machine Translation Techniques for Named Entity TransliterationCode0
Addressing Noise in Multidialectal Word Embeddings0
Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System0
Meaningless yet meaningful: Morphology grounded subword-level NMT0
Normalization of Transliterated Words in Code-Mixed Data Using Seq2Seq Model & Levenshtein Distance0
Approche Hybride pour la translit\'eration de l'Arabizi Alg\'erien : une \'etude pr\'eliminaire (A hybrid approach for the transliteration of Algerian Arabizi: A primary study)0
Unified Guidelines and Resources for Arabic Dialect Orthography0
Creating Large-Scale Multilingual Cognate TablesCode0
A Bird's-eye View of Language Processing Projects at the Romanian Academy0
Portable Spelling Corrector for a Less-Resourced Language: Amharic0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.