SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 101150 of 435 papers

TitleStatusHype
Orthographic Transliteration for Kabyle Speech RecognitionCode0
Transliterating Kurdish texts in Latin into Persian-Arabic script0
Ceasing hate withMoH: Hate Speech Detection in Hindi-English Code-Switched Language0
Transliteration of Foreign Words in Burmese0
Context based Roman-Urdu to Urdu Script Transliteration System0
Transliteration: A Simple Technique For Improving Multilingual Language Modeling0
Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan LanguagesCode0
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints0
Web-sentiment analysis of public comments (public reviews) for languages with limited resources such as the Kazakh language0
Cross-Lingual Text Classification of Transliterated Hindi and MalayalamCode0
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and PostsCode0
Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-20210
Language Relatedness and Lexical Closeness can help Improve Multilingual NMT: IITBombay@MultiIndicNMT WAT20210
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task0
Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task0
End-to-End Natural Language Understanding Pipeline for Bangla Conversational Agents0
Specializing Multilingual Language Models: An Empirical StudyCode0
Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition0
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages StudyCode0
Normalization and Back-Transliteration for Code-Switched Data0
Transliteration for Low-Resource Code-Switching Texts: Building an Automatic Cyrillic-to-Latin Converter for Tatar0
Neural String Edit DistanceCode0
On Biasing Transformer Attention Towards MonotonicityCode0
Finite-state script normalization and processing utilities: The Nisaba Brahmic library0
Hopeful Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
OFFLangOne@DravidianLangTech-EACL2021: Transformers with the Class Balanced Loss for Offensive Language Identification in Dravidian Code-Mixed text.0
A Large-scale Evaluation of Neural Machine Transliteration for Indic LanguagesCode0
Towards Offensive Language Identification for Dravidian LanguagesCode0
QuranTree.jl: A Julia Package for Quranic Arabic Corpus0
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers0
Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration0
A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models0
Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction0
Homonym normalisation by word sense clustering: a case in Japanese0
Language Identification and Normalization of Code Mixed English and Punjabi Text0
English-to-Chinese Transliteration with Phonetic Auxiliary Task0
Effective Architectures for Low Resource Multilingual Named Entity Transliteration0
Machine Translation Pre-training for Data-to-Text Generation - A Case Study in Czech0
Leveraging Alignment and Phonology for low-resource Indic to English Neural Machine Transliteration0
A complete character recognition and transliteration technique for Devanagari script0
Transliteration for Cross-Lingual Morphological Inflection0
Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages0
Digraph of Senegal s local languages: issues, challenges and prospects of their transliteration0
Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique0
Cross-lingual Named Entity List Search via Transliteration0
A Myanmar (Burmese)-English Named Entity Transliteration Dictionary0
A Multi-Orthography Parallel Corpus of Yiddish Nouns0
TRANSLIT: A Large-scale Name Transliteration Resource0
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi0
Towards an Efficient Code-Mixed Grapheme-to-Phoneme Conversion in an Agglutinative Language: A Case Study on To-Korean Transliteration0
Show:102550
← PrevPage 3 of 9Next →

No leaderboard results yet.