Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 435 papers

Title	Date	Tasks	Status
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs	Mar 8, 2024	Transliteration	—Unverified
Training a Bilingual Language Model by Mapping Tokens onto a Shared Character Space	Feb 25, 2024	Language ModelingLanguage Modelling	—Unverified
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching	Jan 30, 2024	Cross-Lingual TransferTransliteration	—Unverified
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models	Jan 12, 2024	Contrastive LearningTransliteration	CodeCode Available
Language Detection for Transliterated Content	Jan 9, 2024	Language IdentificationTransliteration	—Unverified
Code-Mixed Text to Speech Synthesis under Low-Resource Constraints	Dec 2, 2023	Speech Synthesistext-to-speech	—Unverified
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment	Nov 7, 2023	DecoderPosition	—Unverified
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP	Sep 22, 2023	Abstractive Text SummarizationNatural Language Inference	—Unverified
Exploring Linguistic Similarity and Zero-Shot Learning for Multilingual Translation of Dravidian Languages	Aug 10, 2023	DecoderMachine Translation	—Unverified
Multilingual Neural Machine Translation System for Indic to Indic Languages	Jun 22, 2023	Machine TranslationTranslation	—Unverified
Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition	Jun 14, 2023	Data Augmentationspeech-recognition	—Unverified
Towards Transliteration between Sindhi Scripts from Devanagari to Perso-Arabic	May 12, 2023	Transliteration	—Unverified
Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages	May 4, 2023	Cross-Lingual TransferMachine Translation	—Unverified
Romanization-based Large-scale Adaptation of Multilingual Language Models	Apr 18, 2023	Cross-Lingual TransferTransliteration	—Unverified
Unsupervised Language agnostic WER Standardization	Mar 9, 2023	speech-recognitionSpeech Recognition	—Unverified
EPIK: Eliminating multi-model Pipelines with Knowledge-distillation	Nov 27, 2022	Knowledge DistillationTransliteration	—Unverified
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition	Nov 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards Zero-Shot Code-Switched Speech Recognition	Nov 2, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set	Oct 30, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data	Oct 21, 2022	Machine TranslationTranslation	—Unverified
English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation	Oct 1, 2022	Machine TranslationNMT	—Unverified
机器音译研究综述(Survey on Machine Transliteration)	Oct 1, 2022	Transliteration	—Unverified
Investigation of English to Hindi Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation	Oct 1, 2022	Machine TranslationNMT	—Unverified
Investigation of Multilingual Neural Machine Translation for Indian Languages	Oct 1, 2022	Machine TranslationTranslation	—Unverified
MATra: A Multilingual Attentive Transliteration System for Indian Scripts	Aug 23, 2022	Transliteration	—Unverified

Show:10 25 50

← PrevPage 3 of 18Next →

No leaderboard results yet.