SOTAVerified

Transliteration

Transliteration is a mechanism for converting a word in a source (foreign) language to a target language, and often adopts approaches from machine translation. In machine translation, the objective is to preserve the semantic meaning of the utterance as much as possible while following the syntactic structure in the target language. In Transliteration, the objective is to preserve the original pronunciation of the source word as much as possible while following the phonological structures of the target language.

For example, the city’s name “Manchester” has become well known by people of languages other than English. These new words are often named entities that are important in cross-lingual information retrieval, information extraction, machine translation, and often present out-of-vocabulary challenges to spoken language technologies such as automatic speech recognition, spoken keyword search, and text-to-speech.

Source: Phonology-Augmented Statistical Framework for Machine Transliteration using Limited Linguistic Resources

Papers

Showing 351400 of 435 papers

TitleStatusHype
Transliteration of Arabizi into Arabic Orthography: Developing a Parallel Annotated Arabizi-Arabic Script SMS/Chat Corpus0
Transliteration of Foreign Words in Burmese0
Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks0
Transliteration Systems across Indian Languages Using Parallel Corpora0
UMUTeam@LT-EDI-ACL2022: Detecting homophobic and transphobic comments in Tamil0
Unified Guidelines and Resources for Arabic Dialect Orthography0
UniGlyph: A Seven-Segment Script for Universal Language Representation0
Unsupervised Language agnostic WER Standardization0
Urdu-English Machine Transliteration using Neural Networks0
Urdu Hindi Machine Transliteration using SMT0
Urdu - Roman Transliteration via Finite State Transducers0
Use of Transformer-Based Models for Word-Level Transliteration of the Book of the Dean of Lismore0
Using Transliteration of Proper Names from Arabic to Latin Script to Improve English-Arabic Word Alignment0
Utilisation de la translitt\'eration arabe pour l'am\'elioration de l'alignement de mots \`a partir de corpus parall\`eles fran -arabe (Using Arabic Transliteration to Improve Word Alignment from French-Arabic Parallel Corpora) [in French]0
Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration0
Vocabulary-Based Language Similarity using Web Corpora0
Web-sentiment analysis of public comments (public reviews) for languages with limited resources such as the Kazakh language0
Weighting Finite-State Transductions With Neural Context0
What Matters Most in Morphologically Segmented SMT Models?0
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages0
When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control0
Whitepaper of NEWS 2012 Shared Task on Machine Transliteration0
Whitepaper of NEWS 2015 Shared Task on Machine Transliteration0
Whitepaper of NEWS 2016 Shared Task on Machine Transliteration0
Whose Nickname is This? Recognizing Politicians from Their Aliases0
Word-level Language Identification in Bi-lingual Code-switched Texts0
Words.hk: A Comprehensive Cantonese Dictionary Dataset with Definitions, Translations and Transliterated Examples0
YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer0
Yandex School of Data Analysis Machine Translation Systems for WMT130
Yandex School of Data Analysis Russian-English Machine Translation System for WMT140
Algorithms for certain classes of Tamil Spelling correction0
``ye word kis lang ka hai bhai?'' Testing the Limits of Word level Language Identification0
3arif: A Corpus of Modern Standard and Egyptian Arabic Tweets Annotated for Epistemic Modality Using Interactive Crowdsourcing0
A Bird's-eye View of Language Processing Projects at the Romanian Academy0
Accurate Word Segmentation using Transliteration and Language Model Projection0
A Classical Chinese Corpus with Nested Part-of-Speech Tags0
A Comparative Study of Extremely Low-Resource Transliteration of the World's Languages0
A Comparison of Entity Matching Methods between English and Japanese Katakana0
A complete character recognition and transliteration technique for Devanagari script0
A Conventional Orthography for Tunisian Arabic0
A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation0
Addressing Noise in Multidialectal Word Embeddings0
A Deep Learning Based Approach to Transliteration0
A Digital Swedish-Yiddish/Yiddish-Swedish Dictionary: A Web-Based Dictionary that is also Available Offline0
A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic0
Agreement on Target-bidirectional Neural Machine Translation0
A House United: Bridging the Script and Lexical Barrier between Hindi and Urdu0
A Hybrid Transliteration Model for Chinese/English Named Entities ---BJTU-NLP Report for the 5th Named Entities Workshop0
A Hybrid Word Alignment Model for Phrase-Based Statistical Machine Translation0
AIDA2: A Hybrid Approach for Token and Sentence Level Dialect Identification in Arabic0
Show:102550
← PrevPage 8 of 9Next →

No leaderboard results yet.