Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users May 6, 2022 Transliteration
Code Code Available 2GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek Dec 11, 2024 Dependency Parsing Morphological Tagging
Code Code Available 2Applying the Transformer to Character-level Transduction May 20, 2020 Grapheme-to-Phoneme Conversion Morphological Inflection
Code Code Available 1ParaNames: A Massively Multilingual Entity Name Corpus Feb 28, 2022 named-entity-recognition Named Entity Recognition
Code Code Available 1An Ensemble Model of Word-based and Character-based Models for Japanese and Chinese Input Method Dec 1, 2012 Transliteration
Code Code Available 1Question Answering Classification for Amharic Social Media Community Based Questions Jun 1, 2022 8k Question Answering
Code Code Available 1Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models Jun 28, 2023 Part-Of-Speech Tagging Sentiment Analysis
Code Code Available 1Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration May 25, 2023 Speech Synthesis text-to-speech
Code Code Available 1XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages May 19, 2023 In-Context Learning Multilingual NLP
Code Code Available 1Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset Jul 2, 2020 Language Modeling Language Modelling
Code Code Available 1KLPT – Kurdish Language Processing Toolkit Nov 1, 2020 Diversity Lemmatization
Code Code Available 1Beyond Arabic: Software for Perso-Arabic Script Manipulation Jan 26, 2023 Transliteration
Code Code Available 1Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation Aug 6, 2023 Machine Translation Scene Text Editing
Code Code Available 1ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages using Wikidata May 15, 2024 Multilingual Named Entity Recognition named-entity-recognition
Code Code Available 1DeepScribe: Localization and Classification of Elamite Cuneiform Signs Via Deep Learning Jun 2, 2023 Transliteration
Code Code Available 1Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus Oct 4, 2020 Articles Machine Translation
Code Code Available 1ParsiPy: NLP Toolkit for Historical Persian Texts in Python Mar 22, 2025 Lemmatization Part-Of-Speech Tagging
Code Code Available 1Sub-Character Tokenization for Chinese Pretrained Language Models Jun 1, 2021 Chinese Word Segmentation Computational Efficiency
Code Code Available 1A machine transliteration tool between Uzbek alphabets May 19, 2022 Transliteration
Code Code Available 1A Comparison of Entity Matching Methods between English and Japanese Katakana Oct 1, 2018 Transliteration
— Unverified 0A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic Oct 1, 2014 Entity Extraction using GAN General Classification
— Unverified 0ARGUABLY at ComMA@ICON: Detection of Multilingual Aggressive, Gender Biased, and Communally Charged Tweets Using Ensemble and Fine-Tuned IndicBERT Dec 1, 2021 Hate Speech Detection Multi-class Classification
— Unverified 0A Multilinear Approach to the Unsupervised Learning of Morphology Aug 1, 2016 Transliteration
— Unverified 0A Digital Swedish-Yiddish/Yiddish-Swedish Dictionary: A Web-Based Dictionary that is also Available Offline Jun 1, 2022 Transliteration
— Unverified 0A Deep Learning Based Approach to Transliteration Jul 1, 2018 Deep Learning Information Retrieval
— Unverified 0A Multi-Orthography Parallel Corpus of Yiddish Nouns May 1, 2020 Grapheme-to-Phoneme Conversion Transliteration
— Unverified 0A Myanmar (Burmese)-English Named Entity Transliteration Dictionary May 1, 2020 Transliteration
— Unverified 0Analyzing English-Spanish Named-Entity enhanced Machine Translation Jun 1, 2015 Machine Translation Named Entity Recognition (NER)
— Unverified 0Analyzing Urdu Social Media for Sentiments using Transfer Learning with Controlled Translations Jun 1, 2012 Domain Adaptation Part-Of-Speech Tagging
— Unverified 0Agreement on Target-bidirectional Neural Machine Translation Jun 1, 2016 Machine Translation Structured Prediction
— Unverified 0amLite: Amharic Transliteration Using Key Map Dictionary Sep 16, 2015 Transliteration
— Unverified 0A Comparative Study of Extremely Low-Resource Transliteration of the World's Languages May 1, 2018 Machine Translation Speech Recognition
— Unverified 0A Bird's-eye View of Language Processing Projects at the Romanian Academy May 1, 2018 Autonomous Vehicles Keyword Spotting
— Unverified 0A Simple but Effective Approach to Improve Arabizi-to-English Statistical Machine Translation Dec 1, 2016 Machine Translation Translation
— Unverified 0Arabic Retrieval Revisited: Morphological Hole Filling Jul 1, 2012 Morphological Analysis Retrieval
— Unverified 0A Layered Language Model based Hybrid Approach to Automatic Full Diacritization of Arabic Apr 1, 2017 Arabic Text Diacritization Form
— Unverified 0Addressing Noise in Multidialectal Word Embeddings Jul 1, 2018 Sentence Transliteration
— Unverified 0A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation Jun 15, 2016 Decoder Machine Translation
— Unverified 0Applying Sanskrit Concepts for Reordering in MT Dec 1, 2015 Language Modelling Machine Translation
— Unverified 0Applying Neural Networks to English-Chinese Named Entity Transliteration Aug 1, 2016 Information Retrieval Machine Translation
— Unverified 0A Classical Chinese Corpus with Nested Part-of-Speech Tags Apr 1, 2012 Part-Of-Speech Tagging Transliteration
— Unverified 0Approche Hybride pour la translit\'eration de l'Arabizi Alg\'erien : une \'etude pr\'eliminaire (A hybrid approach for the transliteration of Algerian Arabizi: A primary study) May 1, 2018 Transliteration
— Unverified 0Arabic Diacritization: Stats, Rules, and Hacks Apr 1, 2017 Decoder Part-Of-Speech Tagging
— Unverified 0Arabic Diacritization with Recurrent Neural Networks Sep 1, 2015 Language Modelling Morphological Analysis
— Unverified 03arif: A Corpus of Modern Standard and Egyptian Arabic Tweets Annotated for Epistemic Modality Using Interactive Crowdsourcing Aug 1, 2014 Opinion Mining Sentiment Analysis
— Unverified 0Arabic to English Person Name Transliteration using Twitter May 1, 2016 Retrieval Translation
— Unverified 0Arabizi Detection and Conversion to Arabic Jun 28, 2013 Language Modeling Language Modelling
— Unverified 0Arabizi sentiment analysis based on transliteration and automatic corpus annotation Oct 1, 2018 Opinion Mining Sentiment Analysis
— Unverified 0AraNLP: a Java-based Library for the Processing of Arabic Text. May 1, 2014 Information Retrieval Machine Translation
— Unverified 0Applying mpaligner to Machine Transliteration with Japanese-Specific Heuristics Jul 1, 2012 Machine Translation Transliteration
— Unverified 0