Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages | Sep 22, 2021 | Multiple Choice Question Answering (MCQA) Natural Language Inference | Code Available | 0
HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints | Sep 9, 2021 | Data Augmentation Decoder | Unverified | 0
Web-sentiment analysis of public comments (public reviews) for languages with limited resources such as the Kazakh language | Sep 1, 2021 | Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) | Unverified | 0
Cross-Lingual Text Classification of Transliterated Hindi and Malayalam | Aug 31, 2021 | Benchmarking Classification | Code Available | 0
Towards Offensive Language Identification for Tamil Code-Mixed YouTube Comments and Posts | Aug 24, 2021 | Language Identification Transfer Learning | Code Available | 0
Samsung R&D Institute Poland submission to WAT 2021 Indic Language Multilingual Task | Aug 1, 2021 | Domain Adaptation Knowledge Distillation | Unverified | 0
Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-2021 | Aug 1, 2021 | Decoder Machine Translation | Unverified | 0
ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task | Aug 1, 2021 | Data Augmentation Decoder | Unverified | 0
Language Relatedness and Lexical Closeness can help Improve Multilingual NMT: IITBombay@MultiIndicNMT WAT2021 | Aug 1, 2021 | Decoder Machine Translation | Unverified | 0
End-to-End Natural Language Understanding Pipeline for Bangla Conversational Agents | Jul 12, 2021 | BIG-bench Machine Learning Chatbot | Unverified | 0
Specializing Multilingual Language Models: An Empirical Study | Jun 16, 2021 | Dependency Parsing named-entity-recognition | Code Available | 0
Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition | Jun 10, 2021 | Language Modelling speech-recognition | Unverified | 0
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study | Jun 7, 2021 | Data Augmentation Language Modeling | Code Available | 0
Transliteration for Low-Resource Code-Switching Texts: Building an Automatic Cyrillic-to-Latin Converter for Tatar | Jun 1, 2021 | Language Identification Transliteration | Unverified | 0
Normalization and Back-Transliteration for Code-Switched Data | Jun 1, 2021 | named-entity-recognition Named Entity Recognition | Unverified | 0
Sub-Character Tokenization for Chinese Pretrained Language Models | Jun 1, 2021 | Chinese Word Segmentation Computational Efficiency | Code Available | 1
Neural String Edit Distance | Apr 16, 2021 | Classification General Classification | Code Available | 0
On Biasing Transformer Attention Towards Monotonicity | Apr 8, 2021 | Grapheme-to-Phoneme Conversion Morphological Inflection | Code Available | 0
QuranTree.jl: A Julia Package for Quranic Arabic Corpus | Apr 1, 2021 | Transliteration | Unverified | 0
Towards Offensive Language Identification for Dravidian Languages | Apr 1, 2021 | Few-Shot Learning Language Identification | Code Available | 0
OFFLangOne@DravidianLangTech-EACL2021: Transformers with the Class Balanced Loss for Offensive Language Identification in Dravidian Code-Mixed text. | Apr 1, 2021 | Language Identification Transliteration | Unverified | 0
Hopeful Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers | Apr 1, 2021 | Hope Speech Detection regression | Unverified | 0
A Large-scale Evaluation of Neural Machine Transliteration for Indic Languages | Apr 1, 2021 | Translation Transliteration | Code Available | 0
Finite-state script normalization and processing utilities: The Nisaba Brahmic library | Apr 1, 2021 | Survey Transliteration | Unverified | 0
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers | Feb 24, 2021 | Hope Speech Detection regression | Unverified | 0
Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration | Jan 13, 2021 | Transliteration | Unverified | 0
Machine Translation Pre-training for Data-to-Text Generation - A Case Study in Czech | Dec 1, 2020 | Data-to-Text Generation Machine Translation | Unverified | 0
A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models | Dec 1, 2020 | Transliteration | Unverified | 0
Leveraging Alignment and Phonology for low-resource Indic to English Neural Machine Transliteration | Dec 1, 2020 | Transliteration | Unverified | 0
Language Identification and Normalization of Code Mixed English and Punjabi Text | Dec 1, 2020 | Language Identification Transliteration | Unverified | 0
Effective Architectures for Low Resource Multilingual Named Entity Transliteration | Dec 1, 2020 | Decoder Transliteration | Unverified | 0
Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction | Dec 1, 2020 | Translation Transliteration | Unverified | 0
Homonym normalisation by word sense clustering: a case in Japanese | Dec 1, 2020 | Clustering Language Modeling | Unverified | 0
English-to-Chinese Transliteration with Phonetic Auxiliary Task | Dec 1, 2020 | Machine Translation Multi-Task Learning | Unverified | 0
KLPT – Kurdish Language Processing Toolkit | Nov 1, 2020 | Diversity Lemmatization | Code Available | 1
Leveraging Multilingual News Websites for Building a Kurdish Parallel Corpus | Oct 4, 2020 | Articles Machine Translation | Code Available | 1
A complete character recognition and transliteration technique for Devanagari script | Sep 28, 2020 | Segmentation Transliteration | Unverified | 0
Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset | Jul 2, 2020 | Language Modeling Language Modelling | Code Available | 1
Transliteration for Cross-Lingual Morphological Inflection | Jul 1, 2020 | Cross-Lingual Transfer Grapheme-to-Phoneme Conversion | Unverified | 0
Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages | Jul 1, 2020 | Machine Translation NMT | Unverified | 0
Applying the Transformer to Character-level Transduction | May 20, 2020 | Grapheme-to-Phoneme Conversion Morphological Inflection | Code Available | 1
Digraph of Senegal's local languages: issues, challenges and prospects of their transliteration | May 5, 2020 | Translation Transliteration | Unverified | 0
Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique | May 5, 2020 | Transliteration | Unverified | 0
Cross-lingual Named Entity List Search via Transliteration | May 1, 2020 | Transliteration | Unverified | 0
A Myanmar (Burmese)-English Named Entity Transliteration Dictionary | May 1, 2020 | Transliteration | Unverified | 0
A Multi-Orthography Parallel Corpus of Yiddish Nouns | May 1, 2020 | Grapheme-to-Phoneme Conversion Transliteration | Unverified | 0
TRANSLIT: A Large-scale Name Transliteration Resource | May 1, 2020 | Transliteration | Unverified | 0
Towards an Efficient Code-Mixed Grapheme-to-Phoneme Conversion in an Agglutinative Language: A Case Study on To-Korean Transliteration | May 1, 2020 | Grapheme-to-Phoneme Conversion Philosophy | Unverified | 0
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi | May 1, 2020 | Dependency Parsing Part-Of-Speech Tagging | Unverified | 0
Detect Language of Transliterated Texts | Apr 26, 2020 | Language Identification Translation | Unverified | 0