| BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Jan 1, 2021 | Document ClassificationLanguage Modeling | CodeCode Available | 1 |
| Not All Memories are Created Equal: Learning to Expire | Jan 1, 2021 | AllLanguage Modeling | CodeCode Available | 1 |
| WARP: Word-level Adversarial ReProgramming | Jan 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION | Jan 1, 2021 | ChatbotDecoder | CodeCode Available | 1 |
| Shortformer: Better Language Modeling using Shorter Inputs | Dec 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unified Mandarin TTS Front-end Based on Distilled BERT Model | Dec 31, 2020 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | ArticlesLanguage Modeling | CodeCode Available | 1 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generating Query Focused Summaries from Query-Free Resources | Dec 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Dec 22, 2020 | Generalization BoundsLanguage Modeling | CodeCode Available | 1 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training | Dec 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Binary Black-box Evasion Attacks Against Deep Learning-based Static Malware Detectors with Adversarial Byte-Level Language Model | Dec 14, 2020 | Deep LearningFeature Engineering | CodeCode Available | 1 |
| Extracting Training Data from Large Language Models | Dec 14, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Neural Programming Interfaces | Dec 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fusing Context Into Knowledge Graph for Commonsense Question Answering | Dec 9, 2020 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 |
| TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Dec 8, 2020 | Caption GenerationLanguage Modeling | CodeCode Available | 1 |
| Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks | Dec 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Multi-Task Learning for Knowledge Graph Completion with Pre-trained Language Models | Dec 1, 2020 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 1 |
| End-to-End Automatic Speech Recognition for Gujarati | Dec 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-TaskLearning for Offensive Language Detection | Dec 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet | Dec 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework | Dec 1, 2020 | Extreme Multi-Label ClassificationLanguage Modeling | CodeCode Available | 1 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 |
| SentiX: A Sentiment-Aware Pre-Trained Model for Cross-Domain Sentiment Analysis | Dec 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |