| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPULanguage Modeling | CodeCode Available | 1 |
| Beheshti-NER: Persian Named Entity Recognition Using BERT | Mar 19, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| ReZero is All You Need: Fast Convergence at Large Depth | Mar 10, 2020 | AllLanguage Modeling | CodeCode Available | 1 |
| ProGen: Language Modeling for Protein Generation | Mar 8, 2020 | DiversityLanguage Modeling | CodeCode Available | 1 |
| RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System | Mar 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Talking-Heads Attention | Mar 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data AugmentationDiversity | CodeCode Available | 1 |
| Understanding Contexts Inside Robot and Human Manipulation Tasks through a Vision-Language Model and Ontology System in a Video Stream | Mar 2, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training | Feb 28, 2020 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 |