| Exploring Unsupervised Pretraining Objectives for Machine Translation | Jun 10, 2021 | DecoderLanguage Modeling | CodeCode Available | 0 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base CompletionLanguage Modeling | CodeCode Available | 0 |
| Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene | Jun 4, 2021 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings | Jun 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SCRIPT: Self-Critic PreTraining of Transformers | Jun 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Target-Aware Data Augmentation for Stance Detection | Jun 1, 2021 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| MG-BERT: Multi-Graph Augmented BERT for Masked Language Modeling | Jun 1, 2021 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding | May 15, 2021 | intent-classificationIntent Classification | CodeCode Available | 0 |
| Larger-Scale Transformers for Multilingual Masked Language Modeling | May 2, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |