| SAS: Self-Augmentation Strategy for Language Model Pre-training | Jun 14, 2021 | Data Augmentation, Language Modeling | Code Available | 0 |
| Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment | Jun 11, 2021 | Denoising, Language Modeling | Code Available | 1 |
| Exploring Unsupervised Pretraining Objectives for Machine Translation | Jun 10, 2021 | Decoder, Language Modeling | Code Available | 0 |
| MST: Masked Self-Supervised Transformer for Visual Representation | Jun 10, 2021 | Language Modeling | Unverified | 0 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base Completion, Language Modeling | Code Available | 0 |
| Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis-Hastings | Jun 4, 2021 | Language Modeling | Unverified | 0 |
| Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene | Jun 4, 2021 | Contrastive Learning, Data Augmentation | Unverified | 0 |
| Luna: Linear Unified Nested Attention | Jun 3, 2021 | Language Modeling | Code Available | 1 |
| BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks | Jun 2, 2021 | Language Modeling | Code Available | 0 |
| MG-BERT: Multi-Graph Augmented BERT for Masked Language Modeling | Jun 1, 2021 | Knowledge Graphs, Language Modeling | Unverified | 0 |