| CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding | May 23, 2021 | Document Understanding, Domain Adaptation | Code Available | 1 |
| Scatterbrain: Unifying Sparse and Low-rank Attention | May 21, 2021 | Image Generation, Language Modeling | Code Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language Modeling | Code Available | 1 |
| Stage-wise Fine-tuning for Graph-to-Text Generation | May 17, 2021 | Data-to-Text Generation, KB-to-Language Generation | Code Available | 1 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue Generation, Language Modeling | Code Available | 1 |
| Not All Memories are Created Equal: Learning to Forget by Expiring | May 13, 2021 | All, Language Modeling | Code Available | 1 |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | May 12, 2021 | Adversarial Text, Data Augmentation | Code Available | 1 |
| BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? | May 11, 2021 | Language Modeling | Code Available | 1 |
| DocSCAN: Unsupervised Text Classification via Learning from Neighbors | May 9, 2021 | Classification, Clustering | Code Available | 1 |
| Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents | May 9, 2021 | Language Modeling | Code Available | 1 |