| Long Expressive Memory for Sequence Modeling | Oct 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning | Oct 10, 2021 | Articles, Few-Shot Learning | Code Available | 1 |
| Improving Multi-Party Dialogue Discourse Parsing via Domain Integration | Oct 9, 2021 | Discourse Parsing, Domain Adaptation | Code Available | 1 |
| Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors | Oct 8, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling | Oct 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings | Oct 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Revisiting Self-Training for Few-Shot Learning of Language Model | Oct 4, 2021 | Benchmarking, Few-Shot Learning | Code Available | 1 |
| JuriBERT: A Masked-Language Model Adaptation for French Legal Text | Oct 4, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| SlovakBERT: Slovak Masked Language Model | Sep 30, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | Classification, Decoder | Code Available | 1 |