| mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models | Oct 15, 2021 | Cross-Lingual Question Answering, Cross-Lingual Transfer | Code Available | 1 |
| Tracing Origins: Coreference-aware Machine Reading Comprehension | Oct 15, 2021 | Language Modeling | Code Available | 1 |
| Meta-learning via Language Model In-context Tuning | Oct 15, 2021 | In-Context Learning, Inductive Bias | Code Available | 1 |
| Control Prefixes for Parameter-Efficient Text Generation | Oct 15, 2021 | Abstractive Text Summarization, Attribute | Code Available | 1 |
| Generated Knowledge Prompting for Commonsense Reasoning | Oct 15, 2021 | Language Modeling | Code Available | 1 |
| Composable Sparse Fine-Tuning for Cross-Lingual Transfer | Oct 14, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| Symbolic Knowledge Distillation: from General Language Models to Commonsense Models | Oct 14, 2021 | Knowledge Distillation, Knowledge Graphs | Code Available | 1 |
| UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning | Oct 14, 2021 | Diversity, Language Modeling | Code Available | 1 |
| Time Masking for Temporal Language Models | Oct 12, 2021 | Change Detection, Language Modeling | Code Available | 1 |
| Learning Compact Metrics for MT | Oct 12, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning | Oct 10, 2021 | Articles, Few-Shot Learning | Code Available | 1 |
| Long Expressive Memory for Sequence Modeling | Oct 10, 2021 | Language Modeling | Code Available | 1 |
| Improving Multi-Party Dialogue Discourse Parsing via Domain Integration | Oct 9, 2021 | Discourse Parsing, Domain Adaptation | Code Available | 1 |
| Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors | Oct 8, 2021 | Language Modeling | Code Available | 1 |
| Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling | Oct 7, 2021 | Language Modeling | Code Available | 1 |
| Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings | Oct 7, 2021 | Language Modeling | Code Available | 1 |
| Revisiting Self-Training for Few-Shot Learning of Language Model | Oct 4, 2021 | Benchmarking, Few-Shot Learning | Code Available | 1 |
| JuriBERT: A Masked-Language Model Adaptation for French Legal Text | Oct 4, 2021 | Language Modeling | Code Available | 1 |
| SlovakBERT: Slovak Masked Language Model | Sep 30, 2021 | Language Modeling | Code Available | 1 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | Classification, Decoder | Code Available | 1 |
| MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction | Sep 30, 2021 | Language Modeling | Code Available | 1 |
| Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Sep 27, 2021 | Event Extraction, Language Modeling | Code Available | 1 |
| XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge | Sep 26, 2021 | Language Modeling | Code Available | 1 |
| Extracting and Inferring Personal Attributes from Dialogue | Sep 26, 2021 | Attribute, Language Modeling | Code Available | 1 |