| Tracing Origins: Coreference-aware Machine Reading Comprehension | Oct 15, 2021 | Language Modeling | Code Available | 1 |
| Control Prefixes for Parameter-Efficient Text Generation | Oct 15, 2021 | Abstractive Text Summarization, Attribute | Code Available | 1 |
| Generated Knowledge Prompting for Commonsense Reasoning | Oct 15, 2021 | Language Modeling | Code Available | 1 |
| Meta-learning via Language Model In-context Tuning | Oct 15, 2021 | In-Context Learning, Inductive Bias | Code Available | 1 |
| mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models | Oct 15, 2021 | Cross-Lingual Question Answering, Cross-Lingual Transfer | Code Available | 1 |
| Composable Sparse Fine-Tuning for Cross-Lingual Transfer | Oct 14, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning | Oct 14, 2021 | Diversity, Language Modeling | Code Available | 1 |
| Symbolic Knowledge Distillation: from General Language Models to Commonsense Models | Oct 14, 2021 | Knowledge Distillation, Knowledge Graphs | Code Available | 1 |
| Learning Compact Metrics for MT | Oct 12, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| Time Masking for Temporal Language Models | Oct 12, 2021 | Change Detection, Language Modeling | Code Available | 1 |
| Yuan 1.0: Large-Scale Pre-trained Language Model in Zero-Shot and Few-Shot Learning | Oct 10, 2021 | Articles, Few-Shot Learning | Code Available | 1 |
| Long Expressive Memory for Sequence Modeling | Oct 10, 2021 | Language Modeling | Code Available | 1 |
| Improving Multi-Party Dialogue Discourse Parsing via Domain Integration | Oct 9, 2021 | Discourse Parsing, Domain Adaptation | Code Available | 1 |
| Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors | Oct 8, 2021 | Language Modeling | Code Available | 1 |
| Layer-wise Pruning of Transformer Attention Heads for Efficient Language Modeling | Oct 7, 2021 | Language Modeling | Code Available | 1 |
| Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings | Oct 7, 2021 | Language Modeling | Code Available | 1 |
| JuriBERT: A Masked-Language Model Adaptation for French Legal Text | Oct 4, 2021 | Language Modeling | Code Available | 1 |
| Revisiting Self-Training for Few-Shot Learning of Language Model | Oct 4, 2021 | Benchmarking, Few-Shot Learning | Code Available | 1 |
| SlovakBERT: Slovak Masked Language Model | Sep 30, 2021 | Language Modeling | Code Available | 1 |
| MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction | Sep 30, 2021 | Language Modeling | Code Available | 1 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | Classification, Decoder | Code Available | 1 |
| Factorized Neural Transducer for Efficient Language Model Adaptation | Sep 27, 2021 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Sep 27, 2021 | Event Extraction, Language Modeling | Code Available | 1 |
| XLM-K: Improving Cross-Lingual Language Model Pre-training with Multilingual Knowledge | Sep 26, 2021 | Language Modeling | Code Available | 1 |
| Extracting and Inferring Personal Attributes from Dialogue | Sep 26, 2021 | Attribute, Language Modeling | Code Available | 1 |
| DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language Modeling | Code Available | 1 |
| Zero-Shot Information Extraction as a Unified Text-to-Triple Translation | Sep 23, 2021 | Factual probe, Language Modeling | Code Available | 1 |
| Pix2seq: A Language Modeling Framework for Object Detection | Sep 22, 2021 | Language Modeling | Code Available | 1 |
| TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models | Sep 21, 2021 | Handwritten Text Recognition, Language Modeling | Code Available | 1 |
| JobBERT: Understanding Job Titles through Skills | Sep 20, 2021 | Language Modeling | Code Available | 1 |
| Distilling Linguistic Context for Language Model Compression | Sep 17, 2021 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Sep 16, 2021 | Language Modeling | Code Available | 1 |
| SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations | Sep 15, 2021 | CoLA, Contrastive Learning | Code Available | 1 |
| Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Sep 15, 2021 | Dialogue State Tracking, Language Modeling | Code Available | 1 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language Modeling | Code Available | 1 |
| LM-Critic: Language Models for Unsupervised Grammatical Error Correction | Sep 14, 2021 | Grammatical Error Correction, Language Modeling | Code Available | 1 |
| Types of Out-of-Distribution Texts and How to Detect Them | Sep 14, 2021 | Density Estimation, Language Modeling | Code Available | 1 |
| Rationales for Sequential Predictions | Sep 14, 2021 | Combinatorial Optimization, Language Modeling | Code Available | 1 |
| xGQA: Cross-Lingual Visual Question Answering | Sep 13, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 |
| Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning | Sep 13, 2021 | Language Modeling | Code Available | 1 |
| Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models | Sep 13, 2021 | Data Augmentation, Diversity | Code Available | 1 |
| TEASEL: A Transformer-Based Speech-Prefixed Language Model | Sep 12, 2021 | Language Modeling | Code Available | 1 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language Modeling | Code Available | 1 |
| Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training | Sep 10, 2021 | Language Modeling | Code Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language Modeling | Code Available | 1 |
| BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation | Sep 9, 2021 | de-en, Language Modeling | Code Available | 1 |
| Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Sep 9, 2021 | Language Modeling | Code Available | 1 |
| Efficient Nearest Neighbor Language Models | Sep 9, 2021 | Domain Adaptation, Language Modeling | Code Available | 1 |
| AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models | Sep 9, 2021 | Language Modeling | Code Available | 1 |