| Distilling Linguistic Context for Language Model Compression | Sep 17, 2021 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Sep 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Sep 15, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SupCL-Seq: Supervised Contrastive Learning for Downstream Optimized Sequence Representations | Sep 15, 2021 | CoLAContrastive Learning | CodeCode Available | 1 |
| Types of Out-of-Distribution Texts and How to Detect Them | Sep 14, 2021 | Density EstimationLanguage Modeling | CodeCode Available | 1 |
| Rationales for Sequential Predictions | Sep 14, 2021 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 1 |
| LM-Critic: Language Models for Unsupervised Grammatical Error Correction | Sep 14, 2021 | Grammatical Error CorrectionLanguage Modeling | CodeCode Available | 1 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 |
| Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models | Sep 13, 2021 | Data AugmentationDiversity | CodeCode Available | 1 |