| TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling | Jul 28, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge Distillation, Language Modeling | —Unverified | 0 | 0 |
| Text Style Transfer for Bias Mitigation using Masked Language Modeling | Jan 21, 2022 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives | Sep 3, 2019 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| A Closer Look at Parameter Contributions When Training Neural Language and Translation Models | Oct 1, 2022 | Causal Language Modeling, Language Modeling | —Unverified | 0 | 0 |
| Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling | Jan 18, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics | Jul 31, 2022 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Token Dropping for Efficient BERT Pretraining | Mar 24, 2022 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |