| Position Masking for Language Models | Jun 2, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| POSTECH-ETRI’s Submission to the WMT2020 APE Shared Task: Automatic Post-Editing with Cross-lingual Language Model | Nov 1, 2020 | Automatic Post-EditingLanguage Modeling | —Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Sep 24, 2021 | DecoderLanguage Modeling | —Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Nov 16, 2021 | DecoderLanguage Modeling | —Unverified | 0 |
| Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs | Jul 22, 2024 | Few-Shot LearningGraph Neural Network | —Unverified | 0 |
| Pretraining Chinese BERT for Detecting Word Insertion and Deletion Errors | Apr 26, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning | Apr 29, 2020 | AllHellaSwag | —Unverified | 0 |
| Pre-training Language Model as a Multi-perspective Course Learner | May 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Probing BERT’s priors with serial reproduction chains | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |