| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 | 5 |
| Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems | Jul 12, 2019 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation | Apr 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| How Much Knowledge Can You Pack Into the Parameters of a Language Model? | Feb 10, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| GUing: A Mobile GUI Search Engine using a Vision-Language Model | Apr 30, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| AMPERSAND: Argument Mining for PERSuAsive oNline Discussions | Apr 30, 2020 | Argument Mining, Language Modeling | Code Available | 1 | 5 |
| gzip Predicts Data-dependent Scaling Laws | May 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |