| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Hallucinations in Large Multilingual Translation Models | Mar 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Guiding Attention for Self-Supervised Learning with Transformers | Oct 6, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Guiding Pretraining in Reinforcement Learning with Large Language Models | Feb 13, 2023 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Oct 6, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| GUing: A Mobile GUI Search Engine using a Vision-Language Model | Apr 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AdaSplash: Adaptive Sparse Flash Attention | Feb 17, 2025 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |