| Promises, Outlooks and Challenges of Diffusion Language Modeling | Jun 17, 2024 | ARCHellaSwag | —Unverified | 0 |
| SaGE: Evaluating Moral Consistency in Large Language Models | Feb 21, 2024 | Decision MakingHellaSwag | CodeCode Available | 0 |
| Attacks on Node Attributes in Graph Neural Networks | Feb 19, 2024 | Contrastive LearningHellaSwag | CodeCode Available | 0 |
| Who's Harry Potter? Approximate Unlearning in LLMs | Oct 3, 2023 | ARCGPU | —Unverified | 0 |
| Contrastive Decoding Improves Reasoning in Large Language Models | Sep 17, 2023 | GSM8KHellaSwag | —Unverified | 0 |
| In-Contextual Gender Bias Suppression for Large Language Models | Sep 13, 2023 | counterfactualData Augmentation | CodeCode Available | 0 |
| Toward Adversarial Training on Contextualized Language Representation | May 8, 2023 | Decoderglobal-optimization | CodeCode Available | 0 |
| GraDA: Graph Generative Data Augmentation for Commonsense Reasoning | Oct 1, 2022 | Data AugmentationHellaSwag | CodeCode Available | 0 |
| On Curriculum Learning for Commonsense Reasoning | Jul 1, 2022 | HellaSwagLearning-To-Rank | CodeCode Available | 0 |
| When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation | Nov 16, 2021 | Data AugmentationHellaSwag | —Unverified | 0 |
| Comparing Test Sets with Item Response Theory | Jun 1, 2021 | HellaSwagNatural Language Understanding | —Unverified | 0 |
| English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too | May 26, 2020 | Cross-Lingual TransferHellaSwag | —Unverified | 0 |
| Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning | Apr 29, 2020 | AllHellaSwag | —Unverified | 0 |
| HellaSwag: Can a Machine Really Finish Your Sentence? | May 19, 2019 | HellaSwagNatural Language Inference | CodeCode Available | 0 |