| Large Language Models Often Know When They Are Being Evaluated | May 28, 2025 | MMLUMultiple-choice | —Unverified | 0 |
| Distractor Analysis and Selection for Multiple-Choice Cloze Questions for Second-Language Learners | Jul 1, 2020 | Multiple-choice | —Unverified | 0 |
| DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach | Apr 10, 2023 | Distractor GenerationMachine Translation | —Unverified | 0 |
| Auxiliary Class Based Multiple Choice Learning | Aug 6, 2021 | DiversityEnsemble Learning | —Unverified | 0 |
| Disaggregating Hops: Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at each Hop? | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Improved Traditional Chinese Evaluation Suite for Foundation Model | Mar 4, 2024 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| A Foundational Multimodal Vision Language AI Assistant for Human Pathology | Dec 13, 2023 | Decision MakingDiagnostic | —Unverified | 0 |
| Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions | Aug 22, 2023 | Multiple-choiceSensitivity | —Unverified | 0 |
| Learning Language-Visual Embedding for Movie Understanding with Natural-Language | Sep 26, 2016 | Multiple-choiceRetrieval | —Unverified | 0 |
| Digital Comprehensibility Assessment of Simplified Texts among Persons with Intellectual Disabilities | Feb 20, 2024 | Multiple-choiceText Simplification | —Unverified | 0 |