| KRISTEVA: Close Reading as a Novel Task for Benchmarking Interpretive Reasoning | May 14, 2025 | Benchmarking, MMLU | —Unverified | 0 |
| E-cheating Prevention Measures: Detection of Cheating at Online Examinations Using Deep Learning Approach -- A Case Study | Jan 25, 2021 | Multiple-choice | —Unverified | 0 |
| Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering | Oct 19, 2020 | Distractor Generation, Language Modeling | —Unverified | 0 |
| Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions | Apr 21, 2020 | Distractor Generation, Learning-To-Rank | —Unverified | 0 |
| Dual Co-Matching Network for Multi-choice Reading Comprehension | Jan 27, 2019 | Machine Reading Comprehension, Multiple-choice | —Unverified | 0 |
| ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning | Mar 31, 2025 | Multiple-choice | —Unverified | 0 |
| DsMCL: Dual-Level Stochastic Multiple Choice Learning for Multi-Modal Trajectory Prediction | Mar 19, 2020 | Multiple-choice, Prediction | —Unverified | 0 |
| DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests | Jan 8, 2025 | Multimodal Reasoning, Multiple-choice | —Unverified | 0 |
| AGenT Zero: Zero-shot Automatic Multiple-Choice Question Generation for Skill Assessments | Nov 25, 2020 | Multiple-choice, Question Generation | —Unverified | 0 |
| DREAM: A Challenge Data Set and Models for Dialogue-Based Reading Comprehension | Mar 1, 2019 | Dialogue Understanding, Multiple-choice | —Unverified | 0 |