| Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation | Sep 23, 2024 | Multiple-choice, Question Answering | —Unverified | 0 | 0 |
| Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text | May 5, 2025 | Multiple-choice | —Unverified | 0 | 0 |
| Development and Evaluation of a Personalized Computer-aided Question Generation for English Learners to Improve Proficiency and Correct Mistakes | Aug 29, 2018 | Multiple-choice, Question Generation | —Unverified | 0 | 0 |
| DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response | May 26, 2025 | Multiple-choice | —Unverified | 0 | 0 |
| D-GEN: Automatic Distractor Generation and Evaluation for Reliable Assessment of Generative Model | Apr 18, 2025 | Distractor Generation, Multiple-choice | —Unverified | 0 | 0 |
| DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension | May 29, 2024 | Distractor Generation, Multiple-choice | —Unverified | 0 | 0 |
| Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge | Jul 23, 2020 | Diagnostic, Misconceptions | —Unverified | 0 | 0 |
| Dialogue-Based Simulation For Cultural Awareness Training | Feb 1, 2020 | Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Dienstplanerstellung in Krankenhaeusern mittels genetischer Algorithmen | May 30, 2013 | Multiple-choice | —Unverified | 0 | 0 |
| Differentiable Open-Ended Commonsense Reasoning | Oct 24, 2020 | Multiple-choice | —Unverified | 0 | 0 |