| Generating Adequate Distractors for Multiple-Choice Questions | Oct 23, 2020 | FormMultiple-choice | —Unverified | 0 |
| Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs | Feb 12, 2025 | Multiple-choiceSurvey | —Unverified | 0 |
| AI-based Arabic Language and Speech Tutor | Oct 22, 2022 | Multiple-choiceSelf-Learning | —Unverified | 0 |
| Answering Science Exam Questions Using Query Reformulation with Background Knowledge | Nov 17, 2018 | ARCInformation Retrieval | —Unverified | 0 |
| ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition | Oct 8, 2024 | Action RecognitionMultiple-choice | —Unverified | 0 |
| Answering Science Exam Questions Using Query Rewriting with Background Knowledge | Sep 15, 2018 | ARCInformation Retrieval | —Unverified | 0 |
| BloomVQA: Assessing Hierarchical Multi-modal Comprehension | Dec 20, 2023 | Data AugmentationMemorization | —Unverified | 0 |
| AI and Machine Learning for Next Generation Science Assessments | Apr 23, 2024 | Multiple-choice | —Unverified | 0 |
| BLINK: Multimodal Large Language Models Can See but Not Perceive | Apr 18, 2024 | Depth EstimationMultiple-choice | —Unverified | 0 |
| Answering Questions in Stages: Prompt Chaining for Contract QA | Oct 9, 2024 | Multiple-choice | —Unverified | 0 |