| Zero-shot Event Causality Identification with Question Answering | Sep 1, 2022 | ArticlesEvent Causality Identification | —Unverified | 0 | 0 |
| DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classificationImage Classification | —Unverified | 0 | 0 |
| ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning | Mar 31, 2025 | Multiple-choice | —Unverified | 0 | 0 |
| Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond | Oct 23, 2023 | counterfactualMultiple-choice | —Unverified | 0 | 0 |
| Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context | Jun 10, 2024 | Decision MakingMultiple-choice | —Unverified | 0 | 0 |
| Deep learning for sentence clustering in essay grading support | Apr 23, 2021 | ClusteringDeep Learning | —Unverified | 0 | 0 |
| DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions | Nov 19, 2021 | Contrastive LearningMultiple-choice | —Unverified | 0 | 0 |
| DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning | Feb 25, 2025 | ManagementMultiple-choice | —Unverified | 0 | 0 |
| Designing Templates for Eliciting Commonsense Knowledge from Pretrained Sequence-to-Sequence Models | Dec 1, 2020 | Multiple-choiceNatural Language Understanding | —Unverified | 0 | 0 |
| DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |