| CinePile: A Long Video Question Answering Dataset and Benchmark | May 14, 2024 | FormHuman-Object Interaction Detection | —Unverified | 0 | 0 |
| Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents | Apr 5, 2024 | Multiple-choiceNavigate | —Unverified | 0 | 0 |
| ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases | May 30, 2025 | Medical Question AnsweringMultiple-choice | —Unverified | 0 | 0 |
| An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension | Aug 20, 2020 | Machine Reading ComprehensionMultiple-choice | —Unverified | 0 | 0 |
| CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering | Jan 2, 2025 | Multiple-choiceQuestion Answering | —Unverified | 0 | 0 |
| Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension | Mar 30, 2022 | Data AugmentationMachine Reading Comprehension | —Unverified | 0 | 0 |
| Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension | May 1, 2022 | Data AugmentationMachine Reading Comprehension | —Unverified | 0 | 0 |
| Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge | Feb 5, 2021 | AI2 Reasoning ChallengeARC | —Unverified | 0 | 0 |
| A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions | Mar 3, 2022 | Active LearningMultiple-choice | —Unverified | 0 | 0 |
| CoddLLM: Empowering Large Language Models for Data Analytics | Feb 1, 2025 | Multiple-choiceSynthetic Data Generation | —Unverified | 0 | 0 |