| The Use of Artificial Intelligence Tools in Assessing Content Validity: A Comparative Study with Human Experts | Feb 3, 2025 | Multiple-choice, Reading Comprehension | —Unverified | 0 | 0 |
| Bridging Information-Seeking Human Gaze and Machine Reading Comprehension | Sep 30, 2020 | Machine Reading Comprehension, Multiple-choice | —Unverified | 0 | 0 |
| Bridging the Language Gap: Knowledge Injected Multilingual Question Answering | Apr 6, 2023 | Cross-Lingual Transfer, Extractive Question-Answering | —Unverified | 0 | 0 |
| Analysis of the Cambridge Multiple-Choice Questions Reading Dataset with a Focus on Candidate Response Distribution | Jun 22, 2023 | Multiple-choice | —Unverified | 0 | 0 |
| Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams | Apr 4, 2025 | Benchmarking, Management | —Unverified | 0 | 0 |
| Can ChatGPT pass the Vietnamese National High School Graduation Examination? | Jun 15, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Can Crowdsourcing be used for Effective Annotation of Arabic? | May 1, 2014 | Entity Resolution, Multiple-choice | —Unverified | 0 | 0 |
| Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses? | Mar 16, 2023 | Multiple-choice | —Unverified | 0 | 0 |
| The use of large language models to enhance cancer clinical trial educational materials | Dec 2, 2024 | Misinformation, Multiple-choice | —Unverified | 0 | 0 |
| Can Multimodal LLMs do Visual Temporal Understanding and Reasoning? The answer is No! | Jan 18, 2025 | Multiple-choice, Question Answering | —Unverified | 0 | 0 |