SOTAVerified

Multiple-choice

Papers

Showing 961–970 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| Evaluating Machine Reading Systems through Comprehension Tests | | 0 |
| Evaluating multiple large language models in pediatric ophthalmology | | 0 |
| Evaluating Nuanced Bias in Large Language Model Free Response Answers | | 0 |
| Evaluating Question Answering Evaluation | | 0 |
| A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults | | 0 |
| Evaluating the Performance and Robustness of LLMs in Materials Science Q&A and Property Predictions | | 0 |
| Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions | | 0 |
| Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension | | 0 |
| Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education | | 0 |
| Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms | | 0 |

No leaderboard results yet.