SOTAVerified

Multiple-choice

Papers

Showing 791–800 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses | | 0 |
| Can ChatGPT pass the Vietnamese National High School Graduation Examination? | | 0 |
| Questioning the Survey Responses of Large Language Models | Code | 0 |
| Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination | | 0 |
| Network-based Representations and Dynamic Discrete Choice Models for Multiple Discrete Choice Analysis | | 0 |
| BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering | Code | 0 |
| Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy | Code | 0 |
| Have Large Language Models Developed a Personality?: Applicability of Self-Assessment Tests in Measuring Personality in LLMs | | 0 |
| ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind | Code | 0 |
| This Land is {Your, My} Land: Evaluating Geopolitical Biases in Language Models | Code | 0 |

No leaderboard results yet.