SOTAVerified

Multiple-choice

Papers

Showing 411420 of 1107 papers

TitleStatusHype
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense ScenariosCode0
Investigating the Shortcomings of LLMs in Step-by-Step Legal ReasoningCode0
A large language model-assisted education tool to provide feedback on open-ended responsesCode0
Questioning the Survey Responses of Large Language ModelsCode0
Introducing Flexible Monotone Multiple Choice Item Response Theory Models and Bit ScalesCode0
IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language ModelsCode0
Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQsCode0
Introducing a framework to assess newly created questions with Natural Language ProcessingCode0
iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain TeasersCode0
Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language ModelsCode0
Show:102550
← PrevPage 42 of 111Next →

No leaderboard results yet.