SOTAVerified

Multiple-choice

Papers

Showing 726-750 of 1107 papers

Title | Status | Hype
Make a Choice! Knowledge Base Question Answering with In-Context Learning | - | 0
Query Rewriting for Retrieval-Augmented Large Language Models | - | 0
NarrativeXL: A Large-scale Dataset For Long-Term Memory Models | Code | 1
Iterative Forward Tuning Boosts In-Context Learning in Language Models | Code | 0
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models | Code | 1
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models | Code | 1
A quantitative study of NLP approaches to question difficulty estimation | Code | 0
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models | Code | 3
EMBRACE: Evaluation and Modifications for Boosting RACE | Code | 0
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting | Code | 1
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic | Code | 1
Contextual Response Interpretation for Automated Structured Interviews: A Case Study in Market Research | - | 0
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | - | 0
Analyzing the Performance of ChatGPT in Cardiology and Vascular Pathologies | - | 0
Prompt Engineering and Calibration for Zero-Shot Commonsense Reasoning | - | 0
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach | - | 0
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain | Code | 0
Bridging the Language Gap: Knowledge Injected Multilingual Question Answering | - | 0
GPT-4 to GPT-3.5: 'Hold My Scalpel' -- A Look at the Competency of OpenAI's GPT on the Plastic Surgery In-Service Training Exam | - | 0
A Multiple Choices Reading Comprehension Corpus for Vietnamese Language Education | Code | 0
Evaluating GPT-3.5 and GPT-4 Models on Brazilian University Admission Exams | Code | 1
Explicit Planning Helps Language Models in Logical Reasoning | Code | 1
Automatic Generation of Multiple-Choice Questions | - | 0
A Graph-Guided Reasoning Approach for Open-ended Commonsense Question Answering | - | 0
Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses? | - | 0

No leaderboard results yet.