SOTAVerified

Multiple-choice

Papers

Showing 771780 of 1107 papers

TitleStatusHype
Performance of ChatGPT-3.5 and GPT-4 on the United States Medical Licensing Examination With and Without Distractions0
INCEPTNET: Precise And Early Disease Detection Application For Medical Images AnalysesCode0
An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models0
Generalised Winograd Schema and its Contextuality0
Spoken Language Intelligence of Large Language Models for Language LearningCode0
Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions0
A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context LearningCode0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and RetrievalCode0
ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understandingCode0
Show:102550
← PrevPage 78 of 111Next →

No leaderboard results yet.