SOTAVerified

Multiple-choice

Papers

Showing 451475 of 1107 papers

TitleStatusHype
Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data0
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions0
Generating Adequate Distractors for Multiple-Choice Questions0
Generating Correct Answers for Progressive Matrices Intelligence Tests0
Generating Diagnostic Multiple Choice Comprehension Cloze Questions0
LLMs May Perform MCQA by Selecting the Least Incorrect Option0
Generating multiple-choice questions for medical question answering with distractors and cue-masking0
ELiRF-UPV at SemEval-2018 Task 11: Machine Comprehension using Commonsense Knowledge0
Generating Questions and Multiple-Choice Answers using Semantic Analysis of Texts0
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions0
Genome-Bench: A Scientific Reasoning Benchmark from Real-World Expert Discussions0
ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions0
A Graph-Guided Reasoning Approach for Open-ended Commonsense Question Answering0
Evaluating Clinical Competencies of Large Language Models with a General Practice Benchmark0
Eliciting Categorical Data for Optimal Aggregation0
GPT-4o System Card0
GPT-4 to GPT-3.5: 'Hold My Scalpel' -- A Look at the Competency of OpenAI's GPT on the Plastic Surgery In-Service Training Exam0
Eigen Values Features for the Classification of Brain Signals corresponding to 2D and 3D Educational Contents0
Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing0
CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models0
GRAF: Graph Retrieval Augmented by Facts for Romanian Legal Multi-Choice Question Answering0
GraphITE: Estimating Individual Effects of Graph-structured Treatments0
Graph-Structured Representations for Visual Question Answering0
IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models0
Show:102550
← PrevPage 19 of 45Next →

No leaderboard results yet.