SOTAVerified

Multiple-choice

Papers

Showing 681-690 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| Improving the Production Efficiency and Well-formedness of Automatically-Generated Multiple-Choice Cloze Vocabulary Questions | | 0 |
| In Case You Missed It: ARC 'Challenge' Is Not That Challenging | | 0 |
| TVBench: Redesigning Video-Language Evaluation | | 0 |
| Indirect Identification of Psychosocial Risks from Natural Language | | 0 |
| Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection | | 0 |
| Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions | | 0 |
| InnerThoughts: Disentangling Representations and Predictions in Large Language Models | | 0 |
| InstructionBench: An Instructional Video Understanding Benchmark | | 0 |
| Instruction Tuning and CoT Prompting for Contextual Medical QA with LLMs | | 0 |
| Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh | | 0 |

No leaderboard results yet.