SOTAVerified

Multiple-choice

Papers

Showing 1071–1080 of 1107 papers

| Title | Status | Hype |
|---|---|---|
| Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack | | 0 |
| Analysing the Effect of Masking Length Distribution of MLM: An Evaluation Framework and Case Study on Chinese MRC Datasets | | 0 |
| Improving the Production Efficiency and Well-formedness of Automatically-Generated Multiple-Choice Cloze Vocabulary Questions | | 0 |
| In Case You Missed It: ARC 'Challenge' Is Not That Challenging | | 0 |
| TVBench: Redesigning Video-Language Evaluation | | 0 |
| Indirect Identification of Psychosocial Risks from Natural Language | | 0 |
| Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection | | 0 |
| Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions | | 0 |
| InnerThoughts: Disentangling Representations and Predictions in Large Language Models | | 0 |
| InstructionBench: An Instructional Video Understanding Benchmark | | 0 |

No leaderboard results yet.