SOTAVerified

Multiple-choice

Papers

Showing 471–480 of 1107 papers

Title | Status | Hype
Improving Question Answering with External Knowledge | Code | 0
Introducing a framework to assess newly created questions with Natural Language Processing | Code | 0
CRiskEval: A Chinese Multi-Level Risk Evaluation Benchmark Dataset for Large Language Models | Code | 0
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning | Code | 0
How much do LLMs learn from negative examples? | Code | 0
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy | Code | 0
How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making? | Code | 0
VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models | Code | 0
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs | Code | 0
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors | Code | 0

No leaderboard results yet.