SOTAVerified

Multiple-choice

Papers

Showing 911-920 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| TAXI: Evaluating Categorical Knowledge Editing for Language Models | Code | 0 |
| WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging | Code | 0 |
| What Makes Reading Comprehension Questions Easier? | Code | 0 |
| Downstream Trade-offs of a Family of Text Watermarks | Code | 0 |
| Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches | Code | 0 |
| A multimodal dataset for understanding the impact of mobile phones on remote online virtual education | Code | 0 |
| Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning | Code | 0 |
| Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty? | Code | 0 |
| Differentiating Choices via Commonality for Multiple-Choice Question Answering | Code | 0 |
| Utilizing Background Knowledge for Robust Reasoning over Traffic Situations | Code | 0 |

No leaderboard results yet.