SOTAVerified

Multiple-choice

Papers

Showing 521530 of 1107 papers

TitleStatusHype
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison FeedbackCode0
Evaluating the Instruction-following Abilities of Language Models using Knowledge TasksCode0
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answersCode0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMsCode0
Personalised Feedback Framework for Online Education Programmes Using Generative AI0
Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing0
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models0
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language ModelsCode0
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models0
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language ModelsCode0
Show:102550
← PrevPage 53 of 111Next →

No leaderboard results yet.