SOTAVerified

Multiple-choice

Papers

Showing 501–510 of 1107 papers

Title | Status | Hype
Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom | Code | 1
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models | - | 0
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Code | 2
From Multiple-Choice to Extractive QA: A Case Study for English and Arabic | Code | 0
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | - | 0
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension | Code | 3
AI and Machine Learning for Next Generation Science Assessments | - | 0
TAXI: Evaluating Categorical Knowledge Editing for Language Models | Code | 0
UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions | Code | 0
Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank | - | 0

No leaderboard results yet.