SOTAVerified

Multiple-choice

Papers

Showing 951–960 of 1107 papers

Title | Status | Hype
Enhancing Multiple-Choice Question Answering with Causal Knowledge | | 0
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation | | 0
EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta | | 0
Establishing Task Scaling Laws via Compute-Efficient Model Ladders | | 0
Towards Conversational AI for Disease Management | | 0
Evalita-LLM: Benchmarking Large Language Models on Italian | | 0
Towards Decision Support Technology Platform for Modular Systems | | 0
Evaluating LLM-corrupted Crowdsourcing Data Without Ground Truth | | 0
Evaluating LLM -- Generated Multimodal Diagnosis from Medical Images and Symptom Analysis | | 0
Evaluating LLMs on Document-Based QA: Exact Answer Selection and Numerical Extraction using Cogtale dataset | | 0

No leaderboard results yet.