SOTAVerified

Multiple-choice

Papers

Showing 481-490 of 1107 papers

Title | Status | Hype
Eliciting Informative Text Evaluations with Large Language Models | Code | 0
Imagery as Inquiry: Exploring A Multimodal Dataset for Conversational Recommendation | - | 0
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | Code | 2
Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning | Code | 1
Robust portfolio optimization model for electronic coupon allocation | - | 0
Multiple-Choice Questions are Efficient and Robust LLM Evaluators | Code | 1
Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications | - | 0
From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT | - | 0
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | Code | 3
COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain | - | 0

No leaderboard results yet.