SOTAVerified

Multiple-choice

Papers

Showing 431440 of 1107 papers

TitleStatusHype
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration0
On the Principles behind Opinion Dynamics in Multi-Agent Systems of Large Language Models0
QOG:Question and Options Generation based on Language Model0
UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice QuestionsCode0
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?Code0
Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Models0
IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language ModelsCode0
Grade Score: Quantifying LLM Performance in Option SelectionCode0
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food CultureCode1
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment0
Show:102550
← PrevPage 44 of 111Next →

No leaderboard results yet.