SOTAVerified

Multiple-choice

Papers

Showing 801810 of 1107 papers

TitleStatusHype
A Weak Supervision Approach for Predicting Difficulty of Technical Interview Questions0
Bayesian Statistical Modeling with Predictors from LLMs0
Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets0
Benchmarking Bias in Large Language Models during Role-Playing0
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models0
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions0
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations0
Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items0
Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the Ocean, the Brazilian Coast, and Climate Change0
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering0
Show:102550
← PrevPage 81 of 111Next →

No leaderboard results yet.