SOTAVerified

Multiple-choice

Papers

Showing 726 to 750 of 1107 papers

Title | Status | Hype
Weighted Global Normalization for Multiple Choice Reading Comprehension over Long Documents | | 0
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets | | 0
Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework | | 0
Statistically Profiling Biases in Natural Language Reasoning Datasets and Models | | 0
Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem | | 0
Stick to your Role! Stability of Personal Values Expressed in Large Language Models | | 0
Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles | | 0
Adapting Vision-Language Models for Evaluating World Models | | 0
Strategyproof Mean Estimation from Multiple-Choice Questions | | 0
Structured Outputs Enable General-Purpose LLMs to be Medical Experts | | 0
What does BERT Learn from Multiple-Choice Reading Comprehension Datasets? | | 0
Superhuman performance of a large language model on the reasoning tasks of a physician | | 0
What do we expect from Multiple-choice QA Systems? | | 0
What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets | | 0
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S | | 0
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference | | 0
SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages | | 0
TabMCQ: A Dataset of General Knowledge Tables and Multiple-choice Questions | | 0
TA-MAMC at SemEval-2021 Task 4: Task-adaptive Pretraining and Multi-head Attention for Abstract Meaning Reading Comprehension | | 0
Task-Adaptive Pretrained Language Models via Clustered-Importance Sampling | | 0
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine | | 0
Tell Me Who Your Students Are: GPT Can Generate Valid Multiple-Choice Questions When Students' (Mis)Understanding Is Hinted | | 0
Empowering Sentence Encoders with Prompting and Label Retrieval for Zero-shot Text Classification | | 0
Testing Uncertainty of Large Language Models for Physics Knowledge and Reasoning | | 0
Answering Chinese Elementary School Social Studies Multiple Choice Questions | | 0

No leaderboard results yet.