SOTAVerified

Multiple-choice

Papers

Showing 211220 of 1107 papers

TitleStatusHype
Counterfactual Variable Control for Robust and Interpretable Question AnsweringCode1
Language Model Uncertainty Quantification with Attention ChainCode1
A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies.Code1
AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language ModelsCode1
Complex Reasoning over Logical Queries on Commonsense Knowledge GraphsCode1
Large Language Models Encode Clinical KnowledgeCode1
CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language ModelsCode1
GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA CapabilitiesCode1
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-trainingCode1
JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuningCode1
Show:102550
← PrevPage 22 of 111Next →

No leaderboard results yet.