SOTAVerified

Multiple-choice

Papers

Showing 226250 of 1107 papers

TitleStatusHype
Leveraging Large Language Models for Learning Complex Legal Concepts through StorytellingCode1
Logic-Guided Data Augmentation and Regularization for Consistent Question AnsweringCode1
Enhancing Knowledge Tracing with Concept Map and Response DisentanglementCode1
Clues Before Answers: Generation-Enhanced Multiple-Choice QACode1
CUPCase: Clinically Uncommon Patient Cases and Diagnoses DatasetCode1
CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and ReasoningCode1
JMedLoRA:Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuningCode1
Fool Your (Vision and) Language Model With Embarrassingly Simple PermutationsCode1
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language ModelsCode1
Training Trajectories of Language Models Across ScalesCode1
Embedding Trajectory for Out-of-Distribution Detection in Mathematical ReasoningCode1
Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze RewardCode1
TSQA: Tabular Scenario Based Question AnsweringCode1
Counterfactual Variable Control for Robust and Interpretable Question AnsweringCode1
A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training StrategiesCode1
Language Model Uncertainty Quantification with Attention ChainCode1
Large Language Models Encode Clinical KnowledgeCode1
CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeCode1
LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language ModelsCode1
Complex Reasoning over Logical Queries on Commonsense Knowledge GraphsCode1
Assessing the Chemical Intelligence of Large Language ModelsCode1
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?Code1
MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE FrameworkCode1
Constructing Narrative Event Evolutionary Graph for Script Event PredictionCode1
Mobile-MMLU: A Mobile Intelligence Language Understanding BenchmarkCode1
Show:102550
← PrevPage 10 of 45Next →

No leaderboard results yet.