SOTAVerified

Multiple-choice

Papers

Showing 201250 of 1107 papers

TitleStatusHype
PADL: Language-Directed Physics-Based Character ControlCode1
GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA CapabilitiesCode1
Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over TextCode1
GPT Takes the Bar ExamCode1
Large Language Models Encode Clinical KnowledgeCode1
Training Trajectories of Language Models Across ScalesCode1
Evaluating the Knowledge Dependency of QuestionsCode1
Leveraging Large Language Models for Multiple Choice Question AnsweringCode1
EduQG: A Multi-format Multiple Choice Dataset for the Educational DomainCode1
Variational Open-Domain Question AnsweringCode1
Can large language models reason about medical questions?Code1
CC-Riddle: A Question Answering Dataset of Chinese Character RiddlesCode1
SQuALITY: Building a Long-Document Summarization Dataset the Hard WayCode1
FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain DialogueCode1
Clues Before Answers: Generation-Enhanced Multiple-Choice QACode1
AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading ComprehensionCode1
Leaf: Multiple-Choice Question GenerationCode1
Bridging Video-text Retrieval with Multiple Choice QuestionsCode1
Multiple Choice Questions based Multi-Interest Policy Learning for Conversational RecommendationCode1
QuALITY: Question Answering with Long Input Texts, Yes!Code1
Surface Form Competition: Why the Highest Probability Answer Isn’t Always RightCode1
MixQG: Neural Question Generation with Mixed Answer TypesCode1
A Few More Examples May Be Worth Billions of ParametersCode1
An MRC Framework for Semantic Role LabelingCode1
ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive SummarizationCode1
General-Purpose Question-Answering with MacawCode1
TIMEDIAL: Temporal Commonsense Reasoning in DialogCode1
NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based SimulationCode1
Option Tracing: Beyond Correctness Analysis in Knowledge TracingCode1
When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD DatasetCode1
Surface Form Competition: Why the Highest Probability Answer Isn't Always RightCode1
What to Pre-Train on? Efficient Intermediate Task SelectionCode1
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense ReasoningCode1
Quiz-Style Question Generation for News StoriesCode1
TSQA: Tabular Scenario Based Question AnsweringCode1
Explaining NLP Models via Minimal Contrastive Editing (MiCE)Code1
Option Tracing: Beyond Binary Knowledge TracingCode1
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian LanguagesCode1
A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies.Code1
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement LearningCode1
Counterfactual Variable Control for Robust and Interpretable Question AnsweringCode1
A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training StrategiesCode1
FarsTail: A Persian Natural Language Inference DatasetCode1
Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze RewardCode1
UnifiedQA: Crossing Format Boundaries With a Single QA SystemCode1
LifeQA: A Real-life Dataset for Video Question AnsweringCode1
Simulated Annealing Algorithm for the Multiple Choice Multidimensional Knapsack ProblemCode1
STARC: Structured Annotations for Reading ComprehensionCode1
Logic-Guided Data Augmentation and Regularization for Consistent Question AnsweringCode1
From Machine Reading Comprehension to Dialogue State Tracking: Bridging the GapCode1
Show:102550
← PrevPage 5 of 23Next →

No leaderboard results yet.