SOTAVerified

Multiple-choice

Papers

Showing 801850 of 1107 papers

TitleStatusHype
SQuALITY: Building a Long-Document Summarization Dataset the Hard WayCode1
FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain DialogueCode1
Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning0
Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation0
Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Answer-level Calibration for Free-form Multiple Choice Question AnsweringCode0
Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension0
Clues Before Answers: Generation-Enhanced Multiple-Choice QACode1
Flamingo: a Visual Language Model for Few-Shot LearningCode4
Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions0
No Task Left Behind: Multi-Task Learning of Knowledge Tracing and Option Tracing for Better Student Assessment0
Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Evaluating Prompts Across Multiple Choice Tasks In a Zero-Shot SettingCode0
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question AnsweringCode2
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense0
AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading ComprehensionCode1
All in One: Exploring Unified Video-Language Pre-trainingCode2
What Makes Reading Comprehension Questions Difficult?Code0
A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions0
Aryl: An Elastic Cluster Scheduler for Deep Learning0
NEWSKVQA: Knowledge-Aware News Video Question Answering0
Leaf: Multiple-Choice Question GenerationCode1
Exposing the Limits of Video-Text Models through Contrast Sets0
Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension0
Disaggregating Hops: Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at each Hop?0
MixQG: Neural Question Generation with Mixed Answer Types0
An MRC Framework for Semantic Role Labeling0
Context-guided Triple Matching for Multiple Choice Question Answering0
Bridging Video-text Retrieval with Multiple Choice QuestionsCode1
SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web Search0
Multiple Choice Questions based Multi-Interest Policy Learning for Conversational RecommendationCode1
QuALITY: Question Answering with Long Input Texts, Yes!Code1
Answering Chinese Elementary School Social Studies Multiple Choice Questions0
DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions0
What Makes Machine Reading Comprehension Questions Difficult? Investigating Variation in Passage Sources and Question Types0
Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework0
Unsupervised multiple-choice question generation for out-of-domain Q\&A fine-tuning0
An AI-based Solution for Enhancing Delivery of Digital Learning for Future Teachers0
Surface Form Competition: Why the Highest Probability Answer Isn’t Always RightCode1
Enhancing Multiple-choice Machine Reading Comprehension by Punishing Illogical Interpretations0
A Semantic Feature-Wise Transformation Relation Network for Automatic Short Answer Grading0
Neural Natural Logic Inference for Interpretable Question AnsweringCode0
GANDALF: a General Character Name Description Dataset for Long Fiction0
Narrative Embedding: Re-Contextualization Through Attention0
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question AnsweringCode0
Template Filling for Controllable Commonsense Reasoning0
DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples0
Ranking Facts for Explaining Answers to Elementary Science Questions0
MixQG: Neural Question Generation with Mixed Answer TypesCode1
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization0
Show:102550
← PrevPage 17 of 23Next →

No leaderboard results yet.