| SQuALITY: Building a Long-Document Summarization Dataset the Hard Way | May 23, 2022 | Document Summarization, Multiple-choice | Code Available | 1 |
| FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue | May 12, 2022 | Dialogue Understanding, Domain Adaptation | Code Available | 1 |
| Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | May 1, 2022 | Multiple-choice, Question Answering | Unverified | 0 |
| Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation | May 1, 2022 | Machine Translation, Multiple-choice | Unverified | 0 |
| Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension | May 1, 2022 | Data Augmentation, Machine Reading Comprehension | Unverified | 0 |
| Answer-level Calibration for Free-form Multiple Choice Question Answering | May 1, 2022 | Form, Language Modeling | Code Available | 0 |
| Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension | May 1, 2022 | Machine Reading Comprehension, Multiple-choice | Unverified | 0 |
| Clues Before Answers: Generation-Enhanced Multiple-Choice QA | Apr 30, 2022 | Decoder, Multiple-choice | Code Available | 1 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot Learning, Generative Visual Question Answering | Code Available | 4 |
| Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions | Apr 11, 2022 | Multiple-choice, Reading Comprehension | Unverified | 0 |
| No Task Left Behind: Multi-Task Learning of Knowledge Tracing and Option Tracing for Better Student Assessment | Apr 8, 2022 | Knowledge Tracing, Multiple-choice | Unverified | 0 |
| Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension | Mar 30, 2022 | Data Augmentation, Machine Reading Comprehension | Unverified | 0 |
| Evaluating Prompts Across Multiple Choice Tasks In a Zero-Shot Setting | Mar 29, 2022 | Multiple-choice | Code Available | 0 |
| MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering | Mar 27, 2022 | Diversity, Multiple-choice | Code Available | 2 |
| A Theoretically Grounded Benchmark for Evaluating Machine Commonsense | Mar 23, 2022 | Generative Question Answering, Multiple-choice | Unverified | 0 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical Reasoning, Machine Reading Comprehension | Code Available | 1 |
| All in One: Exploring Unified Video-Language Pre-training | Mar 14, 2022 | All, Language Modelling | Code Available | 2 |
| What Makes Reading Comprehension Questions Difficult? | Mar 12, 2022 | Logical Reasoning, Multiple-choice | Code Available | 0 |
| A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions | Mar 3, 2022 | Active Learning, Multiple-choice | Unverified | 0 |
| Aryl: An Elastic Cluster Scheduler for Deep Learning | Feb 16, 2022 | Deep Learning, GPU | Unverified | 0 |
| NEWSKVQA: Knowledge-Aware News Video Question Answering | Feb 8, 2022 | Common Sense Reasoning, Management | Unverified | 0 |
| Leaf: Multiple-Choice Question Generation | Jan 22, 2022 | Multiple-choice, Question Answering | Code Available | 1 |
| Exposing the Limits of Video-Text Models through Contrast Sets | Jan 16, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension | Jan 16, 2022 | Machine Reading Comprehension, Multiple-choice | Unverified | 0 |
| Disaggregating Hops: Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at each Hop? | Jan 16, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| MixQG: Neural Question Generation with Mixed Answer Types | Jan 16, 2022 | Multiple-choice, Question Answering | Unverified | 0 |
| An MRC Framework for Semantic Role Labeling | Jan 16, 2022 | Computational Efficiency, Machine Reading Comprehension | Unverified | 0 |
| Context-guided Triple Matching for Multiple Choice Question Answering | Jan 16, 2022 | Benchmarking, Multiple-choice | Unverified | 0 |
| Bridging Video-text Retrieval with Multiple Choice Questions | Jan 13, 2022 | Action Recognition, Linear evaluation | Code Available | 1 |
| SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web Search | Jan 7, 2022 | Information Retrieval, Multiple-choice | Unverified | 0 |
| Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation | Dec 22, 2021 | Attribute, Conversational Recommendation | Code Available | 1 |
| QuALITY: Question Answering with Long Input Texts, Yes! | Dec 16, 2021 | Multiple-choice, Multiple Choice Question Answering (MCQA) | Code Available | 1 |
| Answering Chinese Elementary School Social Studies Multiple Choice Questions | Dec 1, 2021 | Multiple-choice | Unverified | 0 |
| DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions | Nov 19, 2021 | Contrastive Learning, Multiple-choice | Unverified | 0 |
| What Makes Machine Reading Comprehension Questions Difficult? Investigating Variation in Passage Sources and Question Types | Nov 16, 2021 | Logical Reasoning, Machine Reading Comprehension | Unverified | 0 |
| Fill-in-the-Blank: A Challenging Video Understanding Evaluation Framework | Nov 16, 2021 | Multiple-choice, Question Answering | Unverified | 0 |
| Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | Nov 16, 2021 | Multiple-choice, Question Answering | Unverified | 0 |
| An AI-based Solution for Enhancing Delivery of Digital Learning for Future Teachers | Nov 9, 2021 | Multiple-choice, Question Generation | Unverified | 0 |
| Surface Form Competition: Why the Highest Probability Answer Isn’t Always Right | Nov 1, 2021 | Form, Multiple-choice | Code Available | 1 |
| Enhancing Multiple-choice Machine Reading Comprehension by Punishing Illogical Interpretations | Nov 1, 2021 | Attribute, Machine Reading Comprehension | Unverified | 0 |
| A Semantic Feature-Wise Transformation Relation Network for Automatic Short Answer Grading | Nov 1, 2021 | automatic short answer grading, Data Augmentation | Unverified | 0 |
| Neural Natural Logic Inference for Interpretable Question Answering | Nov 1, 2021 | Multiple-choice, Natural Language Inference | Code Available | 0 |
| GANDALF: a General Character Name Description Dataset for Long Fiction | Nov 1, 2021 | Multiple-choice, Question Answering | Unverified | 0 |
| Narrative Embedding: Re-Contextualization Through Attention | Nov 1, 2021 | Multiple-choice, Question Answering | Unverified | 0 |
| MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering | Nov 1, 2021 | multimodal interaction, Multiple-choice | Code Available | 0 |
| Template Filling for Controllable Commonsense Reasoning | Oct 31, 2021 | Multiple-choice | Unverified | 0 |
| DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples | Oct 26, 2021 | Multiple-choice, Semi-Supervised Image Classification | Unverified | 0 |
| Ranking Facts for Explaining Answers to Elementary Science Questions | Oct 18, 2021 | Interpretable Machine Learning, Learning-To-Rank | Unverified | 0 |
| MixQG: Neural Question Generation with Mixed Answer Types | Oct 15, 2021 | Multiple-choice, Question Answering | Code Available | 1 |
| Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization | Oct 13, 2021 | Multiple-choice, Quantization | Unverified | 0 |