| SQuALITY: Building a Long-Document Summarization Dataset the Hard Way | May 23, 2022 | Document SummarizationMultiple-choice | CodeCode Available | 1 |
| FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue | May 12, 2022 | Dialogue UnderstandingDomain Adaptation | CodeCode Available | 1 |
| Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | May 1, 2022 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation | May 1, 2022 | Machine TranslationMultiple-choice | —Unverified | 0 |
| Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension | May 1, 2022 | Data AugmentationMachine Reading Comprehension | —Unverified | 0 |
| Answer-level Calibration for Free-form Multiple Choice Question Answering | May 1, 2022 | FormLanguage Modeling | CodeCode Available | 0 |
| Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension | May 1, 2022 | Machine Reading ComprehensionMultiple-choice | —Unverified | 0 |
| Clues Before Answers: Generation-Enhanced Multiple-Choice QA | Apr 30, 2022 | DecoderMultiple-choice | CodeCode Available | 1 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot LearningGenerative Visual Question Answering | CodeCode Available | 4 |
| Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions | Apr 11, 2022 | Multiple-choiceReading Comprehension | —Unverified | 0 |
| No Task Left Behind: Multi-Task Learning of Knowledge Tracing and Option Tracing for Better Student Assessment | Apr 8, 2022 | Knowledge TracingMultiple-choice | —Unverified | 0 |
| Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension | Mar 30, 2022 | Data AugmentationMachine Reading Comprehension | —Unverified | 0 |
| Evaluating Prompts Across Multiple Choice Tasks In a Zero-Shot Setting | Mar 29, 2022 | Multiple-choice | CodeCode Available | 0 |
| MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering | Mar 27, 2022 | DiversityMultiple-choice | CodeCode Available | 2 |
| A Theoretically Grounded Benchmark for Evaluating Machine Commonsense | Mar 23, 2022 | Generative Question AnsweringMultiple-choice | —Unverified | 0 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical ReasoningMachine Reading Comprehension | CodeCode Available | 1 |
| All in One: Exploring Unified Video-Language Pre-training | Mar 14, 2022 | AllLanguage Modelling | CodeCode Available | 2 |
| What Makes Reading Comprehension Questions Difficult? | Mar 12, 2022 | Logical ReasoningMultiple-choice | CodeCode Available | 0 |
| A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions | Mar 3, 2022 | Active LearningMultiple-choice | —Unverified | 0 |
| Aryl: An Elastic Cluster Scheduler for Deep Learning | Feb 16, 2022 | Deep LearningGPU | —Unverified | 0 |
| NEWSKVQA: Knowledge-Aware News Video Question Answering | Feb 8, 2022 | Common Sense ReasoningManagement | —Unverified | 0 |
| Leaf: Multiple-Choice Question Generation | Jan 22, 2022 | Multiple-choiceQuestion Answering | CodeCode Available | 1 |
| Exposing the Limits of Video-Text Models through Contrast Sets | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension | Jan 16, 2022 | Machine Reading ComprehensionMultiple-choice | —Unverified | 0 |
| Disaggregating Hops: Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at each Hop? | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |