| SQuALITY: Building a Long-Document Summarization Dataset the Hard Way | May 23, 2022 | Document Summarization, Multiple-choice | Code Available | 1 |
| FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue | May 12, 2022 | Dialogue Understanding, Domain Adaptation | Code Available | 1 |
| Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | May 1, 2022 | Multiple-choice, Question Answering | Unverified | 0 |
| Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation | May 1, 2022 | Machine Translation, Multiple-choice | Unverified | 0 |
| Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension | May 1, 2022 | Data Augmentation, Machine Reading Comprehension | Unverified | 0 |
| Answer-level Calibration for Free-form Multiple Choice Question Answering | May 1, 2022 | Form, Language Modeling | Code Available | 0 |
| Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension | May 1, 2022 | Machine Reading Comprehension, Multiple-choice | Unverified | 0 |
| Clues Before Answers: Generation-Enhanced Multiple-Choice QA | Apr 30, 2022 | Decoder, Multiple-choice | Code Available | 1 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot Learning, Generative Visual Question Answering | Code Available | 4 |
| Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions | Apr 11, 2022 | Multiple-choice, Reading Comprehension | Unverified | 0 |