| Sequence tagging for biomedical extractive question answering | Apr 15, 2021 | Extractive Question-Answering, Question Answering | Code Available | 1 |
| Cooperative Self-training of Machine Reading Comprehension | Mar 12, 2021 | Extractive Question-Answering, Machine Reading Comprehension | Code Available | 1 |
| LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention | Oct 2, 2020 | Common Sense Reasoning, Entity Typing | Code Available | 1 |
| MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering | Jul 30, 2020 | Extractive Question-Answering, Natural Questions | Code Available | 1 |
| Look at the First Sentence: Position Bias in Question Answering | Apr 30, 2020 | Extractive Question-Answering, Position | Code Available | 1 |
| MLQA: Evaluating Cross-lingual Extractive Question Answering | Oct 16, 2019 | Articles, Extractive Question-Answering | Code Available | 1 |
| HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-Answering, Hallucination | Unverified | 0 |
| When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD | Mar 24, 2025 | Adversarial Robustness, Extractive Question-Answering | Unverified | 0 |
| On Mechanistic Circuits for Extractive Question-Answering | Feb 12, 2025 | Extractive Question-Answering, Language Modeling | Unverified | 0 |
| FoQA: A Faroese Question-Answering Dataset | Feb 11, 2025 | Articles, Extractive Question-Answering | Unverified | 0 |