| LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion | Jan 25, 2025 | Multiple-choice, Reading Comprehension | —Unverified | 0 |
| LookAlike: Consistent Distractor Generation in Math MCQs | May 3, 2025 | Distractor Generation, Math | —Unverified | 0 |
| Looking Beyond Sentence-Level Natural Language Inference for Question Answering and Text Summarization | Jun 1, 2021 | Multiple-choice, Natural Language Inference | —Unverified | 0 |
| Looking Beyond Short-Premise Natural Language Inference for Downstream Tasks | Dec 4, 2020 | Multiple-choice, Natural Language Inference | —Unverified | 0 |
| Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning | May 1, 2022 | Multiple-choice, Question Answering | —Unverified | 0 |
| Make a Choice! Knowledge Base Question Answering with In-Context Learning | May 23, 2023 | In-Context Learning, Knowledge Base Question Answering | —Unverified | 0 |
| Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over Contextual Embedding | Apr 17, 2019 | Multiple-choice | —Unverified | 0 |
| MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects | Dec 6, 2024 | 2k, Anomaly Detection | —Unverified | 0 |
| Unsupervised multiple choices question answering via universal corpus | Feb 27, 2024 | Form, Knowledge Graphs | —Unverified | 0 |
| MateInfoUB: A Real-World Benchmark for Testing LLMs in Competitive, Multilingual, and Multimodal Educational Tasks | Jul 3, 2025 | Fairness, Multiple-choice | —Unverified | 0 |