| Which Shortcut Solution Do Question Answering Models Prefer to Learn? | Nov 29, 2022 | Multiple-choiceQuestion Answering | CodeCode Available | 0 |
| From Recognition to Cognition: Visual Commonsense Reasoning | Nov 27, 2018 | Multiple-choiceMultiple Choice Question Answering (MCQA) | CodeCode Available | 0 |
| FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding | Jan 1, 2025 | Action RecognitionMultiple-choice | CodeCode Available | 0 |
| LLaVA-OneVision: Easy Visual Task Transfer | Aug 6, 2024 | 3D Question Answering (3D-QA) | CodeCode Available | 0 |
| Fusing Models with Complementary Expertise | Oct 2, 2023 | Multiple-choicetext-classification | CodeCode Available | 0 |
| A Benchmark for Long-Form Medical Question Answering | Nov 14, 2024 | Answer GenerationForm | CodeCode Available | 0 |
| Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation | Mar 20, 2025 | Multiple-choiceText Generation | CodeCode Available | 0 |
| ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding | Aug 1, 2023 | Intent DetectionMultiple-choice | CodeCode Available | 0 |
| NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA | Apr 4, 2024 | Multiple-choice | CodeCode Available | 0 |
| Gendered Pronoun Resolution using BERT and an extractive question answering formulation | Jun 9, 2019 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |