| AI-based Arabic Language and Speech Tutor | Oct 22, 2022 | Multiple-choiceSelf-Learning | —Unverified | 0 |
| MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence | May 29, 2025 | Multiple-choiceSpatial Reasoning | —Unverified | 0 |
| VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Modeling of Item-Difficulty for Ontology-based MCQs | Jul 4, 2016 | Multiple-choice | —Unverified | 0 |
| Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs | Dec 31, 2024 | Conformal PredictionDecision Making | —Unverified | 0 |
| More Robots are Coming: Large Multimodal Models (ChatGPT) can Solve Visually Diverse Images of Parsons Problems | Nov 3, 2023 | Multiple-choice | —Unverified | 0 |
| MoReVQA: Exploring Modular Reasoning Models for Video Question Answering | Apr 9, 2024 | EgoSchemaMultiple-choice | —Unverified | 0 |
| Mounting Video Metadata on Transformer-based Language Model for Open-ended Video Question Answering | Aug 11, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AI and Machine Learning for Next Generation Science Assessments | Apr 23, 2024 | Multiple-choice | —Unverified | 0 |
| AGReE: A system for generating Automated Grammar Reading Exercises | Oct 28, 2022 | ArticlesMultiple-choice | —Unverified | 0 |
| MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models | Oct 10, 2024 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| MR. Judge: Multimodal Reasoner as a Judge | May 19, 2025 | MM-VetMultiple-choice | —Unverified | 0 |
| VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment | Jun 16, 2024 | Action UnderstandingBenchmarking | —Unverified | 0 |
| MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling | Mar 10, 2023 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | —Unverified | 0 |
| Multilingual CALL Framework for Automatic Language Exercise Generation from Free Text | Apr 1, 2017 | Multiple-choiceWord Sense Disambiguation | —Unverified | 0 |
| Multi-Modal Retrieval Augmentation for Open-Ended and Knowledge-Intensive Video Question Answering | Feb 17, 2025 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| Multiple Choice Learning for Efficient Speech Separation with Many Speakers | Nov 27, 2024 | Multiple-choiceSpeech Separation | —Unverified | 0 |
| Multiple Choice Learning: Learning to Produce Multiple Structured Outputs | Dec 1, 2012 | Multiple-choicePrediction | —Unverified | 0 |
| Multiple Choice Question Corpus Analysis for Distractor Characterization | May 1, 2014 | Multiple-choiceReading Comprehension | —Unverified | 0 |
| Multiple-Choice Question Generation: Towards an Automated Assessment Framework | Sep 23, 2022 | DiversityMultiple-choice | —Unverified | 0 |
| Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights | Jun 5, 2025 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| Multiple Choice Question Generation Utilizing An Ontology | Sep 1, 2017 | Distractor GenerationMultiple-choice | —Unverified | 0 |
| Versatile Multiple Choice Learning and Its Application to Vision Computing | Jun 1, 2019 | image-classificationImage Classification | —Unverified | 0 |
| Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong | Jan 16, 2025 | Multiple-choice | —Unverified | 0 |
| Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering | Jul 1, 2020 | Domain AdaptationLogical Reasoning | —Unverified | 0 |
| Multi-task Learning with Multi-head Attention for Multi-choice Reading Comprehension | Feb 26, 2020 | Machine Reading ComprehensionMultiple-choice | —Unverified | 0 |
| A Graph-Guided Reasoning Approach for Open-ended Commonsense Question Answering | Mar 18, 2023 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts | Feb 28, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals | May 26, 2025 | EthicsFairness | —Unverified | 0 |
| Narrative Embedding: Re-Contextualization Through Attention | Nov 1, 2021 | Multiple-choiceQuestion Answering | —Unverified | 0 |
| VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks | Jun 10, 2025 | Multiple-choiceOpen-Ended Question Answering | —Unverified | 0 |
| NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects? | Nov 26, 2024 | AttributeMultiple-choice | —Unverified | 0 |
| AgMMU: A Comprehensive Agricultural Multimodal Understanding and Reasoning Benchmark | Apr 14, 2025 | ManagementMultiple-choice | —Unverified | 0 |
| Network-based Representations and Dynamic Discrete Choice Models for Multiple Discrete Choice Analysis | Jun 7, 2023 | Discrete Choice ModelsMultiple-choice | —Unverified | 0 |
| WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning | May 6, 2024 | Multiple-choiceVideo Understanding | —Unverified | 0 |
| VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation | Nov 20, 2024 | ChatbotMultiple-choice | —Unverified | 0 |
| NEWSKVQA: Knowledge-Aware News Video Question Answering | Feb 8, 2022 | Common Sense ReasoningManagement | —Unverified | 0 |
| Video Instruction Tuning With Synthetic Data | Oct 3, 2024 | 3D Question Answering (3D-QA) | —Unverified | 0 |
| None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering | Mar 3, 2025 | Business EthicsEthics | —Unverified | 0 |
| None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | Feb 18, 2025 | MathMemorization | —Unverified | 0 |
| No Task Left Behind: Multi-Task Learning of Knowledge Tracing and Option Tracing for Better Student Assessment | Apr 8, 2022 | Knowledge TracingMultiple-choice | —Unverified | 0 |
| Note on Combinatorial Engineering Frameworks for Hierarchical Modular Systems | Mar 29, 2013 | Combinatorial OptimizationMultiple-choice | —Unverified | 0 |
| Note on Evolution and Forecasting of Requirements: Communications Example | May 22, 2017 | Multiple-choice | —Unverified | 0 |
| Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning | Aug 30, 2024 | Causal Language ModelingContinual Learning | —Unverified | 0 |
| NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models | Jul 15, 2024 | Common Sense ReasoningMultiple-choice | —Unverified | 0 |
| Objective quantification of mood states using large language models | Feb 13, 2025 | Multiple-choice | —Unverified | 0 |
| OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities | Feb 18, 2025 | Large Language ModelMultiple-choice | —Unverified | 0 |
| OLMES: A Standard for Language Model Evaluations | Jun 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs | Jun 26, 2025 | DiversityMultiple-choice | —Unverified | 0 |
| Online Joint Bid/Daily Budget Optimization of Internet Advertising Campaigns | Mar 3, 2020 | Gaussian ProcessesMultiple-choice | —Unverified | 0 |