| SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge | May 27, 2025 | BenchmarkingMultiple-choice | —Unverified | 0 | 0 |
| Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers | Nov 28, 2024 | Image Captioningimage-classification | —Unverified | 0 | 0 |
| Spending Money Wisely: Online Electronic Coupon Allocation based on Real-Time User Intent Detection | Aug 23, 2020 | Intent DetectionMultiple-choice | —Unverified | 0 | 0 |
| VUDG: A Dataset for Video Understanding Domain Generalization | May 30, 2025 | Domain GeneralizationMultiple-choice | —Unverified | 0 | 0 |
| SPRITE: A Response Model For Multiple Choice Testing | Jan 12, 2015 | modelMultiple-choice | —Unverified | 0 | 0 |
| Weighted Global Normalization for Multiple Choice Reading Comprehension over Long Documents | Dec 5, 2018 | Answer SelectionMultiple-choice | —Unverified | 0 | 0 |
| Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets | Aug 4, 2024 | Few-Shot LearningMachine Reading Comprehension | —Unverified | 0 | 0 |
| Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework | Mar 7, 2025 | Conformal PredictionMedical Question Answering | —Unverified | 0 | 0 |
| Statistically Profiling Biases in Natural Language Reasoning Datasets and Models | Feb 9, 2021 | Multiple-choiceNatural Language Understanding | —Unverified | 0 | 0 |
| Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem | Feb 13, 2013 | Information RetrievalMultiple-choice | —Unverified | 0 | 0 |