| CP-Router: An Uncertainty-Aware Router Between LLM and LRM | May 26, 2025 | Conformal PredictionLogical Reasoning | —Unverified | 0 | 0 |
| Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia | Sep 13, 2024 | MathMultiple-choice | —Unverified | 0 | 0 |
| CroaTPAS: A Survey-based Evaluation | Jun 1, 2022 | Multiple-choiceSurvey | —Unverified | 0 | 0 |
| Template Filling for Controllable Commonsense Reasoning | Oct 31, 2021 | Multiple-choice | —Unverified | 0 | 0 |
| Crowd Labeling: a survey | Jan 13, 2013 | Multiple-choiceSurvey | —Unverified | 0 | 0 |
| Crowdsourcing Multiple Choice Science Questions | Jul 19, 2017 | DiversityMultiple-choice | —Unverified | 0 | 0 |
| CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task | May 17, 2020 | Multiple-choiceNatural Language Inference | —Unverified | 0 | 0 |
| CSReader at SemEval-2018 Task 11: Multiple Choice Question Answering as Textual Entailment | Jun 1, 2018 | Common Sense ReasoningLanguage Modelling | —Unverified | 0 | 0 |
| Tokenization Standards for Linguistic Integrity: Turkish as a Benchmark | Feb 10, 2025 | MMLUMorphological Analysis | —Unverified | 0 | 0 |
| A Neural Question Answering Model Based on Semi-Structured Tables | Aug 1, 2018 | Knowledge GraphsMultiple-choice | —Unverified | 0 | 0 |