| Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use | Apr 7, 2025 | GSM8KMath | —Unverified | 0 | 0 |
| Chimera: Improving Generalist Model with Domain-Specific Experts | Dec 8, 2024 | Mathmodel | —Unverified | 0 | 0 |
| Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages | Jan 23, 2025 | Instruction FollowingMath | —Unverified | 0 | 0 |
| Classification and Clustering of arXiv Documents, Sections, and Abstracts, Comparing Encodings of Natural and Mathematical Language | May 22, 2020 | ClassificationClustering | —Unverified | 0 | 0 |
| Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos | Jan 1, 2023 | Contrastive LearningMath | —Unverified | 0 | 0 |
| Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning | Jan 25, 2025 | Math | —Unverified | 0 | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8kMath | —Unverified | 0 | 0 |
| ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data | Mar 1, 2024 | Math | —Unverified | 0 | 0 |
| CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer | Jun 13, 2024 | Domain GeneralizationKnowledge Tracing | —Unverified | 0 | 0 |
| CMATH: Can Your Language Model Pass Chinese Elementary School Math Test? | Jun 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models | Jun 28, 2024 | DiversityMath | —Unverified | 0 | 0 |
| ChemistryQA: A Complex Question Answering Dataset from Chemistry | Jan 1, 2021 | Machine Reading ComprehensionMath | —Unverified | 0 | 0 |
| Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data | Mar 13, 2025 | Large Language ModelMath | —Unverified | 0 | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8KLanguage Modeling | —Unverified | 0 | 0 |
| Code Pretraining Improves Entity Tracking Abilities of Language Models | May 31, 2024 | Math | —Unverified | 0 | 0 |
| Cognitive network science reveals bias in GPT-3, ChatGPT, and GPT-4 mirroring math anxiety in high-school students | May 22, 2023 | MathText Generation | —Unverified | 0 | 0 |
| Cognitive Noise and Altruistic Preferences | Oct 10, 2024 | Math | —Unverified | 0 | 0 |
| System-2 Mathematical Reasoning via Enriched Instruction Tuning | Dec 22, 2024 | ERPGSM8K | —Unverified | 0 | 0 |
| Complementing the Linear-Programming Learning Experience with the Design and Use of Computerized Games: The Formula 1 Championship Game | Sep 19, 2021 | Math | —Unverified | 0 | 0 |
| Complexity-Based Prompting for Multi-Step Reasoning | Oct 3, 2022 | Date UnderstandingGSM8K | —Unverified | 0 | 0 |
| Composing Ensembles of Pre-trained Models via Iterative Consensus | Oct 20, 2022 | Arithmetic ReasoningImage Generation | —Unverified | 0 | 0 |
| Compositional Causal Reasoning Evaluation in Language Models | Mar 6, 2025 | Math | —Unverified | 0 | 0 |
| ComSearch: Equation Searching with Combinatorial Mathematics for Solving Math Word Problems with Weak Supervision | Nov 16, 2021 | Math | —Unverified | 0 | 0 |
| ComSearch: Equation Searching with Combinatorial Mathematics for Solving Math Word Problems with Weak Supervision | Jan 16, 2022 | Math | —Unverified | 0 | 0 |
| Tackling Math Word Problems with Fine-to-Coarse Abstracting and Reasoning | May 17, 2022 | Math | —Unverified | 0 | 0 |