| Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning | Jan 25, 2025 | Math | —Unverified | 0 | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8k, Math | —Unverified | 0 | 0 |
| ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data | Mar 1, 2024 | Math | —Unverified | 0 | 0 |
| CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer | Jun 13, 2024 | Domain Generalization, Knowledge Tracing | —Unverified | 0 | 0 |
| CMATH: Can Your Language Model Pass Chinese Elementary School Math Test? | Jun 29, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models | Jun 28, 2024 | Diversity, Math | —Unverified | 0 | 0 |
| ChemistryQA: A Complex Question Answering Dataset from Chemistry | Jan 1, 2021 | Machine Reading Comprehension, Math | —Unverified | 0 | 0 |
| Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data | Mar 13, 2025 | Large Language Model, Math | —Unverified | 0 | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8K, Language Modeling | —Unverified | 0 | 0 |
| Code Pretraining Improves Entity Tracking Abilities of Language Models | May 31, 2024 | Math | —Unverified | 0 | 0 |