SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1176–1200 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Apr 7, 2025	GSM8KMath	—Unverified	0	0
Chimera: Improving Generalist Model with Domain-Specific Experts	Dec 8, 2024	Mathmodel	—Unverified	0	0
Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages	Jan 23, 2025	Instruction FollowingMath	—Unverified	0	0
Classification and Clustering of arXiv Documents, Sections, and Abstracts, Comparing Encodings of Natural and Mathematical Language	May 22, 2020	ClassificationClustering	—Unverified	0	0
Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos	Jan 1, 2023	Contrastive LearningMath	—Unverified	0	0
Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning	Jan 25, 2025	Math	—Unverified	0	0
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis	Jun 2, 2025	8kMath	—Unverified	0	0
ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data	Mar 1, 2024	Math	—Unverified	0	0
CLST: Cold-Start Mitigation in Knowledge Tracing by Aligning a Generative Language Model as a Students' Knowledge Tracer	Jun 13, 2024	Domain GeneralizationKnowledge Tracing	—Unverified	0	0
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?	Jun 29, 2023	Language ModelingLanguage Modelling	—Unverified	0	0
CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models	Jun 28, 2024	DiversityMath	—Unverified	0	0
ChemistryQA: A Complex Question Answering Dataset from Chemistry	Jan 1, 2021	Machine Reading ComprehensionMath	—Unverified	0	0
Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data	Mar 13, 2025	Large Language ModelMath	—Unverified	0	0
CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning	Oct 3, 2024	GSM8KLanguage Modeling	—Unverified	0	0
Code Pretraining Improves Entity Tracking Abilities of Language Models	May 31, 2024	Math	—Unverified	0	0
Cognitive network science reveals bias in GPT-3, ChatGPT, and GPT-4 mirroring math anxiety in high-school students	May 22, 2023	MathText Generation	—Unverified	0	0
Cognitive Noise and Altruistic Preferences	Oct 10, 2024	Math	—Unverified	0	0
System-2 Mathematical Reasoning via Enriched Instruction Tuning	Dec 22, 2024	ERPGSM8K	—Unverified	0	0
Complementing the Linear-Programming Learning Experience with the Design and Use of Computerized Games: The Formula 1 Championship Game	Sep 19, 2021	Math	—Unverified	0	0
Complexity-Based Prompting for Multi-Step Reasoning	Oct 3, 2022	Date UnderstandingGSM8K	—Unverified	0	0
Composing Ensembles of Pre-trained Models via Iterative Consensus	Oct 20, 2022	Arithmetic ReasoningImage Generation	—Unverified	0	0
Compositional Causal Reasoning Evaluation in Language Models	Mar 6, 2025	Math	—Unverified	0	0
ComSearch: Equation Searching with Combinatorial Mathematics for Solving Math Word Problems with Weak Supervision	Nov 16, 2021	Math	—Unverified	0	0
ComSearch: Equation Searching with Combinatorial Mathematics for Solving Math Word Problems with Weak Supervision	Jan 16, 2022	Math	—Unverified	0	0
Tackling Math Word Problems with Fine-to-Coarse Abstracting and Reasoning	May 17, 2022	Math	—Unverified	0	0

Show:10 25 50

← PrevPage 48 of 64Next →

No leaderboard results yet.