SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 526–550 of 1596 papers

Title	Date	Tasks	Status	Hype
Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation	May 29, 2025	GSM8KMath	—Unverified	0
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions	Dec 12, 2024	GSM8KKnowledge Graphs	—Unverified	0
HyperCLOVA X Technical Report	Apr 2, 2024	Instruction FollowingMachine Translation	—Unverified	0
Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics	Apr 24, 2025	Code GenerationMath	—Unverified	0
Human Learning about AI	Jun 8, 2024	Math	—Unverified	0
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams	Nov 7, 2024	Math	—Unverified	0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active LearningMath	—Unverified	0
Hydrodynamics of Markets:Hidden Links Between Physics and Finance	Mar 14, 2024	Math	—Unverified	0
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models	Feb 17, 2025	Math	—Unverified	0
Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations	Jun 27, 2019	Math	—Unverified	0
Can I understand what I create? Self-Knowledge Evaluation of Large Language Models	Jun 10, 2024	Math	—Unverified	0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate	May 22, 2023	BenchmarkingMath	—Unverified	0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio	Sep 10, 2024	Emotional IntelligenceMath	—Unverified	0
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Jan 26, 2025	MathMathematical Reasoning	—Unverified	0
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning	May 22, 2025	Mathreinforcement-learning	—Unverified	0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation	Aug 1, 2016	Community Question AnsweringMath	—Unverified	0
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Oct 28, 2024	ARCMath	—Unverified	0
Approximation properties of Residual Neural Networks for Kolmogorov PDEs	Oct 30, 2021	image-classificationImage Classification	—Unverified	0
Entropy Martingale Optimal Transport and Nonlinear Pricing-Hedging Duality	May 26, 2020	Math	—Unverified	0
Calculus on MDPs: Potential Shaping as a Gradient	Aug 20, 2022	Math	—Unverified	0
Approximating Sparse PCA from Incomplete Data	Mar 12, 2015	Math	—Unverified	0
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Apr 16, 2025	GSM8KMath	—Unverified	0
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems	Mar 18, 2025	CPUMath	—Unverified	0
Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference	Feb 5, 2025	Computational EfficiencyLanguage Modeling	—Unverified	0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity	Aug 29, 2024	Code GenerationDiversity	—Unverified	0

Show:10 25 50

← PrevPage 22 of 64Next →

No leaderboard results yet.