SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 811–820 of 1596 papers

Title	Date	Tasks	Status	Hype
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Jan 21, 2025	GSM8KIn-Context Learning	—Unverified	0
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests	Jan 21, 2025	Math	—Unverified	0
RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?	Jan 20, 2025	MathReinforcement Learning (RL)	—Unverified	0
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective	Jan 19, 2025	Automated Theorem ProvingMath	—Unverified	0
Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis	Jan 18, 2025	cognitive diagnosisMath	CodeCode Available	0
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback	Jan 18, 2025	MathMathematical Reasoning	—Unverified	0
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision	Jan 14, 2025	Instruction FollowingMath	CodeCode Available	0
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Jan 14, 2025	GSM8KMath	CodeCode Available	0
Can Vision-Language Models Evaluate Handwritten Math?	Jan 13, 2025	Math	CodeCode Available	0
Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Jan 10, 2025	Math	—Unverified	0

Show:10 25 50

← PrevPage 82 of 160Next →

No leaderboard results yet.