SOTAVerified

Math

Papers

Showing 1526-1550 of 1596 papers

| Title | Status | Hype |
|---|---|---|
| Tighter 'uniform bounds for Black-Scholes implied volatility' and the applications to root-finding | | 0 |
| Language Models with Conformal Factuality Guarantees | | 0 |
| TinyGSM: achieving >80% on GSM8k with small language models | | 0 |
| YODA: Teacher-Student Progressive Learning for Language Models | | 0 |
| Large Language Models Are Struggle to Cope with Unreasonability in Math Problems | | 0 |
| Large Language Models as Analogical Reasoners | | 0 |
| 1bit-Merging: Dynamic Quantized Merging for Large Language Models | | 0 |
| Large Language Models Can Self-Correct with Key Condition Verification | | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | | 0 |
| Large Language Models' Understanding of Math: Source Criticism and Extrapolation | | 0 |
| Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | | 0 |
| Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | | 0 |
| LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning | | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | | 0 |
| Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models | | 0 |
| Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks | | 0 |
| BeamLoRA: Beam-Constraint Low-Rank Adaptation | | 0 |
| Basic concepts, definitions, and methods in D number theory | | 0 |
| Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization | | 0 |
| Backward bifurcation and saddle-node bifurcation in virus-immune dynamics | | 0 |
| Learning Autonomous Code Integration for Math Language Models | | 0 |
| Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs | | 0 |
| Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval | | 0 |
| Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models | | 0 |

No leaderboard results yet.