SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Math
Math
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 381–390 of 1596 papers
Title
Date
Tasks
Status
Hype
DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation
Feb 19, 2025
Diversity
Extreme Summarization
—
Unverified
0
The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Feb 19, 2025
Math
—
Unverified
0
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation
Feb 19, 2025
Dataset Generation
GSM8K
Code
Code Available
0
Reasoning with Reinforced Functional Token Tuning
Feb 19, 2025
Math
Code
Code Available
1
Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization
Feb 18, 2025
Math
—
Unverified
0
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions
Feb 18, 2025
Knowledge Distillation
Math
—
Unverified
0
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees
Feb 18, 2025
Math
—
Unverified
0
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks
Feb 18, 2025
Math
Memorization
—
Unverified
0
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Feb 18, 2025
Math
Code
Code Available
2
Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation
Feb 18, 2025
Diversity
Math
—
Unverified
0
Show:
10
25
50
← Prev
Page 39 of 160
Next →
No leaderboard results yet.