SOTAVerified

Math

Papers

Showing 751-775 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Reasoning with Latent Thoughts: On the Power of Looped Transformers | | 0 |
| DISC: Dynamic Decomposition Improves LLM Inference Scaling | | 0 |
| SBSC: Step-By-Step Coding for Improving Mathematical Olympiad Performance | | 0 |
| Inference Computation Scaling for Feature Augmentation in Recommendation Systems | | 0 |
| Does Reasoning Introduce Bias? A Study of Social Bias Evaluation and Mitigation in LLM Reasoning | | 0 |
| The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer | Code | 0 |
| Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective | Code | 0 |
| A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics | | 0 |
| GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Code | 0 |
| CER: Confidence Enhanced Reasoning in LLMs | Code | 0 |
| TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation | Code | 0 |
| BeamLoRA: Beam-Constraint Low-Rank Adaptation | | 0 |
| DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation | | 0 |
| The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding? | | 0 |
| Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees | | 0 |
| Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization | | 0 |
| NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions | | 0 |
| Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation | | 0 |
| None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | | 0 |
| Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding | | 0 |
| Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation | Code | 0 |
| Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | | 0 |
| Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | | 0 |
| A Study on Leveraging Search and Self-Feedback for Agent Reasoning | | 0 |

No leaderboard results yet.