SOTAVerified

Math

Papers

Showing 226250 of 1596 papers

TitleStatusHype
Flaming-hot Initiation with Regular Execution Sampling for Large Language ModelsCode2
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical ReasoningCode2
Archon: An Architecture Search Framework for Inference-Time TechniquesCode2
AbstentionBench: Reasoning LLMs Fail on Unanswerable QuestionsCode2
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPOCode2
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
Memorizing TransformersCode2
On the Emergence of Thinking in LLMs I: Searching for the Right IntuitionCode2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language ModelsCode2
Expression Syntax Information Bottleneck for Math Word ProblemsCode1
M1: Towards Scalable Test-Time Compute with Mamba Reasoning ModelsCode1
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMsCode1
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo MethodsCode1
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?Code1
Explaining Datasets in Words: Statistical Models with Natural Language ParametersCode1
Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for EducationCode1
A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement LearningCode1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy PreservationCode1
Evolving Prompts In-Context: An Open-ended, Self-replicating PerspectiveCode1
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language ModelsCode1
EXAONE Deep: Reasoning Enhanced Language ModelsCode1
LoRA Soups: Merging LoRAs for Practical Skill Composition TasksCode1
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language ModelsCode1
Building Dataset for Grounding of Formulae — Annotating Coreference Relations Among Math IdentifiersCode1
Broken Neural Scaling LawsCode1
Show:102550
← PrevPage 10 of 64Next →

No leaderboard results yet.