SOTAVerified

Math

Papers

Showing 201225 of 1596 papers

TitleStatusHype
Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping0
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach0
Learning from Peers in Reasoning Models0
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem SolvingCode2
S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models0
Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model ReasoningCode1
DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs0
xGen-small Technical Report0
Generative Discovery of Partial Differential Equations by Learning from Math Handbooks0
Scalable LLM Math Reasoning Acceleration with Low-rank Distillation0
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers0
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning0
RM-R1: Reward Modeling as ReasoningCode2
Generating Narrated Lecture Videos from Slides with Synchronized Highlights0
Rewriting Pre-Training Data Boosts LLM Performance in Math and CodeCode1
A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law0
LookAlike: Consistent Distractor Generation in Math MCQs0
TutorGym: A Testbed for Evaluating AI Agents as Tutors and StudentsCode0
DeepCritic: Deliberate Critique with Large Language ModelsCode1
NeMo-Inspector: A Visualization Tool for LLM Generation AnalysisCode1
LLMs Do Not Have Human-Like Working Memory0
Phi-4-reasoning Technical Report0
AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models0
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math0
Local Prompt Optimization0
Show:102550
← PrevPage 9 of 64Next →

No leaderboard results yet.