SOTAVerified

Math

Papers

Showing 76100 of 1596 papers

TitleStatusHype
Rectified Sparse Attention0
OpenThoughts: Data Recipes for Reasoning ModelsCode7
Generating Pedagogically Meaningful Visuals for Math Word Problems: A New Benchmark and Analysis of Text-to-Image ModelsCode1
MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching0
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem0
Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains0
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-TuningCode0
The Surprising Effectiveness of Negative Reinforcement in LLM ReasoningCode2
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis0
STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent FrameworkCode1
GThinker: Towards General Multimodal Reasoning via Cue-Guided RethinkingCode0
SiLVR: A Simple Language-based Video Reasoning FrameworkCode1
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic TasksCode1
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language ModelsCode0
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning0
A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource SettingsCode1
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking0
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM ReasoningCode1
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language ReasoningCode7
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability0
Discriminative Policy Optimization for Token-Level Reward ModelsCode0
Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation0
PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics0
Matryoshka Model Learning for Improved Elastic Student Models0
Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models0
Show:102550
← PrevPage 4 of 64Next →

No leaderboard results yet.