Math

Papers


Showing 276–300 of 1596 papers

Title	Date	Tasks	Status	Hype
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models	Apr 14, 2025	Mamba, Math	Code Available	1
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models	Feb 2, 2024	Language Modelling, Large Language Model	Code Available	1
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks	Oct 16, 2024	Math, parameter-efficient fine-tuning	Code Available	1
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers	Dec 7, 2023	Math, Multiple-choice	Code Available	1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation	Mar 25, 2025	Code Completion, Language Modeling	Code Available	1
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs	Jun 24, 2024	Instruction Following, Math	Code Available	1
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models	Jul 5, 2025	Benchmarking, GPU	Code Available	1
A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level	Dec 31, 2021	Few-Shot Learning, Language Modelling	Code Available	1
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation	Jan 24, 2025	Math	Code Available	1
Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start	May 28, 2025	Math, Multimodal Reasoning	Code Available	1
LEVER: Learning to Verify Language-to-Code Generation with Execution	Feb 16, 2023	Arithmetic Reasoning, Code Generation	Code Available	1
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling	Jul 13, 2024	Benchmarking, Math	Code Available	1
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models	May 23, 2023	2k, Math	Code Available	1
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT	Apr 3, 2024	Benchmarking, General Knowledge	Code Available	1
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents	Jun 2, 2023	Elementary Mathematics, Math	Code Available	1
Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction	Mar 19, 2022	Math, Math Word Problem Solving	Code Available	1
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search	Sep 26, 2024	Math, Mathematical Problem-Solving	Code Available	1
Learning Goal-Conditioned Representations for Language Reward Models	Jul 18, 2024	GSM8K, Math	Code Available	1
Learning Multi-Step Reasoning by Solving Arithmetic Tasks	Jun 2, 2023	Math, Mathematical Reasoning	Code Available	1
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability	Nov 29, 2024	GSM8K, Math	Code Available	1
An Early Evaluation of GPT-4V(ision)	Oct 25, 2023	Math	Code Available	1
Let's Verify Math Questions Step by Step	May 20, 2025	Math, Mathematical Reasoning	Code Available	1
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits	Oct 2, 2024	Instruction Following, Math	Code Available	1
Large Language Models Can Be Easily Distracted by Irrelevant Context	Jan 31, 2023	Arithmetic Reasoning, Language Modeling	Code Available	1
Large (Vision) Language Models are Unsupervised In-Context Learners	Apr 3, 2025	GSM8K, In-Context Learning	Code Available	1

