Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–1000 of 1596 papers

Title	Date	Tasks	Status
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Jan 28, 2025	MathMathematical Problem-Solving	—Unverified
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Jan 30, 2025	Language ModelingLanguage Modelling	—Unverified
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning	Dec 7, 2023	In-Context LearningMath	—Unverified
Benchmarking and Improving Generator-Validator Consistency of Language Models	Oct 3, 2023	BenchmarkingInstruction Following	—Unverified
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models	Oct 2, 2024	Cross-Lingual TransferMath	—Unverified
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks	Mar 14, 2024	MathSkill Generalization	—Unverified
BeamLoRA: Beam-Constraint Low-Rank Adaptation	Feb 19, 2025	Code GenerationMath	—Unverified
Basic concepts, definitions, and methods in D number theory	Mar 21, 2020	Math	—Unverified
Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization	Feb 18, 2025	Math	—Unverified
Backward bifurcation and saddle-node bifurcation in virus-immune dynamics	Dec 1, 2021	Math	—Unverified
Learning Autonomous Code Integration for Math Language Models	Feb 2, 2025	Math	—Unverified
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs	May 24, 2024	In-Context LearningLanguage Modeling	—Unverified
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Nov 25, 2024	MathMath Word Problem Solving	—Unverified
Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models	Jul 12, 2024	GSM8KMath	—Unverified
Learning Fine-Grained Expressions to Solve Math Word Problems	Sep 1, 2017	MathMath Word Problem Solving	—Unverified
Learning from Peers in Reasoning Models	May 12, 2025	Math	—Unverified
Activation Functions Considered Harmful: Recovering Neural Network Weights through Controlled Channels	Mar 24, 2025	Math	—Unverified
What Makes a Good Dataset for Symbol Description Reading?	Apr 17, 2023	document understandingMath	—Unverified
Learning Hierarchical Structures On-The-Fly with a Recurrent-Recursive Model for Sequences	Jul 1, 2018	Language ModelingLanguage Modelling	—Unverified
151 Estrategias de Trading (151 Trading Strategies)	Nov 14, 2019	DescriptiveMath	—Unverified
Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy	Sep 26, 2024	Knowledge TracingMath	—Unverified
Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision	May 21, 2025	GSM8KLearning-To-Rank	—Unverified
Learning to Reason Across Parallel Samples for LLM Reasoning	Jun 10, 2025	MathRe-Ranking	—Unverified
Backup Control Barrier Functions: Formulation and Comparative Study	Apr 22, 2021	Math	—Unverified
WARM: A Weakly (+Semi) Supervised Model for Solving Math word Problems	Apr 14, 2021	Math	—Unverified
Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator	Dec 12, 2024	Math	—Unverified
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models	May 21, 2022	Arithmetic ReasoningMath	—Unverified
Les mathématiques de la langue : l'approche formelle de Montague	May 16, 2014	Math	—Unverified
Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning	Jun 25, 2023	counterfactualMath	—Unverified
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability	May 29, 2025	MathMathematical Reasoning	—Unverified
Let's Reinforce Step by Step	Nov 10, 2023	GSM8KLogical Reasoning	—Unverified
Let's reward step by step: Step-Level reward model as the Navigators for Reasoning	Oct 16, 2023	Code GenerationGSM8K	—Unverified
Auto-regressive Text Generation with Pre-Trained Language Models: An Empirical Study on Question-type Short Text Generation	Jan 16, 2022	MathText Generation	—Unverified
Leveraging Affect Transfer Learning for Behavior Prediction in an Intelligent Tutoring System	Feb 12, 2020	MathTransfer Learning	—Unverified
Leveraging LLMs to Assess Tutor Moves in Real-Life Dialogues: A Feasibility Study	Jun 20, 2025	Math	—Unverified
Leveraging Multimodal Dialog Technology for the Design of Automated and Interactive Student Agents for Teacher Training	Jul 1, 2018	Math	—Unverified
Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models	Sep 11, 2024	Language ModellingLarge Language Model	—Unverified
Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications	Feb 14, 2024	Math	—Unverified
LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach	Mar 5, 2025	Instruction FollowingMath	—Unverified
Limits of an AI program for solving college math problems	Aug 14, 2022	Few-Shot LearningMath	—Unverified
Automatized Evaluation of Formalization Exercises in Mathematics	Jun 2, 2020	MathSentence	—Unverified
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks	Dec 17, 2024	Math	—Unverified
Automatic tagging of knowledge points for K12 math problems	Aug 21, 2022	ClassificationMath	—Unverified
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers	Jun 5, 2025	GSM8KMath	—Unverified
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ	Sep 25, 2024	ChatbotGSM8K	—Unverified
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	MathMisconceptions	—Unverified
Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications	May 14, 2024	GSM8KMath	—Unverified
LLMs as Potential Brainstorming Partners for Math and Science Problems	Oct 10, 2023	Math	—Unverified
Automatic Generation of High Quality CCGbanks for Parser Domain Adaptation	Jun 5, 2019	Domain AdaptationMath	—Unverified
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought	May 9, 2024	HallucinationMath	—Unverified

Show:10 25 50

← PrevPage 20 of 32Next →

No leaderboard results yet.