Math

Papers

Showing 851–900 of 1596 papers

Title	Date	Tasks	Status
MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time	May 25, 2024	GSM8K, Math	Unverified
Mining Commonsense and Domain Knowledge from Math Word Problems	Oct 1, 2021	Math	Unverified
Mining Math Conjectures from LLMs: A Pruning Approach	Dec 9, 2024	Math	Unverified
Assignment Flows for Data Labeling on Graphs: Convergence and Stability	Feb 26, 2020	General Classification, Math	Unverified
Assessing the impact of social activity permissiveness on the COVID-19 infection curve of several countries	Jun 8, 2021	Math	Unverified
Mixture of Parrots: Experts improve memorization more than reasoning	Oct 24, 2024	Math, Memorization	Unverified
ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain	Mar 28, 2024	Math	Unverified
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Feb 13, 2025	Benchmarking, Math	Unverified
Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities	Dec 22, 2023	Chatbot, GSM8K	Unverified
MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems	Jun 2, 2022	Decoder, Math	Unverified
MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis	Dec 11, 2024	Math	Unverified
Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions	Jun 1, 2023	Math	Unverified
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test	Jun 26, 2025	Code Generation, Large Language Model	Unverified
Modeling Student Response Times: Towards Efficient One-on-one Tutoring Dialogues	Nov 1, 2018	Math	Unverified
Modelling silicosis: dynamics of a model with piecewise constant rate coefficients	Sep 2, 2021	Math	Unverified
Models Can and Should Embrace the Communicative Nature of Human-Generated Math	Sep 25, 2024	Math	Unverified
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models	Feb 20, 2024	Common Sense Reasoning, Contrastive Learning	Unverified
MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities	May 17, 2025	Math	Unverified
Assessing and Verifying Task Utility in LLM-Powered Applications	May 3, 2024	Math	Unverified
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models	May 23, 2025	Diagnostic, Hallucination	Unverified
MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors	May 24, 2025	Language Modeling, Language Modelling	Unverified
Multi-lingual Functional Evaluation for Large Language Models	Jun 25, 2025	Belebele, Instruction Following	Unverified
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach	May 12, 2025	Math, Multi-Task Learning	Unverified
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach	Sep 1, 2020	Math, Math Word Problem Solving	Unverified
Multi-Stage Pre-Training for Math-Understanding: μ²(AL)BERT	Jan 16, 2022	Language Modeling, Language Modelling	Unverified
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees	Feb 18, 2025	Math	Unverified
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision	Feb 5, 2024	GSM8K, Math	Unverified
Multi-tool Integration Application for Math Reasoning Using Large Language Model	Aug 22, 2024	Language Modeling, Language Modelling	Unverified
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions	Dec 22, 2024	GSM8K, Math	Unverified
Tree-structured Decoding for Solving Math Word Problems	Nov 1, 2019	Math	Unverified
A Rule-Based Computational Model of Cognitive Arithmetic	May 3, 2017	Math, model	Unverified
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts	Feb 28, 2025	Math, Mathematical Reasoning	Unverified
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions	May 26, 2025	Attribute, Math	Unverified
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jan 16, 2022	Language Modeling, Language Modelling	Unverified
MWPRanker: An Expression Similarity Based Math Word Problem Retriever	Jul 3, 2023	Logical Sequence, Math	Unverified
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active Learning, Math	Unverified
TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games	Jun 11, 2025	Logical Reasoning, Math	Unverified
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions	Feb 18, 2025	Knowledge Distillation, Math	Unverified
Natural- to formal-language generation using Tensor Product Representations	Sep 25, 2019	Decoder, Math	Unverified
Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems	Jun 18, 2024	In-Context Learning, Math	Unverified
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning	May 22, 2025	Math, reinforcement-learning	Unverified
NEOLAF, an LLM-powered neural-symbolic cognitive architecture	Aug 8, 2023	Incremental Learning, Math	Unverified
"Turing Tests" For An AI Scientist	May 22, 2024	AI Agent, Data Compression	Unverified
Network psychometrics and cognitive network science open new ways for detecting, understanding and tackling the complexity of math anxiety: A review	Aug 31, 2021	Math	Unverified
Neural Math Word Problem Solver with Reinforcement Learning	Aug 1, 2018	Feature Engineering, Math	Unverified
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension	May 1, 2020	Data Augmentation, Math	Unverified
Neuro-Symbolic Data Generation for Math Reasoning	Dec 6, 2024	Diversity, Math	Unverified
NLU for Game-based Learning in Real: Initial Evaluations	May 27, 2022	Intent Recognition, Math	Unverified
No Free Lunch: Rethinking Internal Feedback for LLM Reasoning	Jun 20, 2025	Math, reinforcement-learning	Unverified
Arithmetic Reasoning with LLM: Prolog Generation & Permutation	May 28, 2024	Arithmetic Reasoning, Data Augmentation	Unverified

