Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 1596 papers

Title	Date	Tasks	Status	Hype
Large (Vision) Language Models are Unsupervised In-Context Learners	Apr 3, 2025	GSM8KIn-Context Learning	CodeCode Available	1
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models	May 23, 2024	Knowledge DistillationMath	CodeCode Available	1
Explaining Datasets in Words: Statistical Models with Natural Language Parameters	Sep 13, 2024	ClusteringLanguage Modeling	CodeCode Available	1
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding	Jun 13, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Mar 2, 2024	MathMisconceptions	CodeCode Available	1
From GAN to WGAN	Apr 18, 2019	Generative Adversarial NetworkMath	CodeCode Available	1
EXAONE Deep: Reasoning Enhanced Language Models	Mar 16, 2025	Math	CodeCode Available	1
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective	Jun 22, 2025	In-Context LearningLarge Language Model	CodeCode Available	1
Injecting Numerical Reasoning Skills into Language Models	Apr 9, 2020	Data AugmentationDecoder	CodeCode Available	1
Language Models as Science Tutors	Feb 16, 2024	GSM8KMath	CodeCode Available	1
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data	Feb 14, 2024	Automated Theorem ProvingLanguage Modelling	CodeCode Available	1
Self-Training Elicits Concise Reasoning in Large Language Models	Feb 27, 2025	GSM8KIn-Context Learning	CodeCode Available	1
Examining the Robustness of Large Language Models across Language Complexity	Jan 30, 2025	Math	—Unverified	0
Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil	Aug 9, 2024	MathMultiple-choice	—Unverified	0
Can Stories Help LLMs Reason? Curating Information Space Through Narrative	Oct 25, 2024	Math	—Unverified	0
Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization	Feb 8, 2025	GSM8KMath	—Unverified	0
Can LLMs understand Math? -- Exploring the Pitfalls in Mathematical Reasoning	May 21, 2025	MathMathematical Reasoning	—Unverified	0
A range characterization of the single-quadrant ADRT	Oct 11, 2020	Math	—Unverified	0
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages	Feb 12, 2024	Automated Theorem ProvingBenchmarking	—Unverified	0
Illinois Math Solver: Math Reasoning on the Web	Jun 1, 2016	MathMath Word Problem Solving	—Unverified	0
AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models	May 25, 2025	MathMathematical Reasoning	—Unverified	0
Identifying equivalent Calabi--Yau topologies: A discrete challenge from math and physics for machine learning	Feb 15, 2022	BIG-bench Machine LearningMath	—Unverified	0
Improve Mathematical Reasoning in Language Models by Automated Process Supervision	Jun 5, 2024	GSM8KMath	—Unverified	0
Evaluating the Design Features of an Intelligent Tutoring System for Advanced Mathematics Learning	Dec 23, 2024	Math	—Unverified	0
Evaluating Robustness of Reward Models for Mathematical Reasoning	Oct 2, 2024	MathMathematical Reasoning	—Unverified	0
Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation	May 29, 2025	GSM8KMath	—Unverified	0
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions	Dec 12, 2024	GSM8KKnowledge Graphs	—Unverified	0
HyperCLOVA X Technical Report	Apr 2, 2024	Instruction FollowingMachine Translation	—Unverified	0
Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics	Apr 24, 2025	Code GenerationMath	—Unverified	0
Human Learning about AI	Jun 8, 2024	Math	—Unverified	0
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams	Nov 7, 2024	Math	—Unverified	0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active LearningMath	—Unverified	0
Hydrodynamics of Markets:Hidden Links Between Physics and Finance	Mar 14, 2024	Math	—Unverified	0
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models	Feb 17, 2025	Math	—Unverified	0
Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations	Jun 27, 2019	Math	—Unverified	0
Can I understand what I create? Self-Knowledge Evaluation of Large Language Models	Jun 10, 2024	Math	—Unverified	0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate	May 22, 2023	BenchmarkingMath	—Unverified	0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio	Sep 10, 2024	Emotional IntelligenceMath	—Unverified	0
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Jan 26, 2025	MathMathematical Reasoning	—Unverified	0
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning	May 22, 2025	Mathreinforcement-learning	—Unverified	0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation	Aug 1, 2016	Community Question AnsweringMath	—Unverified	0
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Oct 28, 2024	ARCMath	—Unverified	0
Approximation properties of Residual Neural Networks for Kolmogorov PDEs	Oct 30, 2021	image-classificationImage Classification	—Unverified	0
Entropy Martingale Optimal Transport and Nonlinear Pricing-Hedging Duality	May 26, 2020	Math	—Unverified	0
Calculus on MDPs: Potential Shaping as a Gradient	Aug 20, 2022	Math	—Unverified	0
Approximating Sparse PCA from Incomplete Data	Mar 12, 2015	Math	—Unverified	0
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Apr 16, 2025	GSM8KMath	—Unverified	0
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems	Mar 18, 2025	CPUMath	—Unverified	0
Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference	Feb 5, 2025	Computational EfficiencyLanguage Modeling	—Unverified	0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity	Aug 29, 2024	Code GenerationDiversity	—Unverified	0

Show:10 25 50

← PrevPage 11 of 32Next →

No leaderboard results yet.