Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 801–850 of 1596 papers

Title	Date	Tasks	Status
Pheromone-based Learning of Optimal Reasoning Paths	Jan 31, 2025	ARCGSM8K	—Unverified
PixelWorld: Towards Perceiving Everything as Pixels	Jan 31, 2025	Math	—Unverified
Fairshare Data Pricing via Data Valuation for Large Language Models	Jan 31, 2025	Data ValuationMath	—Unverified
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Jan 30, 2025	Language ModelingLanguage Modelling	—Unverified
Examining the Robustness of Large Language Models across Language Complexity	Jan 30, 2025	Math	—Unverified
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Jan 28, 2025	MathMathematical Problem-Solving	—Unverified
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Jan 26, 2025	MathMathematical Reasoning	—Unverified
Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning	Jan 25, 2025	Math	—Unverified
DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images	Jan 24, 2025	Math	—Unverified
Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages	Jan 23, 2025	Instruction FollowingMath	—Unverified
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Jan 21, 2025	GSM8KIn-Context Learning	—Unverified
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests	Jan 21, 2025	Math	—Unverified
RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?	Jan 20, 2025	MathReinforcement Learning (RL)	—Unverified
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective	Jan 19, 2025	Automated Theorem ProvingMath	—Unverified
Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis	Jan 18, 2025	cognitive diagnosisMath	CodeCode Available
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback	Jan 18, 2025	MathMathematical Reasoning	—Unverified
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision	Jan 14, 2025	Instruction FollowingMath	CodeCode Available
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Jan 14, 2025	GSM8KMath	CodeCode Available
Can Vision-Language Models Evaluate Handwritten Math?	Jan 13, 2025	Math	CodeCode Available
Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Jan 10, 2025	Math	—Unverified
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction	Jan 9, 2025	MathSentence	CodeCode Available
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications	Jan 9, 2025	MathRAG	—Unverified
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	Jan 8, 2025	Language ModelingLanguage Modelling	—Unverified
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Jan 7, 2025	DiversityKnowledge Distillation	CodeCode Available
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning	Jan 6, 2025	MathMathematical Reasoning	—Unverified
InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion	Jan 6, 2025	GSM8KHumanEval	—Unverified
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap	Jan 5, 2025	MathMathematical Reasoning	—Unverified
Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models	Jan 5, 2025	Math	—Unverified
Instruction-Following Pruning for Large Language Models	Jan 3, 2025	Instruction FollowingMath	—Unverified
A Probabilistic Model for Node Classification in Directed Graphs	Jan 3, 2025	MathNode Classification	CodeCode Available
Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models	Jan 3, 2025	GSM8KMath	—Unverified
DIVE: Diversified Iterative Self-Improvement	Jan 1, 2025	DiversityGSM8K	CodeCode Available
Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training	Jan 1, 2025	Math	—Unverified
Rethink Delay Doppler Channels and Time-Frequency Coding	Dec 31, 2024	Math	—Unverified
Measuring Large Language Models Capacity to Annotate Journalistic Sourcing	Dec 30, 2024	BenchmarkingEthics	—Unverified
Slow Perception: Let's Perceive Geometric Figures Step-by-step	Dec 30, 2024	MathVisual Reasoning	—Unverified
Dynamic Skill Adaptation for Large Language Models	Dec 26, 2024	Math	—Unverified
StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs	Dec 23, 2024	BenchmarkingLogical Reasoning	—Unverified
Evaluating the Design Features of an Intelligent Tutoring System for Advanced Mathematics Learning	Dec 23, 2024	Math	—Unverified
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning	Dec 23, 2024	Arithmetic ReasoningGSM8K	—Unverified
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions	Dec 22, 2024	GSM8KMath	—Unverified
System-2 Mathematical Reasoning via Enriched Instruction Tuning	Dec 22, 2024	ERPGSM8K	—Unverified
Correct implied volatility shapes and reliable pricing in the rough Heston model	Dec 20, 2024	Math	—Unverified
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation	Dec 20, 2024	MathMathematical Reasoning	CodeCode Available
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning	Dec 20, 2024	Language ModelingLanguage Modelling	—Unverified
Formal Mathematical Reasoning: A New Frontier in AI	Dec 20, 2024	Automated Theorem ProvingMath	—Unverified
Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Dec 19, 2024	MathMathematical Reasoning	CodeCode Available
Conceptual In-Context Learning and Chain of Concepts: Solving Complex Conceptual Problems Using Large Language Models	Dec 19, 2024	In-Context LearningMath	—Unverified
AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling	Dec 19, 2024	Math	—Unverified
Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning	Dec 19, 2024	Math	—Unverified

Show:10 25 50

← PrevPage 17 of 32Next →

No leaderboard results yet.