Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 801–825 of 1596 papers

Title	Date	Tasks	Status
Pheromone-based Learning of Optimal Reasoning Paths	Jan 31, 2025	ARCGSM8K	—Unverified
PixelWorld: Towards Perceiving Everything as Pixels	Jan 31, 2025	Math	—Unverified
Fairshare Data Pricing via Data Valuation for Large Language Models	Jan 31, 2025	Data ValuationMath	—Unverified
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Jan 30, 2025	Language ModelingLanguage Modelling	—Unverified
Examining the Robustness of Large Language Models across Language Complexity	Jan 30, 2025	Math	—Unverified
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Jan 28, 2025	MathMathematical Problem-Solving	—Unverified
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Jan 26, 2025	MathMathematical Reasoning	—Unverified
Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning	Jan 25, 2025	Math	—Unverified
DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images	Jan 24, 2025	Math	—Unverified
Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages	Jan 23, 2025	Instruction FollowingMath	—Unverified
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Jan 21, 2025	GSM8KIn-Context Learning	—Unverified
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests	Jan 21, 2025	Math	—Unverified
RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?	Jan 20, 2025	MathReinforcement Learning (RL)	—Unverified
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective	Jan 19, 2025	Automated Theorem ProvingMath	—Unverified
Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis	Jan 18, 2025	cognitive diagnosisMath	CodeCode Available
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback	Jan 18, 2025	MathMathematical Reasoning	—Unverified
Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision	Jan 14, 2025	Instruction FollowingMath	CodeCode Available
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Jan 14, 2025	GSM8KMath	CodeCode Available
Can Vision-Language Models Evaluate Handwritten Math?	Jan 13, 2025	Math	CodeCode Available
Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Jan 10, 2025	Math	—Unverified
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction	Jan 9, 2025	MathSentence	CodeCode Available
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications	Jan 9, 2025	MathRAG	—Unverified
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	Jan 8, 2025	Language ModelingLanguage Modelling	—Unverified
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Jan 7, 2025	DiversityKnowledge Distillation	CodeCode Available
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning	Jan 6, 2025	MathMathematical Reasoning	—Unverified

Show:10 25 50

← PrevPage 33 of 64Next →

No leaderboard results yet.