SOTAVerified

Math

Papers

Showing 801-850 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Pheromone-based Learning of Optimal Reasoning Paths | | 0 |
| PixelWorld: Towards Perceiving Everything as Pixels | | 0 |
| Fairshare Data Pricing via Data Valuation for Large Language Models | | 0 |
| Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | | 0 |
| Examining the Robustness of Large Language Models across Language Complexity | | 0 |
| Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | | 0 |
| Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework | | 0 |
| Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning | | 0 |
| DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images | | 0 |
| Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages | | 0 |
| Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs | | 0 |
| An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | | 0 |
| RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? | | 0 |
| Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective | | 0 |
| Language Representation Favored Zero-Shot Cross-Domain Cognitive Diagnosis | Code | 0 |
| Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback | | 0 |
| Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision | Code | 0 |
| ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving | Code | 0 |
| Can Vision-Language Models Evaluate Handwritten Math? | Code | 0 |
| Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models | | 0 |
| Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Code | 0 |
| A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications | | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | | 0 |
| A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving | Code | 0 |
| Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | | 0 |
| InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion | | 0 |
| Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | | 0 |
| Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models | | 0 |
| Instruction-Following Pruning for Large Language Models | | 0 |
| A Probabilistic Model for Node Classification in Directed Graphs | Code | 0 |
| Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models | | 0 |
| DIVE: Diversified Iterative Self-Improvement | Code | 0 |
| Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training | | 0 |
| Rethink Delay Doppler Channels and Time-Frequency Coding | | 0 |
| Measuring Large Language Models Capacity to Annotate Journalistic Sourcing | | 0 |
| Slow Perception: Let's Perceive Geometric Figures Step-by-step | | 0 |
| Dynamic Skill Adaptation for Large Language Models | | 0 |
| StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | | 0 |
| Evaluating the Design Features of an Intelligent Tutoring System for Advanced Mathematics Learning | | 0 |
| Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning | | 0 |
| Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions | | 0 |
| System-2 Mathematical Reasoning via Enriched Instruction Tuning | | 0 |
| Correct implied volatility shapes and reliable pricing in the rough Heston model | | 0 |
| Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation | Code | 0 |
| Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | | 0 |
| Formal Mathematical Reasoning: A New Frontier in AI | | 0 |
| Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Code | 0 |
| Conceptual In-Context Learning and Chain of Concepts: Solving Complex Conceptual Problems Using Large Language Models | | 0 |
| AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | | 0 |
| Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning | | 0 |

No leaderboard results yet.