SOTAVerified

Mathematical Problem-Solving

Papers

Showing 51-100 of 106 papers

Title | Status | Hype
LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning | Code | 0
MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems | Code | 0
Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks | Code | 0
PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | Code | 0
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving | Code | 0
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models | Code | 0
JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving | | 0
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | | 0
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task | | 0
Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems | | 0
How Do Large Language Monkeys Get Their Power (Laws)? | | 0
The Buffer Mechanism for Multi-Step Information Reasoning in Language Models | | 0
Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models | | 0
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning | | 0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning | | 0
Evaluation of LLMs for mathematical problem solving | | 0
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving | | 0
Mixture-of-Instructions: Comprehensive Alignment of a Large Language Model through the Mixture of Diverse System Prompting Instructions | | 0
Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu | | 0
Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning | | 0
Can LLMs plan paths with extra hints from solvers? | | 0
Building Math Agents with Multi-Turn Iterative Preference Learning | | 0
OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step | | 0
On Vanishing Variance in Transformer Length Generalization | | 0
Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | | 0
Beyond Traditional Teaching: The Potential of Large Language Models and Chatbots in Graduate Engineering Education | | 0
Performance Comparison of Large Language Models on Advanced Calculus Problems | | 0
PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation | | 0
PoLAR: Polar-Decomposed Low-Rank Adapter Representation | | 0
Premise Order Matters in Reasoning with Large Language Models | | 0
3D-Properties: Identifying Challenges in DPO and Charting a Path Forward | | 0
Bayesian artificial brain with ChatGPT | | 0
Reasoning Models Can Be Effective Without Thinking | | 0
Scaling Autonomous Agents via Automatic Reward Modeling And Planning | | 0
Scaling Laws for Autoregressive Generative Modeling | | 0
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models | | 0
VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning | | 0
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models | | 0
Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs | | 0
SMART: Self-Generating and Self-Validating Multi-Dimensional Assessment for LLMs' Mathematical Problem Solving | | 0
Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs | | 0
STRIVE: Structured Reasoning for Self-Improvement in Claim Verification | | 0
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations | | 0
Automatic Detection of Reflective Thinking in Mathematical Problem Solving based on Unconstrained Bodily Exploration | | 0
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving | | 0
TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving | | 0
The Consensus Game: Language Model Generation via Equilibrium Search | | 0
Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning | | 0
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | | 0
Large Language Models for Mathematical Reasoning: Progresses and Challenges | | 0
Page 2 of 3

No leaderboard results yet.