SOTAVerified

Math

Papers

Showing 11511175 of 1596 papers

TitleStatusHype
Stem-ming the Tide: Predicting STEM attrition using student transcript data0
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing0
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo0
xGen-small Technical Report0
VideoGameBench: Can Vision-Language Models complete popular video games?0
Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning0
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback0
A case study : Influence of Dimension Reduction on regression trees-based Algorithms -Predicting Aeronautics Loads of a Derivative Aircraft0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation0
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards0
A Careful Examination of Large Language Model Performance on Grade School Arithmetic0
Strictly monotone mean-variance preferences with applications to portfolio selection0
StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs0
A Bayesian model for recognizing handwritten mathematical expressions0
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class0
VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM0
Subtle Errors Matter: Preference Learning via Error-injected Self-editing0
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications0
Supervised Optimism Correction: Be Confident When LLMs Are Sure0
Sustainable Border Control Policy in the COVID-19 Pandemic: A Math Modeling Study0
SVM-based Deep Stacking Networks0
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution0
Visual Analytics of Student Learning Behaviors on K-12 Mathematics E-learning Platforms0
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning0
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning0
Show:102550
← PrevPage 47 of 64Next →

No leaderboard results yet.