SOTAVerified

Math

Papers

Showing 551–600 of 1596 papers

Title | Status | Hype
Building Math Agents with Multi-Turn Iterative Preference Learning | | 0
A Bayesian model for recognizing handwritten mathematical expressions | | 0
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | | 0
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | | 0
A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | | 0
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | | 0
AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling | | 0
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | | 0
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach | | 0
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | | 0
Introduction to Coresets: Accurate Coresets | | 0
Enhancing Math Learning in an LMS Using AI-Driven Question Recommendations | | 0
Enhancing Mathematical Reasoning in LLMs with Background Operators | | 0
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | | 0
Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation | | 0
Bridging Offline and Online Reinforcement Learning for LLMs | | 0
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | | 0
End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics | | 0
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | | 0
Breaking Ties: Regression Discontinuity Design Meets Market Design | | 0
構建一個中文國小數學文字問題語料庫 (Building a Corpus for Developing the Chinese Elementary School Math Word Problem Solver) [In Chinese] | | 0
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy | | 0
Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving | | 0
Enabling Massive Deep Neural Networks with the GraphBLAS | | 0
Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models | | 0
Empirical entropy, minimax regret and minimax risk | | 0
Emergent inabilities? Inverse scaling over the course of pretraining | | 0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | | 0
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | | 0
Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | | 0
Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | | 0
Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | | 0
Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals | | 0
A novel variational model for image registration using Gaussian curvature | | 0
1bit-Merging: Dynamic Quantized Merging for Large Language Models | | 0
Efficient Tool Use with Chain-of-Abstraction Reasoning | | 0
A note on the option price and 'Mass at zero in the uncorrelated SABR model and implied volatility asymptotics' | | 0
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | | 0
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | | 0
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards | | 0
Intriguing Properties of Large Language and Vision Models | | 0
Introducing the Mathematics Meme Repository | | 0
Investigating Math Word Problems using Pretrained Multilingual Language Models | | 0
Kappa Learning: A New Method for Measuring Similarity Between Educational Items Using Performance Data | | 0
Effects of context, complexity, and clustering on evaluation for math formula retrieval | | 0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | | 0
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | | 0
An Optimal Likelihood Free Method for Biological Model Selection | | 0
Interleaved Reasoning for Large Language Models via Reinforcement Learning | | 0
EasyMath: A 0-shot Math Benchmark for SLMs | | 0
Page 12 of 32

No leaderboard results yet.