SOTAVerified

Math

Papers

Showing 576–600 of 1596 papers

Title | Status | Hype
Empirical entropy, minimax regret and minimax risk | | 0
Emergent inabilities? Inverse scaling over the course of pretraining | | 0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | | 0
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | | 0
Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | | 0
Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | | 0
Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | | 0
Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals | | 0
A novel variational model for image registration using Gaussian curvature | | 0
1bit-Merging: Dynamic Quantized Merging for Large Language Models | | 0
Efficient Tool Use with Chain-of-Abstraction Reasoning | | 0
A note on the option price and 'Mass at zero in the uncorrelated SABR model and implied volatility asymptotics' | | 0
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | | 0
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | | 0
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards | | 0
Intriguing Properties of Large Language and Vision Models | | 0
Introducing the Mathematics Meme Repository | | 0
Investigating Math Word Problems using Pretrained Multilingual Language Models | | 0
Kappa Learning: A New Method for Measuring Similarity Between Educational Items Using Performance Data | | 0
Effects of context, complexity, and clustering on evaluation for math formula retrieval | | 0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | | 0
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | | 0
An Optimal Likelihood Free Method for Biological Model Selection | | 0
Interleaved Reasoning for Large Language Models via Reinforcement Learning | | 0
EasyMath: A 0-shot Math Benchmark for SLMs | | 0

No leaderboard results yet.