SOTAVerified

Math

Papers

Showing 901–950 of 1596 papers

Title | Status | Hype
Noisy Deductive Reasoning: How Humans Construct Math, and How Math Constructs Universes | | 0
No more hard prompts: SoftSRV prompting for synthetic data generation | | 0
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy | | 0
Non-congruent non-degenerate curves with identical signatures | | 0
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | | 0
Nonlinear and Machine Learning Analyses on High-Density EEG data of Math Experts and Novices | | 0
Not All LLM Reasoners Are Created Equal | | 0
Noun-MWP: Math Word Problems Meet Noun Answers | | 0
Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions | | 0
NumGPT: Improving Numeracy Ability of Generative Pre-trained Models | | 0
NVLM: Open Frontier-Class Multimodal LLMs | | 0
O1 Embedder: Let Retrievers Think Before Action | | 0
A risk analysis for a system stabilized by a central agent | | 0
ArGoT: A Glossary of Terms extracted from the arXiv | | 0
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | | 0
ARB: Advanced Reasoning Benchmark for Large Language Models | | 0
On Designing Effective RL Reward at Training Time for LLM Reasoning | | 0
oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | | 0
One RL to See Them All: Visual Triple Unified Reinforcement Learning | | 0
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning | | 0
On Sharpness of Error Bounds for Multivariate Neural Network Approximation | | 0
On sparse connectivity, adversarial robustness, and a novel model of the artificial neuron | | 0
On the definition of a confounder | | 0
On the Difficulty of Characterizing Network Formation with Endogenous Behavior | | 0
On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization | | 0
A range characterization of the single-quadrant ADRT | | 0
On the Empirical Complexity of Reasoning and Planning in LLMs | | 0
On the existence of minimizers in shallow residual ReLU neural network optimization landscapes | | 0
On the Inductive Bias of Stacking Towards Improving Reasoning | | 0
On the quasi-sure superhedging duality with frictions | | 0
OntoMath^PRO 2.0 Ontology: Updates of the Formal Model | | 0
OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | | 0
Unbiased Math Word Problems Benchmark for Mitigating Solving Bias | | 0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | | 0
Approximation properties of Residual Neural Networks for Kolmogorov PDEs | | 0
Optimal AdaBoost Converges | | 0
Optimal classification in sparse Gaussian graphic model | | 0
Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation | | 0
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning | | 0
Orca-Math: Unlocking the potential of SLMs in Grade School Math | | 0
OTC: Optimal Tool Calls via Reinforcement Learning | | 0
Outcome-based Reinforcement Learning to Predict the Future | | 0
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | | 0
Oxford Handbook on AI Ethics Book Chapter on Race and Gender | | 0
P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | | 0
Uncertainty-Based Joint Training For Semi-Supervised Math Word Problem | | 0
Approximating Sparse PCA from Incomplete Data | | 0
A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | | 0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation? | | 0
PARAMANU-GANITA: Language Model with Mathematical Capabilities | | 0

No leaderboard results yet.