| No more hard prompts: SoftSRV prompting for synthetic data generation | Oct 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy | Jun 16, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Non-congruent non-degenerate curves with identical signatures | Dec 20, 2019 | Math | —Unverified | 0 |
| None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | Feb 18, 2025 | MathMemorization | —Unverified | 0 |
| Nonlinear and Machine Learning Analyses on High-Density EEG data of Math Experts and Novices | Dec 1, 2022 | EEGElectroencephalogram (EEG) | —Unverified | 0 |
| Not All LLM Reasoners Are Created Equal | Oct 2, 2024 | AllCode Generation | —Unverified | 0 |
| Noun-MWP: Math Word Problems Meet Noun Answers | Oct 1, 2022 | MathQuestion Answering | —Unverified | 0 |
| Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions | Oct 3, 2023 | MathMathematical Reasoning | —Unverified | 0 |
| NumGPT: Improving Numeracy Ability of Generative Pre-trained Models | Sep 7, 2021 | Math | —Unverified | 0 |
| NVLM: Open Frontier-Class Multimodal LLMs | Sep 17, 2024 | MathMultimodal Reasoning | —Unverified | 0 |
| O1 Embedder: Let Retrievers Think Before Action | Feb 11, 2025 | Contrastive LearningMath | —Unverified | 0 |
| A risk analysis for a system stabilized by a central agent | Aug 17, 2015 | Math | —Unverified | 0 |
| ArGoT: A Glossary of Terms extracted from the arXiv | Sep 7, 2021 | ArticlesMath | —Unverified | 0 |
| Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | Apr 21, 2023 | MathMultiple-choice | —Unverified | 0 |
| ARB: Advanced Reasoning Benchmark for Large Language Models | Jul 25, 2023 | Math | —Unverified | 0 |
| On Designing Effective RL Reward at Training Time for LLM Reasoning | Oct 19, 2024 | GSM8KMath | —Unverified | 0 |
| oneDAL Optimization for ARM Scalable Vector Extension: Maximizing Efficiency for High-Performance Data Science | Apr 5, 2025 | Math | —Unverified | 0 |
| One RL to See Them All: Visual Triple Unified Reinforcement Learning | May 23, 2025 | AllMath | —Unverified | 0 |
| Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning | Apr 4, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| On Sharpness of Error Bounds for Multivariate Neural Network Approximation | Apr 5, 2020 | Math | —Unverified | 0 |
| On sparse connectivity, adversarial robustness, and a novel model of the artificial neuron | Jun 16, 2020 | Adversarial RobustnessComputational Efficiency | —Unverified | 0 |
| On the definition of a confounder | Apr 2, 2013 | Causal Inferencecounterfactual | —Unverified | 0 |
| On the Difficulty of Characterizing Network Formation with Endogenous Behavior | Feb 12, 2023 | Math | —Unverified | 0 |
| On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization | May 24, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| A range characterization of the single-quadrant ADRT | Oct 11, 2020 | Math | —Unverified | 0 |
| On the Empirical Complexity of Reasoning and Planning in LLMs | Apr 17, 2024 | Math | —Unverified | 0 |
| On the existence of minimizers in shallow residual ReLU neural network optimization landscapes | Feb 28, 2023 | Math | —Unverified | 0 |
| On the Inductive Bias of Stacking Towards Improving Reasoning | Sep 27, 2024 | Inductive BiasMath | —Unverified | 0 |
| On the quasi-sure superhedging duality with frictions | Sep 18, 2019 | Math | —Unverified | 0 |
| OntoMath^PRO 2.0 Ontology: Updates of the Formal Model | Mar 17, 2023 | ManagementMath | —Unverified | 0 |
| OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | Nov 9, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Unbiased Math Word Problems Benchmark for Mitigating Solving Bias | Jan 16, 2022 | Math | —Unverified | 0 |
| A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | Sep 10, 2024 | Emotional IntelligenceMath | —Unverified | 0 |
| Approximation properties of Residual Neural Networks for Kolmogorov PDEs | Oct 30, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Optimal AdaBoost Converges | Oct 11, 2022 | Math | —Unverified | 0 |
| Optimal classification in sparse Gaussian graphic model | Dec 21, 2012 | ClassificationGeneral Classification | —Unverified | 0 |
| Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation | Oct 22, 2024 | GSM8KMath | —Unverified | 0 |
| Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning | Mar 10, 2025 | MathMeta Reinforcement Learning | —Unverified | 0 |
| Orca-Math: Unlocking the potential of SLMs in Grade School Math | Feb 16, 2024 | Arithmetic ReasoningGSM8K | —Unverified | 0 |
| OTC: Optimal Tool Calls via Reinforcement Learning | Apr 21, 2025 | Mathreinforcement-learning | —Unverified | 0 |
| Outcome-based Reinforcement Learning to Predict the Future | May 23, 2025 | Holdout SetMath | —Unverified | 0 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| Oxford Handbook on AI Ethics Book Chapter on Race and Gender | Aug 8, 2019 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | Aug 10, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Uncertainty-Based Joint Training For Semi-Supervised Math Word Problem | Nov 16, 2021 | Math | —Unverified | 0 |
| Approximating Sparse PCA from Incomplete Data | Mar 12, 2015 | Math | —Unverified | 0 |
| A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | Aug 13, 2024 | Common Sense ReasoningMath | —Unverified | 0 |
| PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation? | Mar 20, 2024 | Abstractive Text SummarizationContinual Pretraining | —Unverified | 0 |
| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain AdaptationGSM8K | —Unverified | 0 |
| Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces | May 12, 2023 | ClusteringFairness | —Unverified | 0 |