| OTC: Optimal Tool Calls via Reinforcement Learning | Apr 21, 2025 | Mathreinforcement-learning | —Unverified | 0 | 0 |
| Outcome-based Reinforcement Learning to Predict the Future | May 23, 2025 | Holdout SetMath | —Unverified | 0 | 0 |
| Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling | Mar 24, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 | 0 |
| Oxford Handbook on AI Ethics Book Chapter on Race and Gender | Aug 8, 2019 | BIG-bench Machine LearningDecision Making | —Unverified | 0 | 0 |
| P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | Aug 10, 2024 | DiversityLogical Reasoning | —Unverified | 0 | 0 |
| Uncertainty-Based Joint Training For Semi-Supervised Math Word Problem | Nov 16, 2021 | Math | —Unverified | 0 | 0 |
| Approximating Sparse PCA from Incomplete Data | Mar 12, 2015 | Math | —Unverified | 0 | 0 |
| A Perspective on Large Language Models, Intelligent Machines, and Knowledge Acquisition | Aug 13, 2024 | Common Sense ReasoningMath | —Unverified | 0 | 0 |
| PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation? | Mar 20, 2024 | Abstractive Text SummarizationContinual Pretraining | —Unverified | 0 | 0 |
| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain AdaptationGSM8K | —Unverified | 0 | 0 |