| FRACTAL: Fine-Grained Scoring from Aggregate Text Labels | Apr 7, 2024 | Math, Multiple Instance Learning | —Unverified | 0 |
| BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Jan 31, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems | Oct 24, 2024 | Benchmarking, Common Sense Reasoning | —Unverified | 0 |
| From fixation probabilities to d-player games: an inverse problem in evolutionary dynamics | Nov 20, 2018 | Math, Unity | —Unverified | 0 |
| The Mathematics of Market Timing | Dec 13, 2017 | Math | —Unverified | 0 |
| From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting | Dec 18, 2023 | Diversity, GSM8K | —Unverified | 0 |
| From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision | Mar 21, 2024 | Math | —Unverified | 0 |
| From Textbooks to Knowledge: A Case Study in Harvesting Axiomatic Knowledge from Textbooks to Solve Geometry Problems | Sep 1, 2017 | Math, Question Answering | —Unverified | 0 |
| From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics | Mar 10, 2025 | Math, Question Answering | —Unverified | 0 |
| Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens | Oct 18, 2024 | Math, Question Answering | —Unverified | 0 |
| Bridging Offline and Online Reinforcement Learning for LLMs | Jun 26, 2025 | Instruction Following, Math | —Unverified | 0 |
| Breaking Ties: Regression Discontinuity Design Meets Market Design | Dec 31, 2020 | Math, regression | —Unverified | 0 |
| Gamifying Math Education using Object Detection | Apr 13, 2023 | Math, Object | —Unverified | 0 |
| GAPS: Geometry-Aware Problem Solver | Jan 29, 2024 | Geometry Problem Solving, Math | —Unverified | 0 |
| Gemma 3 Technical Report | Mar 25, 2025 | Instruction Following, Math | —Unverified | 0 |
| Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | Jul 20, 2024 | Language Modelling, Machine Translation | —Unverified | 0 |
| Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM | Mar 12, 2024 | Arithmetic Reasoning, Code Generation | —Unverified | 0 |
| Brains vs. Bytes: Evaluating LLM Proficiency in Olympiad Mathematics | Apr 1, 2025 | Math, Mathematical Problem-Solving | —Unverified | 0 |
| Generate & Rank: A Multi-task Framework for Math Word Problems | Sep 7, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Generating Equation by Utilizing Operators : GEO model | Dec 1, 2020 | Decoder, Machine Translation | —Unverified | 0 |
| Controlling Equational Reasoning in Large Language Models with Prompt Interventions | Jul 19, 2023 | Hallucination, In-Context Learning | —Unverified | 0 |
| Generating Math Word Problems from Equations with Topic Controlling and Commonsense Enforcement | Dec 14, 2020 | Math, Text Generation | —Unverified | 0 |
| Generating Narrated Lecture Videos from Slides with Synchronized Highlights | May 5, 2025 | Math, text-to-speech | —Unverified | 0 |
| SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning | May 16, 2025 | Math | —Unverified | 0 |
| Generative AI for Enhancing Active Learning in Education: A Comparative Study of GPT-3.5 and GPT-4 in Crafting Customized Test Questions | Jun 20, 2024 | Active Learning, Math | —Unverified | 0 |
| Generative Discovery of Partial Differential Equations by Learning from Math Handbooks | May 9, 2025 | Computational Efficiency, Math | —Unverified | 0 |
| Generative Verifiers: Reward Modeling as Next-Token Prediction | Aug 27, 2024 | Math, Prediction | —Unverified | 0 |
| GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning | Apr 1, 2025 | Math, Mathematical Reasoning | —Unverified | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | Dec 12, 2024 | Geometry Problem Solving, In-Context Learning | —Unverified | 0 |
| Geometry is All You Need: A Unified Taxonomy of Matrix and Tensor Factorization for Compression of Generative Language Models | Oct 3, 2024 | All, Language Modeling | —Unverified | 0 |
| The Perfect Blend: Redefining RLHF with Mixture of Judges | Sep 30, 2024 | Instruction Following, Math | —Unverified | 0 |
| Giving BERT a Calculator: Finding Operations and Arguments with Reading Comprehension | Aug 31, 2019 | Math, Question Answering | —Unverified | 0 |
| GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements | Feb 13, 2024 | GSM8K, Math | —Unverified | 0 |
| Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 | Feb 5, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning | Dec 14, 2023 | Arithmetic Reasoning, Few-Shot Learning | —Unverified | 0 |
| BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation | Feb 6, 2025 | In-Context Learning, Knowledge Distillation | —Unverified | 0 |
| GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable | Apr 10, 2025 | GPU, Math | —Unverified | 0 |
| GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students | Sep 16, 2024 | Math | —Unverified | 0 |
| GPU Domain Specialization via Composable On-Package Architecture | Apr 5, 2021 | GPU, Math | —Unverified | 0 |
| Graders should cheat: privileged information enables expert-level automated evaluations | Feb 16, 2025 | Math | —Unverified | 0 |
| Graph2Tac: Online Representation Learning of Formal Math Concepts | Jan 5, 2024 | AI Agent, Automated Theorem Proving | —Unverified | 0 |
| GRIN: GRadient-INformed MoE | Sep 18, 2024 | HellaSwag, HumanEval | —Unverified | 0 |
| BloomWise: Enhancing Problem-Solving capabilities of Large Language Models using Bloom's-Taxonomy-Inspired Prompts | Oct 5, 2024 | Math | —Unverified | 0 |
| Blink of an eye: a simple theory for feature localization in generative models | Feb 2, 2025 | Math | —Unverified | 0 |
| GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers | May 21, 2021 | Clustering, Descriptive | —Unverified | 0 |
| Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation | Jun 9, 2025 | GSM8K, HumanEval | —Unverified | 0 |
| Guiding Language Model Reasoning with Planning Tokens | Oct 9, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Hallucinating AI Hijacking Attack: Large Language Models and Malicious Code Recommenders | Oct 9, 2024 | Math | —Unverified | 0 |
| The Role of Diversity in In-Context Learning for Large Language Models | May 26, 2025 | Diversity, In-Context Learning | —Unverified | 0 |
| The Search-and-Mix Paradigm in Approximate Nash Equilibrium Algorithms | Oct 12, 2023 | Math | —Unverified | 0 |