| Hard Math -- Easy UVM: Pragmatic solutions for verifying hardware algorithms using UVM | Dec 6, 2024 | Math | —Unverified | 0 |
| The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding? | Feb 19, 2025 | Math | —Unverified | 0 |
| Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation | Oct 3, 2024 | GSM8KMath | —Unverified | 0 |
| Hawkeye:Efficient Reasoning with Model Collaboration | Apr 1, 2025 | Mathmodel | —Unverified | 0 |
| Heimdall: test-time scaling on the generative verification | Apr 14, 2025 | Math | —Unverified | 0 |
| HelpSteer3: Human-Annotated Feedback and Edit Data to Empower Inference-Time Scaling in Open-Ended General-Domain Tasks | Mar 6, 2025 | ChatbotLogical Reasoning | —Unverified | 0 |
| hep-th | Jun 27, 2018 | Binary ClassificationMath | —Unverified | 0 |
| Herald: A Natural Language Annotated Lean 4 Dataset | Oct 9, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Hierarchical Attention Decoder for Solving Math Word Problems | Nov 16, 2021 | DecoderMath | —Unverified | 0 |
| Hierarchical evolutive systems, fuzzy categories and the living single cell | Jan 31, 2018 | Math | —Unverified | 0 |
| WebMIaS on Docker: Deploying Math-Aware Search in a Single Line of Code | Jun 1, 2021 | MathRetrieval | —Unverified | 0 |
| Homeostatic Mechanisms in Biological Systems | Feb 22, 2022 | Math | —Unverified | 0 |
| Big Math and the One-Brain Barrier A Position Paper and Architecture Proposal | Apr 23, 2019 | MathPosition | —Unverified | 0 |
| How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study | Apr 1, 2025 | Code GenerationMath | —Unverified | 0 |
| Biased Programmers? Or Biased Data? A Field Experiment in Operationalizing AI Ethics | Dec 4, 2020 | EthicsMath | —Unverified | 0 |
| The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search | Jul 22, 2015 | Information RetrievalMath | —Unverified | 0 |
| How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation | Aug 1, 2016 | Community Question AnsweringMath | —Unverified | 0 |
| Weighted Polynomial Approximations: Limits for Learning and Pseudorandomness | Dec 8, 2014 | Math | —Unverified | 0 |
| How You See Me | Nov 20, 2018 | Math | —Unverified | 0 |
| Human Learning about AI | Jun 8, 2024 | Math | —Unverified | 0 |
| Hydrodynamics of Markets:Hidden Links Between Physics and Finance | Mar 14, 2024 | Math | —Unverified | 0 |
| HyperCLOVA X Technical Report | Apr 2, 2024 | Instruction FollowingMachine Translation | —Unverified | 0 |
| Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | Feb 17, 2025 | Math | —Unverified | 0 |
| Identifying equivalent Calabi--Yau topologies: A discrete challenge from math and physics for machine learning | Feb 15, 2022 | BIG-bench Machine LearningMath | —Unverified | 0 |
| Illinois Math Solver: Math Reasoning on the Web | Jun 1, 2016 | MathMath Word Problem Solving | —Unverified | 0 |
| The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs | May 23, 2025 | Cross-Lingual TransferMath | —Unverified | 0 |
| Improve Mathematical Reasoning in Language Models by Automated Process Supervision | Jun 5, 2024 | GSM8KMath | —Unverified | 0 |
| Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations | Jun 27, 2019 | Math | —Unverified | 0 |
| Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation | Feb 4, 2024 | HallucinationMath | —Unverified | 0 |
| Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank | Apr 19, 2024 | Distractor GenerationMath | —Unverified | 0 |
| Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach | Mar 17, 2025 | GSM8KMath | —Unverified | 0 |
| Improving Equation Set Problems with Label Augmentation | Nov 16, 2021 | DecoderMath | —Unverified | 0 |
| Improving Large Language Model Fine-tuning for Solving Math Problems | Oct 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification | Oct 5, 2024 | GSM8KMath | —Unverified | 0 |
| Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring | Oct 29, 2024 | Math | —Unverified | 0 |
| Improving Math Word Problems with Pre-trained Knowledge and Hierarchical Reasoning | Nov 1, 2021 | MathSentence | —Unverified | 0 |
| Improving Multilingual Math Reasoning for African Languages | May 26, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming | Aug 24, 2021 | Mathtext-classification | —Unverified | 0 |
| In between myth and reality: AI for math -- a case study in category theory | Apr 17, 2025 | Math | —Unverified | 0 |
| Incremental Sequence Classification with Temporal Consistency | May 22, 2025 | ClassificationLanguage Modeling | —Unverified | 0 |
| Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models | Dec 18, 2024 | HumanEvalImitation Learning | —Unverified | 0 |
| Inference Computation Scaling for Feature Augmentation in Recommendation Systems | Feb 22, 2025 | MathRecommendation Systems | —Unverified | 0 |
| Beyond Sentential Semantic Parsing: Tackling the Math SAT with a Cascade of Tree Transducers | Sep 1, 2017 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion | Jan 6, 2025 | GSM8KHumanEval | —Unverified | 0 |
| Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models | May 29, 2025 | Logical ReasoningMath | —Unverified | 0 |
| InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning | Sep 19, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Information Token Driven Machine Learning for Electronic Markets: Performance Effects in Behavioral Financial Big Data Analytics | Mar 30, 2020 | BIG-bench Machine LearningMath | —Unverified | 0 |
| InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models | Mar 9, 2025 | Computational EfficiencyMath | —Unverified | 0 |
| Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps | Oct 14, 2024 | Math | —Unverified | 0 |
| Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home | Jun 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |