SOTAVerified

Math

Papers

Showing 551600 of 1596 papers

TitleStatusHype
Efficient Non-Parametric Optimizer Search for Diverse TasksCode0
Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical ReasoningCode0
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational DataCode0
Smart Vision-Language ReasonersCode0
Effects of structure on reasoning in instance-level Self-DiscoverCode0
Effective Skill Unlearning through Intervention and AbstentionCode0
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay PerspectiveCode0
DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual DataCode0
Reasoning Graph Enhanced Exemplars Retrieval for In-Context LearningCode0
AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System NeedCode0
An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP)Code0
PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt TuningCode0
Reasoning in Large Language Models Through Symbolic Math Word ProblemsCode0
DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical CorrectionCode0
Don't Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration PitfallsCode0
An extrapolated and provably convergent algorithm for nonlinear matrix decomposition with the ReLU functionCode0
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem SolversCode0
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTSCode0
Adversarial Examples for Evaluating Math Word Problem SolversCode0
An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task SettingsCode0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge TracingCode0
Prover-Verifier Games improve legibility of LLM outputsCode0
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?Code0
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-TuningCode0
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice QuestionsCode0
An algorithm to represent inbreeding treesCode0
DIVE: Diversified Iterative Self-ImprovementCode0
Benchmarking Large Language Models for Math Reasoning TasksCode0
OntoMath^PRO Ontology: A Linked Data Hub for MathematicsCode0
Distinguishing affixoid formations from compoundsCode0
Discriminative Policy Optimization for Token-Level Reward ModelsCode0
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word ProblemCode0
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem SolvingCode0
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math ReasoningCode0
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language ModelsCode0
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning TasksCode0
Neural Machine Translation and Sequence-to-sequence Models: A TutorialCode0
Deterministic and Nondeterministic Particle Motion with Interaction MechanismsCode0
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression RecognitionCode0
More is More: Addition Bias in Large Language ModelsCode0
AutoMSC: Automatic Assignment of Mathematics Subject Classification LabelsCode0
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head AttentionsCode0
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained ClassificationCode0
MMATH: A Multilingual Benchmark for Mathematical ReasoningCode0
Automatic Short Math Answer Grading via In-context Meta-learningCode0
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMsCode0
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem SolvingCode0
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-MakingCode0
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language ModelsCode0
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?Code0
Show:102550
← PrevPage 12 of 32Next →

No leaderboard results yet.