SOTAVerified

Math

Papers

Showing 501–525 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations | Code | 1 |
| Pretrained Language Models are Symbolic Mathematics Solvers too! | Code | 1 |
| Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination | Code | 1 |
| AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation | Code | 1 |
| FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving | Code | 1 |
| RaDeR: Reasoning-aware Dense Retrieval Models | Code | 1 |
| Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Code | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | Code | 1 |
| Expression Syntax Information Bottleneck for Math Word Problems | Code | 1 |
| EXAONE Deep: Reasoning Enhanced Language Models | Code | 1 |
| Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation | Code | 1 |
| Reasoning with Reinforced Functional Token Tuning | Code | 1 |
| Prover-Verifier Games improve legibility of LLM outputs | Code | 0 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | Code | 0 |
| Can LLMs Solve longer Math Word Problems Better? | Code | 0 |
| A quantitative study of NLP approaches to question difficulty estimation | Code | 0 |
| Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval | Code | 0 |
| Can LLMs Reason in the Wild with Programs? | Code | 0 |
| A Probabilistic Model for Node Classification in Directed Graphs | Code | 0 |
| Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange | Code | 0 |
| Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators | Code | 0 |
| Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers | Code | 0 |
| Evaluating and Optimizing Educational Content with Large Language Model Judgments | Code | 0 |
| Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions? | Code | 0 |
| A Goal-Driven Tree-Structured Neural Model for Math Word Problems | Code | 0 |

No leaderboard results yet.