SOTAVerified

Math

Papers

Showing 526550 of 1596 papers

TitleStatusHype
Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation0
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions0
HyperCLOVA X Technical Report0
Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics0
Human Learning about AI0
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science0
Hydrodynamics of Markets:Hidden Links Between Physics and Finance0
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models0
Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations0
Can I understand what I create? Self-Knowledge Evaluation of Large Language Models0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio0
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework0
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation0
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation0
Approximation properties of Residual Neural Networks for Kolmogorov PDEs0
Entropy Martingale Optimal Transport and Nonlinear Pricing-Hedging Duality0
Calculus on MDPs: Potential Shaping as a Gradient0
Approximating Sparse PCA from Incomplete Data0
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation0
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems0
Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity0
Show:102550
← PrevPage 22 of 64Next →

No leaderboard results yet.