SOTAVerified

Math

Papers

Showing 701725 of 1596 papers

TitleStatusHype
More is More: Addition Bias in Large Language ModelsCode0
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs0
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end ModelCode9
S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners0
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language ModelsCode1
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity0
Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic0
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems0
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models0
Generative Verifiers: Reward Modeling as Next-Token Prediction0
What makes math problems hard for reinforcement learning: a case studyCode1
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class0
Multi-tool Integration Application for Math Reasoning Using Large Language Model0
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language ModelsCode1
Mathematical Information Retrieval: Search and Question Answering0
Benchmarking Large Language Models for Math Reasoning TasksCode0
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning0
A Study of PHOC Spatial Region Configurations for Math Formula Retrieval0
Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions0
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical ReasoningCode1
Leveraging Web-Crawled Data for High-Quality Fine-TuningCode0
Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models0
Bridging and Modeling Correlations in Pairwise Data for Direct Preference OptimizationCode1
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical BenchmarkCode0
Show:102550
← PrevPage 29 of 64Next →

No leaderboard results yet.