SOTAVerified

Math

Papers

Showing 301–310 of 1596 papers

Title	Date	Tasks	Status	Hype
Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL	Feb 17, 2025	Code Generation, Math	Code Available	1
Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping	Feb 16, 2025	Code Generation, Instruction Following	Code Available	1
Dyve: Thinking Fast and Slow for Dynamic Process Verification	Feb 16, 2025	Math	Code Available	1
Do Large Language Model Benchmarks Test Reliability?	Feb 5, 2025	Language Modeling, Language Modelling	Code Available	1
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods	Feb 3, 2025	Math, Mathematical Reasoning	Code Available	1
Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis	Jan 30, 2025	Automated Theorem Proving, Math	Code Available	1
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation	Jan 24, 2025	Math	Code Available	1
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament	Jan 22, 2025	Math	Code Available	1
Control LLM: Controlled Evolution for Intelligence Retention in LLM	Jan 19, 2025	Math, Mathematical Reasoning	Code Available	1
ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian	Jan 12, 2025	Benchmarking, Math	Code Available	1

No leaderboard results yet.