SOTAVerified

Math

Papers

Showing 2130 of 1596 papers

TitleStatusHype
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement LearningCode7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!Code7
EvoAgentX: An Automated Framework for Evolving Agentic WorkflowsCode7
O1 Replication Journey: A Strategic Progress Report -- Part 1Code7
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
GPT-4 Technical ReportCode6
Mistral 7BCode6
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsCode6
Qwen Technical ReportCode6
Process Reinforcement through Implicit RewardsCode5
Show:102550
← PrevPage 3 of 160Next →

No leaderboard results yet.