SOTAVerified

Math

Papers

Showing 2130 of 1596 papers

TitleStatusHype
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference FeedbackCode7
StarCoder 2 and The Stack v2: The Next GenerationCode7
DSPy: Compiling Declarative Language Model Calls into Self-Improving PipelinesCode7
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language ModelsCode7
Mistral 7BCode6
Qwen Technical ReportCode6
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
GPT-4 Technical ReportCode6
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsCode6
Reinforcement Learning from Human FeedbackCode5
Show:102550
← PrevPage 3 of 160Next →

No leaderboard results yet.