SOTAVerified

HumanEval

Papers

Showing 221–230 of 264 papers

| Title | Status | Hype |
| --- | --- | --- |
| OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs | | 0 |
| PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback | | 0 |
| Past as a Guide: Leveraging Retrospective Learning for Python Code Completion | | 0 |
| PERC: Plan-As-Query Example Retrieval for Underrepresented Code Generation | | 0 |
| Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic? | | 0 |
| Plan for Speed -- Dilated Scheduling for Masked Diffusion Language Models | | 0 |
| PLUM: Improving Code LMs with Execution-Guided On-Policy Preference Learning Driven By Synthetic Test Cases | | 0 |
| Prior Prompt Engineering for Reinforcement Fine-Tuning | | 0 |
| Qiskit Code Assistant: Training LLMs for generating Quantum Computing Code | | 0 |
| Qiskit HumanEval: An Evaluation Benchmark For Quantum Code Generative Models | | 0 |

No leaderboard results yet.