SOTAVerified|Agents Browse Leaderboard About

HumanEval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–264 of 264 papers

Title	Date	Tasks	Status	Hype
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback	Jul 27, 2023	Code GenerationHumanEval	—Unverified	0
Textbooks Are All You Need	Jun 20, 2023	AllCode Generation	—Unverified	0
Large Language Models of Code Fail at Completing Code with Potential Bugs	Jun 6, 2023	Code CompletionHumanEval	CodeCode Available	0
SelfEvolve: A Code Evolution Framework via Large Language Models	Jun 5, 2023	Code GenerationHumanEval	—Unverified	0
CodeT5+: Open Code Large Language Models for Code Understanding and Generation	May 13, 2023	Arithmetic ReasoningCode Completion	CodeCode Available	0
Structured Chain-of-Thought Prompting for Code Generation	May 11, 2023	Code GenerationHumanEval	—Unverified	0
Self-Edit: Fault-Aware Code Editor for Code Generation	May 6, 2023	Code GenerationHumanEval	CodeCode Available	0
Using Large Language Models to Generate JUnit Tests: An Empirical Study	Apr 30, 2023	Code GenerationHumanEval	CodeCode Available	0
Stochastic Code Generation	Apr 14, 2023	Code GenerationDecoder	—Unverified	0
Large Language Models Meet NL2Code: A Survey	Dec 19, 2022	HumanEvalSurvey	—Unverified	0
The Stack: 3 TB of permissively licensed source code	Nov 20, 2022	HumanEvalmbpp	—Unverified	0
Evaluating How Fine-tuning on Bimodal Data Effects Code Generation	Nov 15, 2022	Code GenerationHumanEval	CodeCode Available	0
Piloting Copilot, Codex, and StarCoder2: Hot Temperature, Cold Prompts, or Black Magic?	Oct 26, 2022	HumanEvalLanguage Modelling	—Unverified	0
Interactive Code Generation via Test-Driven User-Intent Formalization	Aug 11, 2022	Code GenerationHumanEval	—Unverified	0

Show:10 25 50

← PrevPage 11 of 11Next →

No leaderboard results yet.