SOTAVerified|Agents Browse Leaderboard About

HumanEval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 226–250 of 264 papers

Title	Date	Tasks	Status	Hype
LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression	Sep 25, 2023	Code GenerationHumanEval	—Unverified	0
Baichuan 2: Open Large-scale Language Models	Sep 19, 2023	Feature EngineeringGSM8K	CodeCode Available	4
Can Programming Languages Boost Each Other via Instruction Tuning?	Aug 31, 2023	HumanEval	CodeCode Available	0
Code Llama: Open Foundation Models for Code	Aug 24, 2023	16kCode Generation	CodeCode Available	6
CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation	Aug 17, 2023	Code GenerationFew-Shot Learning	—Unverified	0
OctoPack: Instruction Tuning Code Large Language Models	Aug 14, 2023	Code GenerationCode Repair	CodeCode Available	3
ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation	Aug 3, 2023	Class-level Code GenerationCode Generation	CodeCode Available	1
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback	Jul 27, 2023	Code GenerationHumanEval	—Unverified	0
Predicting Code Coverage without Execution	Jul 25, 2023	HumanEval	CodeCode Available	1
Textbooks Are All You Need	Jun 20, 2023	AllCode Generation	—Unverified	0
Is Self-Repair a Silver Bullet for Code Generation?	Jun 16, 2023	Code GenerationHumanEval	CodeCode Available	1
WizardCoder: Empowering Code Large Language Models with Evol-Instruct	Jun 14, 2023	Code GenerationHumanEval	CodeCode Available	5
Large Language Models of Code Fail at Completing Code with Potential Bugs	Jun 6, 2023	Code CompletionHumanEval	CodeCode Available	0
SelfEvolve: A Code Evolution Framework via Large Language Models	Jun 5, 2023	Code GenerationHumanEval	—Unverified	0
ANPL: Towards Natural Programming with Interactive Decomposition	May 29, 2023	ARCCode Generation	CodeCode Available	1
LeTI: Learning to Generate from Textual Interactions	May 17, 2023	Code GenerationEvent Argument Extraction	CodeCode Available	1
CodeT5+: Open Code Large Language Models for Code Understanding and Generation	May 13, 2023	Arithmetic ReasoningCode Completion	CodeCode Available	0
Structured Chain-of-Thought Prompting for Code Generation	May 11, 2023	Code GenerationHumanEval	—Unverified	0
StarCoder: may the source be with you!	May 9, 2023	8kCode Generation	CodeCode Available	5
Self-Edit: Fault-Aware Code Editor for Code Generation	May 6, 2023	Code GenerationHumanEval	CodeCode Available	0
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation	May 2, 2023	Code GenerationHumanEval	CodeCode Available	3
Using Large Language Models to Generate JUnit Tests: An Empirical Study	Apr 30, 2023	Code GenerationHumanEval	CodeCode Available	0
Stochastic Code Generation	Apr 14, 2023	Code GenerationDecoder	—Unverified	0
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X	Mar 30, 2023	BenchmarkingCode Generation	CodeCode Available	5
Reflexion: Language Agents with Verbal Reinforcement Learning	Mar 20, 2023	Decision MakingHumanEval	CodeCode Available	4

Show:10 25 50

← PrevPage 10 of 11Next →

No leaderboard results yet.