SOTAVerified

HumanEval

Papers

Showing 226250 of 264 papers

TitleStatusHype
LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression0
Baichuan 2: Open Large-scale Language ModelsCode4
Can Programming Languages Boost Each Other via Instruction Tuning?Code0
Code Llama: Open Foundation Models for CodeCode6
CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation0
OctoPack: Instruction Tuning Code Large Language ModelsCode3
ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code GenerationCode1
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback0
Predicting Code Coverage without ExecutionCode1
Textbooks Are All You Need0
Is Self-Repair a Silver Bullet for Code Generation?Code1
WizardCoder: Empowering Code Large Language Models with Evol-InstructCode5
Large Language Models of Code Fail at Completing Code with Potential BugsCode0
SelfEvolve: A Code Evolution Framework via Large Language Models0
ANPL: Towards Natural Programming with Interactive DecompositionCode1
LeTI: Learning to Generate from Textual InteractionsCode1
CodeT5+: Open Code Large Language Models for Code Understanding and GenerationCode0
Structured Chain-of-Thought Prompting for Code Generation0
StarCoder: may the source be with you!Code5
Self-Edit: Fault-Aware Code Editor for Code GenerationCode0
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code GenerationCode3
Using Large Language Models to Generate JUnit Tests: An Empirical StudyCode0
Stochastic Code Generation0
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-XCode5
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
Show:102550
← PrevPage 10 of 11Next →

No leaderboard results yet.