SOTAVerified

HumanEval

Papers

Showing 101125 of 264 papers

TitleStatusHype
Predicting Code Coverage without ExecutionCode1
ANPL: Towards Natural Programming with Interactive DecompositionCode1
Comments as Natural Logic Pivots: Improve Code Generation via Comment PerspectiveCode0
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank AdaptationCode0
Self-Correcting Code Generation Using Small Language ModelsCode0
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language ModelsCode0
Self-Edit: Fault-Aware Code Editor for Code GenerationCode0
HumanEval on Latest GPT Models -- 2024Code0
CodeT5+: Open Code Large Language Models for Code Understanding and GenerationCode0
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation GuidanceCode0
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language ModelsCode0
Measuring the Influence of Incorrect Code on Test GenerationCode0
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code GenerationCode0
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization systemCode0
Using Large Language Models to Generate JUnit Tests: An Empirical StudyCode0
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code GenerationCode0
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning TasksCode0
Evaluating How Fine-tuning on Bimodal Data Effects Code GenerationCode0
Enhancing Large Language Models in Coding Through Multi-Perspective Self-ConsistencyCode0
Enhancing Code Generation via Bidirectional Comment-Level Mutual GroundingCode0
CoCoNUT: Structural Code Understanding does not fall out of a treeCode0
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect VerifiersCode0
mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code GenerationCode0
Multi-Programming Language Ensemble for Code Generation in Large Language ModelCode0
Can Programming Languages Boost Each Other via Instruction Tuning?Code0
Show:102550
← PrevPage 5 of 11Next →

No leaderboard results yet.