SOTAVerified

HumanEval

Papers

Showing 231240 of 264 papers

TitleStatusHype
Test-Driven Development for Code Generation0
HumanEval on Latest GPT Models -- 2024Code0
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models0
NoFunEval: Funny How Code LMs Falter on Requirements Beyond Functional Correctness0
A Novel Approach for Automatic Program Repair using Round-Trip Translation with Large Language ModelsCode0
Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs0
PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLMs0
Instruction Fusion: Advancing Prompt Evolution through HybridizationCode0
A Review of Repository Level Prompting for LLMs0
Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data0
Show:102550
← PrevPage 24 of 27Next →

No leaderboard results yet.