SOTAVerified|Agents Browse Leaderboard About Blog

HumanEval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–60 of 264 papers

Title	Date	Tasks	Status	Hype	Score
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models	Feb 24, 2024	HumanEvalMemorization	CodeCode Available	1	5
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding	Feb 19, 2024	HumanEvalLanguage Modeling	CodeCode Available	1	5
Learning to Generate Unit Tests for Automated Debugging	Feb 3, 2025	HumanEvalLarge Language Model	CodeCode Available	1	5
Getting the most out of your tokenizer for pre-training and domain adaptation	Feb 1, 2024	Code GenerationDomain Adaptation	CodeCode Available	1	5
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks	Oct 16, 2024	Code GenerationHumanEval	CodeCode Available	1	5
Fault-Aware Neural Code Rankers	Jun 4, 2022	Code GenerationHumanEval	CodeCode Available	1	5
LeTI: Learning to Generate from Textual Interactions	May 17, 2023	Code GenerationEvent Argument Extraction	CodeCode Available	1	5
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models	Feb 23, 2025	Code GenerationHumanEval	CodeCode Available	1	5
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning	Feb 14, 2024	Code GenerationHumanEval	CodeCode Available	1	5
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules	Oct 13, 2023	Code GenerationHumanEval	CodeCode Available	1	5

Show:10 25 50

← PrevPage 6 of 27Next →

No leaderboard results yet.