SOTAVerified

HumanEval

Papers

Showing 161–170 of 264 papers

Title | Status | Hype
CodeMirage: Hallucinations in Code Generated by Large Language Models | | 0
CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts | | 0
Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency | | 0
CodeShell Technical Report | | 0
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models | | 0
Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Prompting | | 0
Context-Augmented Code Generation Using Programming Knowledge Graphs | | 0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks | | 0
CREST: Effectively Compacting a Datastore For Retrieval-Based Speculative Decoding | | 0
CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution | | 0

No leaderboard results yet.