SOTAVerified

mbpp

Papers

Showing 101-129 of 129 papers

Title | Status | Hype
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models | | 0
Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data | | 0
Scattered Forest Search: Smarter Code Space Exploration with LLMs | | 0
Demo-Craft: Using In-Context Learning to Improve Code Generation in Large Language Models | | 0
Discrete Flow Matching | | 0
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation | | 0
Scoring Verifiers: Evaluating Synthetic Verification for Code and Reasoning | | 0
DSTC: Direct Preference Learning with Only Self-Generated Tests and Code to Improve Code LMs | | 0
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | | 0
Structured Chain-of-Thought Prompting for Code Generation | | 0
Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach | | 0
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | | 0
Software Vulnerability and Functionality Assessment using LLMs | | 0
Selection of Prompt Engineering Techniques for Code Generation through Predicting Code Complexity | | 0
Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs? | | 0
Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation | | 0
Self-Explained Keywords Empower Large Language Models for Code Generation | | 0
What I cannot execute, I do not understand: Training and Evaluating LLMs on Program Execution Traces | | 0
Evaluating LLM-driven User-Intent Formalization for Verification-Aware Languages | | 0
Interactive Code Generation via Test-Driven User-Intent Formalization | | 0
Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency | | 0
Interval-censored Hawkes processes | | 0
Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models | | 0
Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval | | 0
CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts | | 0
Large Language Model-Aware In-Context Learning for Code Generation | | 0
CodeMirage: Hallucinations in Code Generated by Large Language Models | | 0
Test-Driven Development for Code Generation | | 0
Learning to Reason via Self-Iterative Process Feedback for Small Language Models | | 0

No leaderboard results yet.