SOTAVerified

mbpp

Papers

Showing 5175 of 129 papers

TitleStatusHype
Rethinking Repetition Problems of LLMs in Code GenerationCode1
RLTF: Reinforcement Learning from Unit Test FeedbackCode1
EffiLearner: Enhancing Efficiency of Generated Code via Self-OptimizationCode1
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-ContrastCode1
Unsupervised Evaluation of Code LLMs with Round-Trip CorrectnessCode1
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-ExpertsCode1
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation GuidanceCode0
Comments as Natural Logic Pivots: Improve Code Generation via Comment PerspectiveCode0
CodePAD: Sequence-based Code Generation with Pushdown AutomatonCode0
FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization systemCode0
Teaching Large Language Models to Self-DebugCode0
Self-Correcting Code Generation Using Small Language ModelsCode0
Instruction Fusion: Advancing Prompt Evolution through HybridizationCode0
Underwater Object Tracker: UOSTrack for Marine Organism Grasping of Underwater VehiclesCode0
Enhancing Large Language Models in Coding Through Multi-Perspective Self-ConsistencyCode0
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code GenerationCode0
Inference Scaling fLaws: The Limits of LLM Resampling with Imperfect VerifiersCode0
Textbooks Are All You Need0
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code0
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models0
Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer0
Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation0
USCD: Improving Code Generation of LLMs by Uncertainty-Aware Selective Contrastive Decoding0
The Program Testing Ability of Large Language Models for Code0
The Stack: 3 TB of permissively licensed source code0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.