SOTAVerified|Agents Browse Leaderboard About

mbpp

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 81–90 of 129 papers

Title	Date	Tasks	Status	Hype
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts	Apr 23, 2024	HumanEvalmbpp	CodeCode Available	1
NExT: Teaching Large Language Models to Reason about Code Execution	Apr 23, 2024	HumanEvalmbpp	—Unverified	0
Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective	Apr 11, 2024	Code GenerationHumanEval	CodeCode Available	0
CYCLE: Learning to Self-Refine the Code Generation	Mar 27, 2024	Code GenerationHumanEval	CodeCode Available	1
SOEN-101: Code Generation by Emulating Software Process Models Using Large Language Model Agents	Mar 23, 2024	Code GenerationHumanEval	—Unverified	0
Software Vulnerability and Functionality Assessment using LLMs	Mar 13, 2024	Code GenerationHumanEval	—Unverified	0
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code	Mar 12, 2024	Code GenerationHumanEval	—Unverified	0
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models	Mar 11, 2024	Code GenerationHumanEval	CodeCode Available	1
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step	Feb 25, 2024	Code GenerationHumanEval	CodeCode Available	4
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement	Feb 22, 2024	Code GenerationHumanEval	CodeCode Available	5

Show:10 25 50

← PrevPage 9 of 13Next →

No leaderboard results yet.