SOTAVerified

Code Generation

Code generation is the task of predicting explicit code or program structure from multimodal data sources such as incomplete code, programs in another programming language, natural language descriptions, or execution examples. Code generation tools can support automatic programming and improve programmer productivity.

Source: Deep Learning for Source Code Modeling and Generation

Image source: Measuring Coding Challenge Competence With APPS

Papers

Showing 1-10 of 1697 papers

Title | Status | Hype
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning | - | 0
Towards Formal Verification of LLM-Generated Code from Natural Language Prompts | - | 0
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks | - | 0
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | - | 0
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs | Code | 2
Turning the Tide: Repository-based Code Reflection | - | 0
CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance | - | 0
CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks | - | 0
Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code Understanding | Code | 9
Multilingual Multimodal Software Developer for Code Generation | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | EG-CFG (DeepSeek-V3-0324) | Accuracy | 96.6 | - | Unverified
2 | QualityFlow (Sonnet-3.5) | Accuracy | 94.2 | - | Unverified
3 | o1-mini + MapCoder (Hamming.ai) | Accuracy | 93.2 | - | Unverified
4 | MGDebugger (DeepSeek-V3-0324) | Accuracy | 92.4 | - | Unverified
5 | GPT-4 + AgentCoder | Accuracy | 91.8 | - | Unverified
6 | CodeSim (GPT4o) | Accuracy | 90.7 | - | Unverified
7 | Jiutian-大模型 | Accuracy | 90 | - | Unverified
8 | GPT-3.5 Turbo (ChatGPT) + AgentCoder | Accuracy | 89.9 | - | Unverified
9 | MapCoder (GPT-4o) | Accuracy | 89.7 | - | Unverified
10 | GPT-4 (ChatGPT Plus) | Accuracy | 87.5 | - | Unverified