SOTAVerified

Code Generation

Code Generation is an important field to predict explicit code or program structure from multimodal data sources such as incomplete code, programs in another programming language, natural language descriptions or execution examples. Code Generation tools can assist the development of automatic programming tools to improve programming productivity.

Source: Deep Learning for Source Code Modeling and Generation

Image source: Measuring Coding Challenge Competence With APPS

Papers

Showing 901950 of 1697 papers

TitleStatusHype
Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code Generation and Execution in Complex Task Handling0
ACRoBat: Optimizing Auto-batching of Dynamic Deep Learning at Compile Time0
Integrating Graphs with Large Language Models: Methods and Prospects0
Integration of a systolic array based hardware accelerator into a DNN operator auto-tuning framework0
IntelliCode Compose: Code Generation Using Transformer0
Code Retrieval for MILP Instance Generation0
Interactions with Prompt Problems: A New Way to Teach Programming with Large Language Models0
TST^R: Target Similarity Tuning Meets the Real World0
Interactive Code Generation via Test-Driven User-Intent Formalization0
Code Representation Learning At Scale0
ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment0
0/1 Deep Neural Networks via Block Coordinate Descent0
In-the-loop Hyper-Parameter Optimization for LLM-Based Automated Design of Heuristics0
CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts0
Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study0
Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards0
CodeMirage: Hallucinations in Code Generated by Large Language Models0
CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement0
Is AI the better programming partner? Human-Human Pair Programming vs. Human-AI pAIr Programming0
ISA Mapper: A Compute and Hardware Agnostic Deep Learning Compiler0
Is ChatGPT the Ultimate Programming Assistant -- How far is it?0
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study0
Turning the Tide: Repository-based Code Reflection0
Code Less, Align More: Efficient LLM Fine-tuning for Code Generation with Data Pruning0
CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks0
A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation0
Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval0
Is Programming by Example solved by LLMs?0
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval0
CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code0
A Problem-Oriented Perspective and Anchor Verification for Code Optimization0
Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback0
Iterative Self-Training for Code Generation via Reinforced Re-Ranking0
CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation0
IterPref: Focal Preference Learning for Code Generation via Iterative Debugging0
1bit-Merging: Dynamic Quantized Merging for Large Language Models0
"It's Weird That it Knows What I Want": Usability and Interactions with Copilot for Novice Programmers0
JaCoText: A Pretrained Model for Java Code-Text Generation0
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks0
Two-Stage Mesh Deep Learning for Automated Tooth Segmentation and Landmark Localization on 3D Intraoral Scans0
CodeGRAG: Bridging the Gap between Natural Language and Programming Language via Graphical Retrieval Augmented Generation0
Code Generation Tools (Almost) for Free? A Study of Few-Shot, Pre-Trained Language Models on Code0
Type-Constrained Code Generation with Language Models0
Code Generation for Unknown Libraries via Reading API Documentations0
Knowledge Graph Based Repository-Level Code Generation0
Knowledge Transfer for Pseudo-code Generation from Low Resource Programming Language0
Code Generation for High-Level Synthesis of Multiresolution Applications on FPGAs0
Kotlin ML Pack: Technical Report0
Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs0
Krikri: Advancing Open Large Language Models for Greek0
Show:102550
← PrevPage 19 of 34Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EG-CFG (DeepSeek-V3-0324)Accuracy96.6Unverified
2QualityFlow (Sonnet-3.5)Accuracy94.2Unverified
3o1-mini + MapCoder (Hamming.ai)Accuracy93.2Unverified
4MGDebugger (DeepSeek-V3-0324)Accuracy92.4Unverified
5GPT-4 + AgentCoderAccuracy91.8Unverified
6CodeSim (GPT4o)Accuracy90.7Unverified
7Jiutian-大模型Accuracy90Unverified
8GPT-3.5 Turbo (ChatGPT) + AgentCoderAccuracy89.9Unverified
9MapCoder (GPT-4o)Accuracy89.7Unverified
10GPT-4 (ChatGPT Plus)Accuracy87.5Unverified
#ModelMetricClaimedVerifiedStatus
1LPW (GPT-4o)Introductory Pass@187.2Unverified
2MoTCoder-32B-V1.5Introductory Pass@168.44Unverified
3MoTCoder-7B-V1.5Introductory Pass@154.26Unverified
4code-davinci-002 175B (CodeT)Introductory Pass@147.3Unverified
5deepseek-ai/deepseek-coder-6.7b-instructIntroductory Pass@133.8Unverified
6code-davinci-002 175BIntroductory Pass@131.92Unverified
7CodeChain+WizardCoder-15bIntroductory Pass@129.3Unverified
8WizardCoder-15bIntroductory Pass@126.29Unverified
9CodeSim (GPT4)Introductory Pass@126.04Unverified
10AlphaCode 1B Filtered from 50000Competition Pass@any22Unverified
#ModelMetricClaimedVerifiedStatus
1PanGu-Coder-FT-IBLEU44.32Unverified
2RoBERTaMarianBLEU35.74Unverified
3MarianCGBLEU34.43Unverified
4TranX + BERT w/minedBLEU34.2Unverified
5BERT + TAEBLEU33.41Unverified
6BERTMarianBLEU32.46Unverified
7External Knowledge With API + RerankingBLEU32.26Unverified
8External Knowledge With APIBLEU30.69Unverified
9BART W/ MinedBLEU30.55Unverified
10ELECTRAMarianBLEU30.18Unverified
#ModelMetricClaimedVerifiedStatus
1MarianCGAccuracy81.83Unverified
2BERT + TAEAccuracy81.03Unverified
3TranX + BERT w/minedAccuracy81.03Unverified
4RerankerAccuracy80.2Unverified
5LUKEMarianAccuracy78.5Unverified
6RoBERTaMarianAccuracy77.95Unverified
7BERTMarianAccuracy76.68Unverified
8TranxAccuracy73.7Unverified
9ELECTRAMarianAccuracy65.32Unverified
10lpn (Ling et al., 2016)Accuracy62.3Unverified
#ModelMetricClaimedVerifiedStatus
1NL2SQL-RULEExecution Accuracy89.2Unverified
2TypeSQL+TC (Yu et al., 2018)+Execution Accuracy82.6Unverified
3TranxExecution Accuracy78.6Unverified
4STAMP+RL (Sun et al., 2018)+Execution Accuracy74.6Unverified
5STAMP (Sun et al., 2018)+Execution Accuracy74.4Unverified
6TypeSQL (Yu et al., 2018)Execution Accuracy73.5Unverified
7PT-MAML (Huang et al., 2018)Execution Accuracy68Unverified
8Bidirectional Attention for SQL GenerationExecution Accuracy62.5Unverified
9Seq2SQL (Zhong et al., 2017)Execution Accuracy59.4Unverified
10Seq2Seq (Zhong et al., 2017)Execution Accuracy35.9Unverified
#ModelMetricClaimedVerifiedStatus
1QurrentOS-coder + Claude 3.5 Sonnetpass@158Unverified
2QurrentOS-coder + GPT-4opass@146Unverified
3QurrentOS-coder + GPT-4 Turbopass@137Unverified
4QurrentOS-coder + Claude 3 Opuspass@136Unverified
5QurrentOS-coder + Gemini 1.5 Propass@130Unverified
6QurrentOS-coder + GPT-4pass@130Unverified
7QurrentOS-coder + DeepSeek-Coder-V2pass@129Unverified
8QurrentOS-coder + Llama 3 70bpass@120Unverified
9QurrentOS-coder + Qwen-72B-Instructpass@118Unverified
#ModelMetricClaimedVerifiedStatus
1EG-CFG (DeepSeek-V3-0324)Test Set pass@158.18Unverified
2LPW (GPT-4o)Test Set pass@134.7Unverified
3MapCoder (GPT-4)Test Set pass@128.5Unverified
4CodeSim (GPT4)Test Set pass@128.4Unverified
5MoTCoder-15BTest Set pass@126.34Unverified
6MoTCoder-7B-v1.5Test Set pass@120.77Unverified
7CodeChain + WizardCoder-15BTest Set pass@12.35Unverified
8WizardCoder-15BTest Set pass@11.11Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSeek-R1 (MGDebugger)Pass@1100Unverified
2LLaMA 3Pass@199.4Unverified
3QualityFlow (Sonnet-3.5)Pass@198.8Unverified
4Phi-2Pass@198.2Unverified
5EG-CFG (DeepSeek-V3-0324)Pass@196.95Unverified
6Mistral 7BPass@193.9Unverified
7Claude Sonnet 3.5Pass@190.85Unverified
8L2MAC (GPT-4)Pass@190.2Unverified
#ModelMetricClaimedVerifiedStatus
1Claude 3 HaikuPass@327.67Unverified
2GPT-3.5 TurboPass@323.75Unverified
3codechat-bisonPass@311.39Unverified
4chat-bisonPass@38.48Unverified
5Mixtral-8x7B-InstructPass@38.35Unverified
6Phi-3-mini-128k-instructPass@37.18Unverified
7WizardLM-2-7BPass@33.72Unverified
8Llama-3-8B-InstructPass@33.1Unverified
#ModelMetricClaimedVerifiedStatus
1o1-previewpass@10.95Unverified
2o1-minipass@10.94Unverified
3gpt-4o-2024-08-06pass@10.89Unverified
4claude-3.5-sonnetpass@10.88Unverified
5deepseek-v2.5pass@10.83Unverified
6mistral-large-2pass@10.78Unverified
7deepseek-coder-v2-instructpass@10.7Unverified
8llama-v3p1-405b-instructpass@10.3Unverified
#ModelMetricClaimedVerifiedStatus
1BART W/ MinedBLEU35.32Unverified
2BART BaseBLEU34.35Unverified
3External Knowledge With API + RerankingBLEU20.54Unverified
4External Knowledge With APIBLEU20.37Unverified
5RerankerBLEU19.85Unverified
6TranXBLEU18.85Unverified
#ModelMetricClaimedVerifiedStatus
1claude-3-5-sonnetpass@10.68Unverified
2o1-minipass@10.67Unverified