Program Synthesis

Program synthesis is the process of automatically generating a program or code snippet that satisfies a given specification or set of requirements. This can include generating code from a formal specification, a natural language description, or example inputs and outputs. The primary goal of program synthesis is to minimize human intervention in the coding process, reduce errors, and improve productivity.

Program synthesis often involves the use of advanced algorithms, artificial intelligence, and machine learning techniques to search the space of possible programs that meet the given constraints. This process can be guided by a variety of techniques, such as constraint solving, symbolic execution, and genetic algorithms.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 423 papers

Title	Date	Tasks	Status	Hype	Score
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis	Mar 25, 2022	Code GenerationHumanEval	CodeCode Available	6	5
Gorilla: Large Language Model Connected with Massive APIs	May 24, 2023	HallucinationLanguage Modeling	CodeCode Available	6	5
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis	Mar 14, 2025	Program Synthesis	CodeCode Available	5	5
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages	May 3, 2023	Causal Language ModelingDecoder	CodeCode Available	5	5
Factorio Learning Environment	Mar 6, 2025	Program SynthesisSpatial Reasoning	CodeCode Available	4	5
ARC Prize 2024: Technical Report	Dec 5, 2024	ARCProgram Synthesis	CodeCode Available	3	5
Large Language Models Are Human-Level Prompt Engineers	Nov 3, 2022	Few-Shot LearningIn-Context Learning	CodeCode Available	3	5
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation	May 2, 2023	Code GenerationHumanEval	CodeCode Available	3	5
Comparison of Syntactic and Semantic Representations of Programs in Neural Embeddings	Jan 24, 2020	Program Synthesis	CodeCode Available	3	5
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning	Nov 11, 2024	ARCFew-Shot Learning	CodeCode Available	3	5
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning	Jul 5, 2022	Code GenerationDecoder	CodeCode Available	2	5
Combining Induction and Transduction for Abstract Reasoning	Nov 4, 2024	ARCProgram Synthesis	CodeCode Available	2	5
Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions	Dec 20, 2022	Automated Theorem ProvingCode Generation	CodeCode Available	2	5
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM	Mar 28, 2024	Code GenerationHumanEval	CodeCode Available	2	5
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging	Feb 8, 2025	Code GenerationHumanEval	CodeCode Available	2	5
Searching Latent Program Spaces	Nov 13, 2024	ARCProgram induction	CodeCode Available	2	5
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving	May 18, 2024	Code GenerationHumanEval	CodeCode Available	2	5
InCoder: A Generative Model for Code Infilling and Synthesis	Apr 12, 2022	Code GenerationComment Generation	CodeCode Available	2	5
Improving Code Generation by Training with Natural Language Feedback	Mar 28, 2023	Code GenerationImitation Learning	CodeCode Available	1	5
Enhancing Network Management Using Code Generated by Large Language Models	Aug 11, 2023	ManagementNatural Language Queries	CodeCode Available	1	5
Improving Molecular Design by Stochastic Iterative Target Augmentation	Feb 11, 2020	Program Synthesis	CodeCode Available	1	5
Guiding Program Synthesis by Learning to Generate Examples	May 1, 2020	Program Synthesis	CodeCode Available	1	5
Data types as a more ergonomic frontend for Grammar-Guided Genetic Programming	Oct 10, 2022	AttributeProgram Synthesis	CodeCode Available	1	5
CrossBeam: Learning to Search in Bottom-Up Program Synthesis	Mar 20, 2022	Program SynthesisStructured Prediction	CodeCode Available	1	5
Graphs, Constraints, and Search for the Abstraction and Reasoning Corpus	Oct 18, 2022	ARCBenchmarking	CodeCode Available	1	5
A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level	Dec 31, 2021	Few-Shot LearningLanguage Modelling	CodeCode Available	1	5
PoE-World: Compositional World Modeling with Products of Programmatic Experts	May 16, 2025	Montezuma's RevengeProgram Synthesis	CodeCode Available	1	5
ANPL: Towards Natural Programming with Interactive Decomposition	May 29, 2023	ARCCode Generation	CodeCode Available	1	5
How Efficient is LLM-Generated Code? A Rigorous & High-Standard Benchmark	Jun 10, 2024	HumanEvalProgram Synthesis	CodeCode Available	1	5
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning	Jun 15, 2020	Drawing PicturesProgram induction	CodeCode Available	1	5
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback	May 20, 2020	Code GenerationDiagnostic	CodeCode Available	1	5
H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark	Sep 2, 2024	ARCOut-of-Distribution Generalization	CodeCode Available	1	5
AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing	Sep 16, 2024	Code GenerationProgram Synthesis	CodeCode Available	1	5
From Examples to Rules: Neural Guided Rule Synthesis for Information Extraction	Jan 16, 2022	Enumerative SearchFew-Shot Learning	CodeCode Available	1	5
Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Construction from Human Design Sequences	Oct 5, 2020	CAD ReconstructionProgram Synthesis	CodeCode Available	1	5
Communicating Natural Programs to Humans and Machines	Jun 15, 2021	ARCProgram Synthesis	CodeCode Available	1	5
CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing	Apr 6, 2021	API Sequence RecommendationCode Comment Generation	CodeCode Available	1	5
AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications	Mar 7, 2025	Program Synthesis	CodeCode Available	1	5
CodeUpdateArena: Benchmarking Knowledge Editing on API Updates	Jul 8, 2024	Benchmarkingknowledge editing	CodeCode Available	1	5
Constrained Decoding for Fill-in-the-Middle Code Language Models via Efficient Left and Right Quotienting of Context-Sensitive Grammars	Feb 28, 2024	Program Synthesis	CodeCode Available	1	5
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search	May 24, 2024	Code GenerationLanguage Modelling	CodeCode Available	1	5
Automatic Program Synthesis of Long Programs with a Learned Garbage Collector	Sep 12, 2018	Program Synthesis	CodeCode Available	1	5
CodeScholar: Growing Idiomatic Code Examples	Dec 23, 2023	Program Synthesis	CodeCode Available	1	5
Automating the Design of Multigrid Methods with Evolutionary Program Synthesis	Dec 22, 2023	Code GenerationProgram Synthesis	CodeCode Available	1	5
Emergent Representations of Program Semantics in Language Models Trained on Programs	May 18, 2023	Inductive BiasLanguage Modelling	CodeCode Available	1	5
Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis	Jan 22, 2024	16kProgram Synthesis	CodeCode Available	1	5
Code Building Genetic Programming	Aug 9, 2020	Program Synthesis	CodeCode Available	1	5
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay	Feb 7, 2024	ARCData Augmentation	CodeCode Available	1	5
Execution-based Code Generation using Deep Reinforcement Learning	Jan 31, 2023	Code CompletionCode Generation	CodeCode Available	1	5
Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks	Jun 21, 2024	Program Synthesis	CodeCode Available	1	5

Show:10 25 50

← PrevPage 1 of 9Next →

All datasets SPoC TestP SPoC TestW AlgoLisp

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	DrRepair	Success rate @budget 100	38.5	—	Unverified
2	Multiclass localizer	Success rate @budget 100	34.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DrRepair	Success rate @budget 100	57	—	Unverified
2	Multiclass localizer	Success rate @budget 100	53.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CodeTrans-MT-TF-Small	Accuracy	90.31	—	Unverified