Program Synthesis

Program synthesis is the automatic generation of a program or code snippet that satisfies a given specification. The specification may be a formal specification, a natural-language description, or a set of example inputs and outputs. The primary goals of program synthesis are to minimize human intervention in the coding process, reduce errors, and improve productivity.

Program synthesis is often framed as a search over the space of possible programs for one that meets the given constraints. The search can be guided by techniques such as constraint solving, symbolic execution, and genetic algorithms, and increasingly by machine learning models.
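As a minimal sketch of searching a program space from example inputs and outputs, the snippet below enumerates expressions bottom-up over a hypothetical toy DSL of integer expressions in one variable `x`. The names `LEAVES`, `BINARY_OPS`, and `synthesize` are assumptions made for illustration and do not come from any paper listed on this page. Candidates that behave identically on the example inputs are kept only once, a pruning strategy often called observational equivalence.

```python
from itertools import product

# Hypothetical toy DSL (an assumption for illustration): integer
# expressions built from the input x, two constants, and three operators.
LEAVES = {
    "x": lambda x: x,
    "1": lambda x: 1,
    "2": lambda x: 2,
}
BINARY_OPS = {
    "+": lambda a, b: a + b,
    "*": lambda a, b: a * b,
    "-": lambda a, b: a - b,
}


def synthesize(examples, max_rounds=2):
    """Return an expression string consistent with every (input, output)
    pair in `examples`, or None if none is found within `max_rounds`."""
    inputs = [i for i, _ in examples]
    target = tuple(o for _, o in examples)

    # Map each output signature (values on the example inputs) to one
    # expression that produces it; equivalent candidates are stored once.
    seen = {}
    for name, fn in LEAVES.items():
        seen.setdefault(tuple(fn(i) for i in inputs), name)
    if target in seen:
        return seen[target]

    for _ in range(max_rounds):
        new = {}
        # Combine every pair of known expressions with every operator.
        for (sig_a, expr_a), (sig_b, expr_b) in product(list(seen.items()), repeat=2):
            for op, fn in BINARY_OPS.items():
                sig = tuple(fn(a, b) for a, b in zip(sig_a, sig_b))
                expr = f"({expr_a} {op} {expr_b})"
                if sig == target:
                    return expr
                if sig not in seen and sig not in new:
                    new[sig] = expr
        seen.update(new)
    return None


if __name__ == "__main__":
    # Examples consistent with f(x) = 2x + 1.
    print(synthesize([(1, 3), (2, 5), (3, 7)]))
```

The returned expression is only guaranteed to agree with the examples supplied; more examples, or a formal specification checked by a solver, would be needed to rule out programs that merely coincide on those inputs. The search space grows quickly with each round, which is why `max_rounds` is kept small here.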

Papers

Showing 1–50 of 423 papers

Title	Date	Tasks	Status	Hype
CoRE: Enhancing Metacognition with Label-free Self-evaluation in LRMs	Jul 8, 2025	GSM8K, Math	Unverified	0
Structured Program Synthesis using LLMs: Results and Insights from the IPARC Challenge	Jun 15, 2025	ARC, Code Generation	Unverified	0
Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences	Jun 4, 2025	Blocking, parameter-efficient fine-tuning	Unverified	0
Chemical classification program synthesis using generative artificial intelligence	May 24, 2025	Classification, Drug Discovery	Unverified	0
Transductively Informed Inductive Program Synthesis	May 20, 2025	Program Synthesis	Code Available	0
CLEVER: A Curated Benchmark for Formally Verified Code Generation	May 20, 2025	Code Generation, Program Synthesis	Code Available	1
PoE-World: Compositional World Modeling with Products of Programmatic Experts	May 16, 2025	Montezuma's Revenge, Program Synthesis	Code Available	1
Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding	May 12, 2025	Code Generation, Comment Generation	Code Available	0
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code	May 5, 2025	Code Generation, GSM8K	Code Available	1
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach	May 4, 2025	Code Generation, GPU	Unverified	0
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Apr 29, 2025	Benchmarking, Code Generation	Code Available	1
GPU accelerated program synthesis: Enumerate semantics, not syntax!	Apr 26, 2025	CPU, GPU	Unverified	0
TinyverseGP: Towards a Modular Cross-domain Benchmarking Framework for Genetic Programming	Apr 14, 2025	Benchmarking, Program Synthesis	Code Available	1
DeQompile: quantum circuit decompilation using genetic programming for explainable quantum architecture search	Apr 11, 2025	Program Synthesis, Quantum Machine Learning	Unverified	0
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis	Mar 29, 2025	Benchmarking, Large Language Model	Unverified	0
Synthesizing world models for bilevel planning	Mar 26, 2025	Large Language Model, Program Synthesis	Unverified	0
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis	Mar 14, 2025	Program Synthesis	Code Available	5
Research Vision: Multi-Agent Path Planning for Cops And Robbers Via Reactive Synthesis	Mar 14, 2025	Program Synthesis	Unverified	0
Shedding Light in Task Decomposition in Program Synthesis: The Driving Force of the Synthesizer Model	Mar 11, 2025	Program Synthesis	Unverified	0
Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models	Mar 10, 2025	HumanEval, Program Synthesis	Unverified	0
An Empirical Comparison of Cost Functions in Inductive Logic Programming	Mar 10, 2025	Inductive logic programming, Program Synthesis	Unverified	0
Machine Learning meets Algebraic Combinatorics: A Suite of Datasets Capturing Research-level Conjecturing Ability in Pure Mathematics	Mar 9, 2025	Abstract Algebra, Program Synthesis	Unverified	0
AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications	Mar 7, 2025	Program Synthesis	Code Available	1
Factorio Learning Environment	Mar 6, 2025	Program Synthesis, Spatial Reasoning	Code Available	4
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development	Mar 2, 2025	Code Generation, Program Synthesis	Code Available	1
Program Synthesis Dialog Agents for Interactive Decision-Making	Feb 26, 2025	Code Generation, Decision Making	Code Available	0
Visual Agentic AI for Spatial Reasoning with a Dynamic API	Feb 10, 2025	Program Synthesis, Spatial Reasoning	Unverified	0
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging	Feb 8, 2025	Code Generation, HumanEval	Code Available	2
Proving the Coding Interview: A Benchmark for Formally Verified Code Generation	Feb 8, 2025	Automated Theorem Proving, Code Generation	Unverified	0
Learning Semantics-aware Search Operators for Genetic Programming	Feb 6, 2025	Graph Neural Network, Program Synthesis	Unverified	0
QualityFlow: An Agentic Workflow for Program Synthesis Controlled by LLM Quality Checks	Jan 20, 2025	Code Generation, HumanEval	Unverified	0
AlgoPilot: Fully Autonomous Program Synthesis Without Human-Written Programs	Jan 11, 2025	Language Modeling, Language Modelling	Unverified	0
Online Prompt Selection for Program Synthesis	Jan 9, 2025	Program Synthesis	Unverified	0
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks	Dec 24, 2024	Program Synthesis	Unverified	0
EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis	Dec 23, 2024	Program Synthesis	Unverified	0
Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis	Dec 11, 2024	Program Synthesis	Unverified	0
ConceptSearch: Towards Efficient Program Search Using LLMs for Abstraction and Reasoning Corpus (ARC)	Dec 10, 2024	ARC, Few-Shot Learning	Code Available	0
ARC Prize 2024: Technical Report	Dec 5, 2024	ARC, Program Synthesis	Code Available	3
From Code to Play: Benchmarking Program Search for Games Using Large Language Models	Dec 5, 2024	Atari Games, Benchmarking	Unverified	0
Searching Latent Program Spaces	Nov 13, 2024	ARC, Program induction	Code Available	2
LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models	Nov 12, 2024	Friction, Program Synthesis	Unverified	0
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning	Nov 11, 2024	ARC, Few-Shot Learning	Code Available	3
Combining Induction and Transduction for Abstract Reasoning	Nov 4, 2024	ARC, Program Synthesis	Code Available	2
Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware	Oct 31, 2024	GPU, Program Synthesis	Code Available	0
System 2 Reasoning via Generality and Adaptation	Oct 10, 2024	Meta-Learning, Program Synthesis	Unverified	0
Mitigating Gender Bias in Code Large Language Models via Model Editing	Oct 10, 2024	Code Generation, knowledge editing	Unverified	0
Tackling the Abstraction and Reasoning Corpus with Vision Transformers: the Importance of 2D Representation, Positions, and Objects	Oct 8, 2024	ARC, Program Synthesis	Code Available	1
Can LLMs plan paths with extra hints from solvers?	Oct 7, 2024	Mathematical Problem-Solving, Program Synthesis	Unverified	0
Learning to Solve Abstract Reasoning Problems with Neurosymbolic Program Synthesis and Task Generation	Oct 6, 2024	Feature Engineering, Program Synthesis	Unverified	0
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions	Oct 3, 2024	Code Generation, Dialogue Generation	Code Available	1

Benchmark Results

SPoC TestP

#	Model	Metric	Claimed	Verified	Status
1	DrRepair	Success rate @budget 100	38.5	—	Unverified
2	Multiclass localizer	Success rate @budget 100	34.2	—	Unverified

SPoC TestW

#	Model	Metric	Claimed	Verified	Status
1	DrRepair	Success rate @budget 100	57	—	Unverified
2	Multiclass localizer	Success rate @budget 100	53.7	—	Unverified

AlgoLisp

#	Model	Metric	Claimed	Verified	Status
1	CodeTrans-MT-TF-Small	Accuracy	90.31	—	Unverified