SOTAVerified

Program Synthesis

Program synthesis is the process of automatically generating a program or code snippet that satisfies a given specification or set of requirements. This can include generating code from a formal specification, a natural language description, or example inputs and outputs. The primary goal of program synthesis is to minimize human intervention in the coding process, reduce errors, and improve productivity.

Program synthesis often involves the use of advanced algorithms, artificial intelligence, and machine learning techniques to search the space of possible programs that meet the given constraints. This process can be guided by a variety of techniques, such as constraint solving, symbolic execution, and genetic algorithms.

Papers

Showing 151200 of 423 papers

TitleStatusHype
AbstractBeam: Enhancing Bottom-Up Program Synthesis using Library Learning0
RLSF: Reinforcement Learning via Symbolic Feedback0
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff0
Learning to Reason via Program Generation, Emulation, and SearchCode0
HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis0
Analogical proportions II0
Finding structure in logographic writing with library learning0
Program Synthesis using Inductive Logic Programming for the Abstraction and Reasoning Corpus0
Large Language Models Synergize with Automated Machine LearningCode0
PhilHumans: Benchmarking Machine Learning for Personal Health0
Can humans teach machines to code?0
BANSAI: Towards Bridging the AI Adoption Gap in Industrial Robotics with Neurosymbolic Programming0
Fewer Truncations Improve Language Modeling0
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement0
SYNAPSE: SYmbolic Neural-Aided Preference Synthesis Engine0
WatChat: Explaining perplexing programs by debugging mental modelsCode0
Guiding Enumerative Program Synthesis with Large Language Models0
Semi-Instruct: Bridging Natural-Instruct and Self-Instruct for Code Large Language Models0
Procedural Adherence and Interpretability Through Neuro-Symbolic Generative Agents0
Origami: (un)folding the abstraction of recursion schemes for program synthesis0
HumanEval on Latest GPT Models -- 2024Code0
LTL learning on GPUsCode0
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment0
SwissNYF: Tool Grounded LLM Agents for Black Box SettingCode0
Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases0
Runtime phylogenetic analysis enables extreme subsampling for test-based problems0
Learning logic programs by finding minimal unsatisfiable subprograms0
DALex: Lexicase-like Selection via Diverse AggregationCode0
3D-PreMise: Can Large Language Models Generate 3D Shapes with Sharp Features and Parametric Control?0
LLM4TDD: Best Practices for Test Driven Development Using Large Language Models0
Program Machine Policy: Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines0
Function-constrained Program Synthesis0
Coffee: Boost Your Code LLMs by Fixing Bugs with FeedbackCode0
BizBench: A Quantitative Reasoning Benchmark for Business and Finance0
Generating Pragmatic Examples to Train Neural Program SynthesizersCode0
LLM for SoC Security: A Paradigm Shift0
Logic-Q: Improving Deep Reinforcement Learning-based Quantitative Trading via Program Sketch-based Tuning0
The Program Testing Ability of Large Language Models for Code0
mlirSynth: Automatic, Retargetable Program Raising in Multi-Level IR using Program Synthesis0
Program Synthesis with Best-First Bottom-Up Search0
B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis0
Learning of Generalizable and Interpretable Knowledge in Grid-Based Reinforcement Learning EnvironmentsCode0
Amortizing Pragmatic Program Synthesis with RankingsCode0
Learning MDL logic programs from noisy dataCode0
Learning logic programs by discovering higher-order abstractionsCode0
Correct and Optimal: the Regular Expression Inference Challenge0
Demonstration of CORNET: A System For Learning Spreadsheet Formatting Rules By Example0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and RetrievalCode0
ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis0
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DrRepairSuccess rate @budget 10038.5Unverified
2Multiclass localizerSuccess rate @budget 10034.2Unverified
#ModelMetricClaimedVerifiedStatus
1DrRepairSuccess rate @budget 10057Unverified
2Multiclass localizerSuccess rate @budget 10053.7Unverified
#ModelMetricClaimedVerifiedStatus
1CodeTrans-MT-TF-SmallAccuracy90.31Unverified