DebugBench: Evaluating Debugging Capability of Large Language Models Jan 9, 2024 Code Generation
LLM4PLC: Harnessing Large Language Models for Verifiable Programming of PLCs in Industrial Control Systems Jan 8, 2024 Code Generation Prompt Engineering
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding Jan 5, 2024 Code Generation Decoder
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Jan 5, 2024 Arithmetic Reasoning Code Generation
TACO: Topics in Algorithmic COde generation dataset Dec 22, 2023 Code Generation
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation Dec 20, 2023 Code Generation HumanEval
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks Dec 14, 2023 Abstractive Text Summarization Code Generation
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention Nov 27, 2023 Code Generation Language Modeling
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code Nov 16, 2023 Code Generation Navigate
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Oct 12, 2023 Benchmarking Code Generation
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models Oct 11, 2023 Code Generation Image Generation
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models Oct 6, 2023 Code Generation Decision Making
GenSim: Generating Robotic Simulation Tasks via Large Language Models Oct 2, 2023 Code Generation Diversity
VerilogEval: Evaluating Large Language Models for Verilog Code Generation Sep 14, 2023 Benchmarking Code Generation
When Do Program-of-Thoughts Work for Reasoning? Aug 29, 2023 Code Generation Mathematical Reasoning
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Jul 25, 2023 Code Generation Fact Checking
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback Jun 26, 2023 Benchmarking Code Generation
Guiding Language Models of Code with Global Context using Monitors Jun 19, 2023 Code Completion Code Generation
Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning May 23, 2023 Code Generation Constituency Parsing
Autonomous GIS: the next-generation AI-powered GIS May 10, 2023 Code Generation Information Retrieval
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code Feb 10, 2023 Code Generation
A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech Feb 8, 2023 Code Generation Diversity
Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions Dec 20, 2022 Automated Theorem Proving Code Generation
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation Nov 18, 2022 Code Generation Memorization
When Language Model Meets Private Library Oct 31, 2022 Code Generation Language Modeling
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation Aug 17, 2022 Benchmarking Code Generation
Language Models Can Teach Themselves to Program Better Jul 29, 2022 Code Generation
CodeT: Code Generation with Generated Tests Jul 21, 2022 Code Generation HumanEval
DocPrompting: Generating Code by Retrieving the Docs Jul 13, 2022 Code Generation
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning Jul 5, 2022 Code Generation Decoder
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation Jun 14, 2022 Code Generation Library-Oriented Code Generation
InCoder: A Generative Model for Code Infilling and Synthesis Apr 12, 2022 Code Generation Comment Generation
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback Apr 12, 2022 Code Generation Out of Distribution (OOD) Detection
PaLM: Scaling Language Modeling with Pathways Apr 5, 2022 Auto Debugging Code Generation
Synchromesh: Reliable code generation from pre-trained language models Jan 26, 2022 Code Generation Language Modeling
Measuring Coding Challenge Competence With APPS May 20, 2021 BIG-bench Machine Learning Code Generation
FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference Jan 13, 2021 Code Generation Deep Learning
Rethinking Verification for LLM Code Generation: From Generation to Testing Jul 9, 2025 Code Generation HumanEval
CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark Jul 4, 2025 Bug fixing Code Generation
ReCode: Updating Code API Knowledge with Reinforcement Learning Jun 25, 2025 Code Generation reinforcement-learning
TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs Jun 20, 2025 Code Generation
Sampling from Your Language Model One Byte at a Time Jun 17, 2025 Code Generation Language Modeling
Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team Jun 17, 2025 Code Generation GSM8K
PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification Jun 13, 2025 Code Generation In-Context Learning
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs Jun 11, 2025 Code Generation Diagnostic
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench Jun 10, 2025 Code Generation
SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design Jun 9, 2025 Code Generation RAG
KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes Jun 6, 2025 Code Generation Data Integration
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation Jun 6, 2025 Code Generation
Training Language Models to Generate Quality Code with Program Analysis Feedback May 28, 2025 Code Generation