Capability-Driven Skill Generation with LLMs: A RAG-Based Approach for Reusing Existing Libraries and Interfaces May 6, 2025 Code Generation RAG
— Unverified 0STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game May 6, 2025 Action Generation Code Generation
— Unverified 0AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks May 5, 2025 Code Completion Code Generation
— Unverified 0Rewriting Pre-Training Data Boosts LLM Performance in Math and Code May 5, 2025 Code Generation GSM8K
Code Code Available 1QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach May 4, 2025 Code Generation GPU
— Unverified 0Program Semantic Inequivalence Game with Large Language Models May 2, 2025 C++ code Code Generation
Code Code Available 0PipeSpec: Breaking Stage Dependencies in Hierarchical LLM Decoding May 2, 2025 Code Generation Language Modeling
— Unverified 0CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code May 2, 2025 Chunking Code Generation
— Unverified 0Ensuring Reproducibility in Generative AI Systems for General Use Cases: A Framework for Regression Testing and Open Datasets May 2, 2025 Code Generation GPR
Code Code Available 0A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories May 2, 2025 Code Generation Text Generation
— Unverified 0Assessing LLM code generation quality through path planning tasks Apr 30, 2025 Code Generation
— Unverified 0CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation Apr 30, 2025 Code Generation
Code Code Available 0CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation Apr 29, 2025 Code Generation Language Model Evaluation
— Unverified 0Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges Apr 29, 2025 Code Generation Hallucination
— Unverified 0OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Apr 29, 2025 Benchmarking Code Generation
Code Code Available 1Computational Reasoning of Large Language Models Apr 29, 2025 Code Generation Language Modeling
Code Code Available 0Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs Apr 29, 2025 Code Generation Graph Neural Network
— Unverified 0ARCS: Agentic Retrieval-Augmented Code Synthesis with Iterative Refinement Apr 29, 2025 Code Generation HumanEval
— Unverified 0Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding Apr 29, 2025 Code Generation Density Estimation
Code Code Available 1The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models Apr 29, 2025 Code Generation
— Unverified 0SecRepoBench: Benchmarking LLMs for Secure Code Generation in Real-World Repositories Apr 29, 2025 Benchmarking Code Generation
— Unverified 0AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers Apr 28, 2025 Code Generation
Code Code Available 1An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination Apr 28, 2025 Code Generation Hallucination
— Unverified 0CodeBC: A More Secure Large Language Model for Smart Contract Code Generation in Blockchain Apr 28, 2025 Code Generation Language Modeling
Code Code Available 0ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development Apr 27, 2025 Code Generation Domain Adaptation
Code Code Available 1Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Apr 24, 2025 Code Generation Math
— Unverified 0Towards Machine-Generated Code for the Resolution of User Intentions Apr 24, 2025 Code Generation
Code Code Available 0Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Apr 24, 2025 Code Generation
Code Code Available 7High-Fidelity And Complex Test Data Generation For Real-World SQL Code Generation Services Apr 24, 2025 Code Generation
— Unverified 0On Developers' Self-Declaration of AI-Generated Code: An Analysis of Practices Apr 23, 2025 Code Generation
Code Code Available 0EduBot -- Can LLMs Solve Personalized Learning and Programming Assignments? Apr 23, 2025 Code Completion Code Generation
— Unverified 0ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving Apr 23, 2025 Code Generation Synthetic Data Generation
— Unverified 0A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs Apr 22, 2025 Benchmarking Class-level Code Generation
— Unverified 0VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation Apr 22, 2025 Code Generation
— Unverified 0Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback Apr 22, 2025 Code Generation Hallucination
— Unverified 0Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators Apr 21, 2025 Code Generation Instruction Following
Code Code Available 0Evaluating Code Generation of LLMs in Advanced Computer Science Problems Apr 21, 2025 Code Generation
— Unverified 0Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs Apr 21, 2025 Code Generation Deep Learning
— Unverified 0LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Apr 20, 2025 Code Generation
Code Code Available 0Towards Optimal Circuit Generation: Multi-Agent Collaboration Meets Collective Intelligence Apr 20, 2025 Code Generation Retrieval-augmented Generation
Code Code Available 0ReasoningV: Efficient Verilog Code Generation with Adaptive Hybrid Reasoning Model Apr 20, 2025 Code Generation Computational Efficiency
Code Code Available 0Improving RL Exploration for LLM Reasoning through Retrospective Replay Apr 19, 2025 Code Generation Mathematical Reasoning
— Unverified 0Towards End-to-End Network Intent Management with Large Language Models Apr 18, 2025 Code Generation Management
— Unverified 0Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code Apr 18, 2025 Code Generation Prompt Engineering
— Unverified 0CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation Apr 18, 2025 Code Generation
— Unverified 0Chinese-Vicuna: A Chinese Instruction-following Llama-based Model Apr 17, 2025 Code Generation CPU
Code Code Available 7Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo Apr 17, 2025 Code Generation Probabilistic Programming
— Unverified 0RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Apr 17, 2025 Code Generation
Code Code Available 0Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation Apr 17, 2025 Code Generation
— Unverified 0Data-efficient LLM Fine-tuning for Code Generation Apr 17, 2025 Code Generation GPU
Code Code Available 1