LLM Code Customization with Visual Results: A Benchmark on TikZ May 7, 2025 Code Generation valid
— Unverified 0YABLoCo: Yet Another Benchmark for Long Context Code Generation May 7, 2025 Code Generation
— Unverified 0A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models May 7, 2025 Chatbot Code Generation
— Unverified 0Scratch Copilot: Supporting Youth Creative Coding with AI May 6, 2025 Code Generation
— Unverified 0STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game May 6, 2025 Action Generation Code Generation
— Unverified 0MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing May 6, 2025 Code Generation
— Unverified 0Capability-Driven Skill Generation with LLMs: A RAG-Based Approach for Reusing Existing Libraries and Interfaces May 6, 2025 Code Generation RAG
— Unverified 0AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks May 5, 2025 Code Completion Code Generation
— Unverified 0QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach May 4, 2025 Code Generation GPU
— Unverified 0CHORUS: Zero-shot Hierarchical Retrieval and Orchestration for Generating Linear Programming Code May 2, 2025 Chunking Code Generation
— Unverified 0Ensuring Reproducibility in Generative AI Systems for General Use Cases: A Framework for Regression Testing and Open Datasets May 2, 2025 Code Generation GPR
Code Code Available 0A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories May 2, 2025 Code Generation Text Generation
— Unverified 0PipeSpec: Breaking Stage Dependencies in Hierarchical LLM Decoding May 2, 2025 Code Generation Language Modeling
— Unverified 0Program Semantic Inequivalence Game with Large Language Models May 2, 2025 C++ code Code Generation
Code Code Available 0CodeFlowBench: A Multi-turn, Iterative Benchmark for Complex Code Generation Apr 30, 2025 Code Generation
Code Code Available 0Assessing LLM code generation quality through path planning tasks Apr 30, 2025 Code Generation
— Unverified 0Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges Apr 29, 2025 Code Generation Hallucination
— Unverified 0Computational Reasoning of Large Language Models Apr 29, 2025 Code Generation Language Modeling
Code Code Available 0The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models Apr 29, 2025 Code Generation
— Unverified 0CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation Apr 29, 2025 Code Generation Language Model Evaluation
— Unverified 0Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs Apr 29, 2025 Code Generation Graph Neural Network
— Unverified 0ARCS: Agentic Retrieval-Augmented Code Synthesis with Iterative Refinement Apr 29, 2025 Code Generation HumanEval
— Unverified 0SecRepoBench: Benchmarking LLMs for Secure Code Generation in Real-World Repositories Apr 29, 2025 Benchmarking Code Generation
— Unverified 0An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination Apr 28, 2025 Code Generation Hallucination
— Unverified 0CodeBC: A More Secure Large Language Model for Smart Contract Code Generation in Blockchain Apr 28, 2025 Code Generation Language Modeling
Code Code Available 0Towards Machine-Generated Code for the Resolution of User Intentions Apr 24, 2025 Code Generation
Code Code Available 0High-Fidelity And Complex Test Data Generation For Real-World SQL Code Generation Services Apr 24, 2025 Code Generation
— Unverified 0Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Apr 24, 2025 Code Generation Math
— Unverified 0On Developers' Self-Declaration of AI-Generated Code: An Analysis of Practices Apr 23, 2025 Code Generation
Code Code Available 0ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving Apr 23, 2025 Code Generation Synthetic Data Generation
— Unverified 0EduBot -- Can LLMs Solve Personalized Learning and Programming Assignments? Apr 23, 2025 Code Completion Code Generation
— Unverified 0VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation Apr 22, 2025 Code Generation
— Unverified 0A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs Apr 22, 2025 Benchmarking Class-level Code Generation
— Unverified 0Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback Apr 22, 2025 Code Generation Hallucination
— Unverified 0Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators Apr 21, 2025 Code Generation Instruction Following
Code Code Available 0Evaluating Code Generation of LLMs in Advanced Computer Science Problems Apr 21, 2025 Code Generation
— Unverified 0Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs Apr 21, 2025 Code Generation Deep Learning
— Unverified 0Towards Optimal Circuit Generation: Multi-Agent Collaboration Meets Collective Intelligence Apr 20, 2025 Code Generation Retrieval-augmented Generation
Code Code Available 0LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs Apr 20, 2025 Code Generation
Code Code Available 0ReasoningV: Efficient Verilog Code Generation with Adaptive Hybrid Reasoning Model Apr 20, 2025 Code Generation Computational Efficiency
Code Code Available 0Improving RL Exploration for LLM Reasoning through Retrospective Replay Apr 19, 2025 Code Generation Mathematical Reasoning
— Unverified 0CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation Apr 18, 2025 Code Generation
— Unverified 0Towards End-to-End Network Intent Management with Large Language Models Apr 18, 2025 Code Generation Management
— Unverified 0Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code Apr 18, 2025 Code Generation Prompt Engineering
— Unverified 0Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo Apr 17, 2025 Code Generation Probabilistic Programming
— Unverified 0Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation Apr 17, 2025 Code Generation
— Unverified 0RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins Apr 17, 2025 Code Generation
Code Code Available 0Themisto: Jupyter-Based Runtime Benchmark Apr 16, 2025 Code Generation
— Unverified 0Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading Apr 16, 2025 2k Code Generation
— Unverified 0The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance Apr 14, 2025 Code Generation Hallucination
— Unverified 0