Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation | Mar 3, 2025 | Code Generation, Common Sense Reasoning | Unverified 0
AI Agents for Ground-Based Gamma Astronomy | Mar 2, 2025 | Astronomy, Code Generation | Unverified 0
LLMs are everywhere: Ubiquitous Utilization of AI Models through Air Computing | Mar 2, 2025 | Code Generation, Disaster Response | Unverified 0
How Diversely Can Language Models Solve Problems? Exploring the Algorithmic Diversity of Model-Generated Code | Mar 2, 2025 | Code Generation, Diversity | Unverified 0
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development | Mar 2, 2025 | Code Generation, Program Synthesis | Code Available 1
An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs | Feb 27, 2025 | Code Generation | Unverified 0
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments | Feb 27, 2025 | Benchmarking, Code Generation | Unverified 0
Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets | Feb 27, 2025 | Code Generation, Code Search | Unverified 0
Multi-Turn Code Generation Through Single-Step Rewards | Feb 27, 2025 | Code Generation, Hierarchical Reinforcement Learning | Code Available 1
Program Synthesis Dialog Agents for Interactive Decision-Making | Feb 26, 2025 | Code Generation, Decision Making | Code Available 0
Automated Code Generation and Validation for Software Components of Microcontrollers | Feb 26, 2025 | Code Completion, Code Generation | Unverified 0
IndicEval-XL: Bridging Linguistic Diversity in Code Generation Across Indic Languages | Feb 26, 2025 | Code Generation, Diversity | Code Available 0
Deep-Bench: Deep Learning Benchmark Dataset for Code Generation | Feb 26, 2025 | Code Generation, Deep Learning | Unverified 0
CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation | Feb 26, 2025 | Benchmarking, Code Generation | Code Available 1
Conversational Planning for Personal Plans | Feb 26, 2025 | Code Generation, Text Generation | Unverified 0
Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval | Feb 26, 2025 | Benchmarking, Code Generation | Unverified 0
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Feb 26, 2025 | Code Generation, HumanEval | Code Available 2
Detection of LLM-Paraphrased Code and Identification of the Responsible LLM Using Coding Style Features | Feb 25, 2025 | Code Generation | Unverified 0
Assistance or Disruption? Exploring and Evaluating the Design and Trade-offs of Proactive AI Programming Support | Feb 25, 2025 | Code Generation | Unverified 0
CodeSwift: Accelerating LLM Inference for Efficient Code Generation | Feb 24, 2025 | Code Generation, Retrieval | Unverified 0
RewardDS: Privacy-Preserving Fine-Tuning for Large Language Models via Reward Driven Data Synthesis | Feb 23, 2025 | Code Generation, Privacy Preserving | Unverified 0
An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science | Feb 23, 2025 | Benchmarking, Code Generation | Code Available 0
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models | Feb 23, 2025 | Code Generation, HumanEval | Code Available 1
Beyond Trusting Trust: Multi-Model Validation for Robust Code Generation | Feb 22, 2025 | Code Generation | Unverified 0
Comparative Analysis of Large Language Models for Context-Aware Code Completion using SAFIM Framework | Feb 21, 2025 | Code Completion, Code Generation | Unverified 0
Mechanistic Understanding of Language Models in Syntactic Code Completion | Feb 20, 2025 | Code Completion, Code Generation | Unverified 0
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model | Feb 20, 2025 | Code Generation, Semantic Similarity | Unverified 0
Pragmatic Reasoning improves LLM Code Generation | Feb 20, 2025 | Code Generation, Reranking | Unverified 0
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | Benchmarking, Code Generation | Code Available 2
I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search | Feb 20, 2025 | AutoML, Code Generation | Code Available 1
S*: Test Time Scaling for Code Generation | Feb 20, 2025 | Code Generation, Math | Code Available 7
GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Feb 20, 2025 | Code Generation, Math | Code Available 0
BeamLoRA: Beam-Constraint Low-Rank Adaptation | Feb 19, 2025 | Code Generation, Math | Unverified 0
Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction | Feb 19, 2025 | Code Generation, graph construction | Unverified 0
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence | Feb 19, 2025 | Code Generation, Decision Making | Code Available 1
Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis | Feb 19, 2025 | Code Generation, High-Level Synthesis | Unverified 0
DataSciBench: An LLM Agent Benchmark for Data Science | Feb 19, 2025 | Code Generation, Large Language Model | Code Available 2
Hidden Darkness in LLM-Generated Designs: Exploring Dark Patterns in Ecommerce Web Components Generated by LLMs | Feb 19, 2025 | Code Generation | Unverified 0
UniGenCoder: Merging Seq2Seq and Seq2Tree Paradigms for Unified Code Generation | Feb 18, 2025 | Code Generation, Contrastive Learning | Code Available 0
Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation | Feb 18, 2025 | Code Generation | Unverified 0
Interactive Agents to Overcome Ambiguity in Software Engineering | Feb 18, 2025 | Code Generation | Code Available 0
The Role of GitHub Copilot on Software Development: A Perspective on Productivity, Security, Best Practices and Future Directions | Feb 18, 2025 | Code Generation | Unverified 0
Performance Evaluation of Large Language Models in Statistical Programming | Feb 18, 2025 | Code Generation | Code Available 0
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors | Feb 18, 2025 | Code Generation, Knowledge Tracing | Code Available 2
GSCE: A Prompt Framework with Enhanced Reasoning for Reliable LLM-driven Drone Control | Feb 18, 2025 | Code Generation | Unverified 0
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models | Feb 18, 2025 | Code Generation, General Knowledge | Unverified 0
EquiBench: Benchmarking Large Language Models' Understanding of Program Semantics via Equivalence Checking | Feb 18, 2025 | Benchmarking, Binary Classification | Unverified 0
LLM4EFFI: Leveraging Large Language Models to Enhance Code Efficiency and Correctness | Feb 17, 2025 | Code Generation | Unverified 0
ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models | Feb 17, 2025 | Code Generation, Descriptive | Code Available 0
UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance | Feb 17, 2025 | Code Generation, HumanEval | Unverified 0