R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning May 27, 2025 Code Generation Reinforcement Learning (RL)
Code Code Available 1ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection May 26, 2025 Code Generation
Code Code Available 1Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation May 26, 2025 Code Generation
Code Code Available 1SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents May 26, 2025 Code Generation
Code Code Available 1Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs May 26, 2025 Code Generation Recommendation Systems
Code Code Available 1Mind the Gap: A Practical Attack on GGUF Quantization May 24, 2025 Code Generation Quantization
Code Code Available 1FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow May 23, 2025 Benchmarking Code Generation
Code Code Available 1CLEVER: A Curated Benchmark for Formally Verified Code Generation May 20, 2025 Code Generation Program Synthesis
Code Code Available 1EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code May 19, 2025 Code Generation
Code Code Available 1AGI-Elo: How Far Are We From Mastering A Task? May 19, 2025 Code Generation Image Classification
Code Code Available 1HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems May 17, 2025 Arithmetic Reasoning Code Generation
Code Code Available 1VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation May 17, 2025 Code Generation
Code Code Available 1VeriThoughts: Enabling Automated Verilog Code Generation using Reasoning and Formal Verification May 16, 2025 Code Generation
Code Code Available 1Rethinking Repetition Problems of LLMs in Code Generation May 15, 2025 Code Generation HumanEval
Code Code Available 1Rewriting Pre-Training Data Boosts LLM Performance in Math and Code May 5, 2025 Code Generation GSM8K
Code Code Available 1OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Apr 29, 2025 Benchmarking Code Generation
Code Code Available 1Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding Apr 29, 2025 Code Generation Density Estimation
Code Code Available 1AutoP2C: An LLM-Based Agent Framework for Code Repository Generation from Multimodal Content in Academic Papers Apr 28, 2025 Code Generation
Code Code Available 1ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development Apr 27, 2025 Code Generation Domain Adaptation
Code Code Available 1Data-efficient LLM Fine-tuning for Code Generation Apr 17, 2025 Code Generation GPU
Code Code Available 1A Dual-Space Framework for General Knowledge Distillation of Large Language Models Apr 15, 2025 Code Generation General Knowledge
Code Code Available 1TuRTLe: A Unified Evaluation of LLMs for RTL Generation Mar 31, 2025 Code Generation
Code Code Available 1MaintainCoder: Maintainable Code Generation Under Dynamic Requirements Mar 31, 2025 Code Generation
Code Code Available 1BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity? Mar 19, 2025 Code Generation
Code Code Available 1ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation Mar 10, 2025 Code Generation
Code Code Available 1RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing Mar 10, 2025 Code Generation HumanEval
Code Code Available 1DependEval: Benchmarking LLMs for Repository Dependency Understanding Mar 9, 2025 Benchmarking Code Generation
Code Code Available 1GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development Mar 2, 2025 Code Generation Program Synthesis
Code Code Available 1Multi-Turn Code Generation Through Single-Step Rewards Feb 27, 2025 Code Generation Hierarchical Reinforcement Learning
Code Code Available 1CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation Feb 26, 2025 Benchmarking Code Generation
Code Code Available 1CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Feb 23, 2025 Code Generation HumanEval
Code Code Available 1I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search Feb 20, 2025 AutoML Code Generation
Code Code Available 1AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence Feb 19, 2025 Code Generation Decision Making
Code Code Available 1Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL Feb 17, 2025 Code Generation Math
Code Code Available 1Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities Feb 17, 2025 Code Generation HumanEval
Code Code Available 1Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping Feb 16, 2025 Code Generation Instruction Following
Code Code Available 1BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Feb 11, 2025 Code Generation Instruction Following
Code Code Available 1nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow Feb 7, 2025 Code Generation Code Translation
Code Code Available 1C codegen considered unnecessary: go directly to binary, do not pass C. Compilation of Julia code for deployment in model-based engineering Feb 3, 2025 C++ code Code Generation
Code Code Available 1o3-mini vs DeepSeek-R1: Which One is Safer? Jan 30, 2025 Code Generation Program Repair
Code Code Available 1Towards Making Flowchart Images Machine Interpretable Jan 29, 2025 Code Generation Optical Character Recognition (OCR)
Code Code Available 1ChaosEater: Fully Automating Chaos Engineering with Large Language Models Jan 19, 2025 Code Generation
Code Code Available 1CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jan 14, 2025 Code Generation
Code Code Available 1Effective LLM-Driven Code Generation with Pythoness Jan 3, 2025 Code Generation
Code Code Available 1Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense Dec 30, 2024 Cloud Computing Code Generation
Code Code Available 1HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Dec 30, 2024 Code Generation HumanEval
Code Code Available 1AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation Dec 24, 2024 Code Generation
Code Code Available 1Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation Dec 19, 2024 Code Generation
Code Code Available 1ChainStream: An LLM-based Framework for Unified Synthetic Sensing Dec 13, 2024 Code Generation
Code Code Available 1Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark Dec 3, 2024 Code Generation Diversity
Code Code Available 1