RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale Jun 24, 2024 Code Generation HumanEval
Code Code Available 1INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness Jun 23, 2024 Code Generation Navigate
Code Code Available 1CityGPT: Empowering Urban Spatial Cognition of Large Language Models Jun 20, 2024 Code Generation Math
Code Code Available 1Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles Jun 18, 2024 Arithmetic Reasoning Code Generation
Code Code Available 1Long Code Arena: a Set of Benchmarks for Long-Context Code Models Jun 17, 2024 Code Completion Code Generation
Code Code Available 1On the Impacts of Contexts on Repository-Level Code Generation Jun 17, 2024 Code Generation
Code Code Available 1CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Jun 12, 2024 Code Generation
Code Code Available 1We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs Jun 12, 2024 Code Generation Hallucination
Code Code Available 1VersiCode: Towards Version-controllable Code Generation Jun 11, 2024 Code Completion Code Generation
Code Code Available 1LogiCode: an LLM-Driven Framework for Logical Anomaly Detection Jun 7, 2024 Anomaly Detection Binary Classification
Code Code Available 1Online Joint Fine-tuning of Multi-Agent Flows Jun 6, 2024 Code Generation Prompt Engineering
Code Code Available 1Re-ReST: Reflection-Reinforced Self-Training for Language Agents Jun 3, 2024 Code Generation Image Generation
Code Code Available 1SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning Jun 3, 2024 Code Completion Code Generation
Code Code Available 1DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories May 30, 2024 Code Generation
Code Code Available 1AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data May 29, 2024 Code Generation Diversity
Code Code Available 1Exploiting LLM Quantization May 28, 2024 Code Generation Quantization
Code Code Available 1ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation May 27, 2024 Code Generation HumanEval
Code Code Available 1RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects May 27, 2024 Code Generation
Code Code Available 1Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search May 24, 2024 Code Generation Language Modelling
Code Code Available 1EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization May 24, 2024 Code Generation HumanEval
Code Code Available 1MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation May 19, 2024 Code Generation HumanEval
Code Code Available 1DocuMint: Docstring Generation for Python using Small Language Models May 16, 2024 Benchmarking Code Generation
Code Code Available 1LLMs and the Future of Chip Design: Unveiling Security Risks and Building Trust May 11, 2024 Code Generation
Code Code Available 1CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification Apr 30, 2024 Code Generation Hallucination
Code Code Available 1Constrained Decoding for Secure Code Generation Apr 30, 2024 Code Generation
Code Code Available 1PECC: Problem Extraction and Coding Challenges Apr 29, 2024 Code Generation Math
Code Code Available 1AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation Apr 25, 2024 Code Generation Math
Code Code Available 1Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository Apr 22, 2024 Class-level Code Generation Code Generation
Code Code Available 1MMCode: Benchmarking Multimodal Large Language Models for Code Generation with Visually Rich Programming Problems Apr 15, 2024 Benchmarking Code Generation
Code Code Available 1Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics Apr 8, 2024 Code Generation Language Modelling
Code Code Available 1Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization Apr 2, 2024 Code Generation HumanEval
Code Code Available 1CYCLE: Learning to Self-Refine the Code Generation Mar 27, 2024 Code Generation HumanEval
Code Code Available 1Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance Mar 23, 2024 Code Generation
Code Code Available 1Can Large Language Models Solve Robot Routing? Mar 16, 2024 Code Generation Text-to-Code Generation
Code Code Available 1Automatic Generation of Python Programs Using Context-Free Grammars Mar 11, 2024 Code Generation
Code Code Available 1InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models Mar 11, 2024 Code Generation HumanEval
Code Code Available 1Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation Mar 11, 2024 Code Generation
Code Code Available 1UniSparse: An Intermediate Language for General Sparse Format Customization Mar 9, 2024 Attribute Code Generation
Code Code Available 1IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators Mar 6, 2024 Code Completion Code Generation
Code Code Available 1Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models Mar 6, 2024 Code Generation Memorization
Code Code Available 1DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation Mar 4, 2024 2k Code Generation
Code Code Available 1Open Assistant Toolkit -- version 2 Mar 1, 2024 Code Generation Response Generation
Code Code Available 1Benchmarking Data Science Agents Feb 27, 2024 Benchmarking Code Generation
Code Code Available 1HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization Feb 26, 2024 Code Generation HumanEval
Code Code Available 1Q-Probe: A Lightweight Approach to Reward Maximization for Language Models Feb 22, 2024 Code Generation Language Modeling
Code Code Available 1CodeMind: Evaluating Large Language Models for Code Reasoning Feb 15, 2024 Code Generation
Code Code Available 1DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Feb 14, 2024 Code Generation HumanEval
Code Code Available 1MPIrigen: MPI Code Generation through Domain-Specific Language Models Feb 14, 2024 Code Generation
Code Code Available 1GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants Feb 12, 2024 Code Generation Management
Code Code Available 1Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement Feb 9, 2024 Code Generation Decision Making
Code Code Available 1