GiFT: Gibbs Fine-Tuning for Code Generation | Feb 17, 2025 | Code Generation, valid | Code Available | 0
Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL | Feb 17, 2025 | Code Generation, Math | Code Available | 1
UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance | Feb 17, 2025 | Code Generation, HumanEval | Unverified | 0
Performance Review on LLM for solving leetcode problems | Feb 16, 2025 | Code Generation | Unverified | 0
An Interpretable Automated Mechanism Design Framework with Large Language Models | Feb 16, 2025 | Code Generation | Unverified | 0
Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping | Feb 16, 2025 | Code Generation, Instruction Following | Code Available | 1
SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors | Feb 16, 2025 | Code Generation | Code Available | 0
Diversified Sampling Improves Scaling LLM inference | Feb 16, 2025 | Code Generation, Diversity | Unverified | 0
Automated Visualization Code Synthesis via Multi-Path Reasoning and Feedback-Driven Optimization | Feb 16, 2025 | Code Generation, Data Visualization | Unverified | 0
CoCoEvo: Co-Evolution of Programs and Test Cases to Enhance Code Generation | Feb 15, 2025 | Code Generation | Code Available | 0
1bit-Merging: Dynamic Quantized Merging for Large Language Models | Feb 15, 2025 | Code Generation, Math | Unverified | 0
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation | Feb 13, 2025 | Code Generation | Unverified | 0
CRANE: Reasoning with constrained LLM generation | Feb 13, 2025 | Code Generation, Math | Unverified | 0
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Feb 13, 2025 | Code Generation, Scene Understanding | Unverified | 0
Enhancing LLM Character-Level Manipulation via Divide and Conquer | Feb 12, 2025 | Code Generation | Unverified | 0
From PowerPoint UI Sketches to Web-Based Applications: Pattern-Driven Code Generation for GIS Dashboard Development Using Knowledge-Augmented LLMs, Context-Aware Visual Prompting, and the React Framework | Feb 12, 2025 | Code Generation, RAG | Unverified | 0
Bridging LLM-Generated Code and Requirements: Reverse Generation technique and SBC Metric for Developer Insights | Feb 11, 2025 | Code Generation, Semantic Similarity | Code Available | 0
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models | Feb 11, 2025 | Code Generation, Instruction Following | Code Available | 1
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction | Feb 11, 2025 | Code Generation, Math | Code Available | 4
Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning | Feb 11, 2025 | Code Generation, Math | Code Available | 0
Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK | Feb 11, 2025 | Code Generation | Unverified | 0
What I cannot execute, I do not understand: Training and Evaluating LLMs on Program Execution Traces | Feb 10, 2025 | Code Generation, mbpp | Unverified | 0
LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks | Feb 10, 2025 | Code Generation, Program Repair | Unverified | 0
Cardiverse: Harnessing LLMs for Novel Card Game Prototyping | Feb 10, 2025 | Card Games, Code Generation | Unverified | 0
SnipGen: A Mining Repository Framework for Evaluating LLMs for Code | Feb 10, 2025 | Code Generation, Prompt Engineering | Unverified | 0
Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering | Feb 10, 2025 | Code Generation, Code Summarization | Unverified | 0
Benchmarking Prompt Engineering Techniques for Secure Code Generation with GPT Models | Feb 9, 2025 | Benchmarking, Code Generation | Unverified | 0
Mitigating Sensitive Information Leakage in LLMs4Code through Machine Unlearning | Feb 9, 2025 | Code Generation, Machine Unlearning | Unverified | 0
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging | Feb 8, 2025 | Code Generation, HumanEval | Code Available | 2
Proving the Coding Interview: A Benchmark for Formally Verified Code Generation | Feb 8, 2025 | Automated Theorem Proving, Code Generation | Unverified | 0
nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow | Feb 7, 2025 | Code Generation, Code Translation | Code Available | 1
CodeSCM: Causal Analysis for Multi-Modal Code Generation | Feb 7, 2025 | Code Generation | Code Available | 0
Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning | Feb 7, 2025 | Code Generation, Language Modeling | Unverified | 0
Optimistic Gradient Learning with Hessian Corrections for High-Dimensional Black-Box Optimization | Feb 7, 2025 | Code Generation | Unverified | 0
Teaching Language Models to Critique via Reinforcement Learning | Feb 5, 2025 | Code Generation, reinforcement-learning | Unverified | 0
LLMs can be easily Confused by Instructional Distractions | Feb 5, 2025 | Bias Detection, Code Generation | Unverified | 0
Large Language Model Guided Self-Debugging Code Generation | Feb 5, 2025 | Code Generation, Computational Efficiency | Unverified | 0
Path Planning for Masked Diffusion Model Sampling | Feb 5, 2025 | Code Generation, In-Context Learning | Unverified | 0
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance | Feb 4, 2025 | Code Generation, Text Generation | Code Available | 2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code Generation, Language Modeling | Code Available | 2
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information | Feb 4, 2025 | Code Generation, Form | Code Available | 0
The Elicitation Game: Evaluating Capability Elicitation Techniques | Feb 4, 2025 | Code Generation | Code Available | 0
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic Reasoning, Code Generation | Unverified | 0
PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback | Feb 3, 2025 | Code Generation, Data Visualization | Unverified | 0
Toward Neurosymbolic Program Comprehension | Feb 3, 2025 | Code Generation, software testing | Unverified | 0
Analysis of Student-LLM Interaction in a Software Engineering Project | Feb 3, 2025 | Code Generation, Code Summarization | Unverified | 0
Next Steps in LLM-Supported Java Verification | Feb 3, 2025 | Code Generation | Unverified | 0
Security and Quality in LLM-Generated Code: A Multi-Language, Multi-Model Analysis | Feb 3, 2025 | Code Generation | Unverified | 0
SE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering | Feb 3, 2025 | Benchmarking, Code Generation | Unverified | 0
C codegen considered unnecessary: go directly to binary, do not pass C. Compilation of Julia code for deployment in model-based engineering | Feb 3, 2025 | C++ code, Code Generation | Code Available | 1