From Reasoning to Code: GRPO Optimization for Underrepresented Languages May 20, 2025 Code Generation
— Unverified 0Text Generation Beyond Discrete Token Sampling May 20, 2025 Code Generation Mathematical Reasoning
— Unverified 0Self-Evolving Curriculum for LLM Reasoning May 20, 2025 Code Generation Policy Gradient Methods
— Unverified 0Knowledge Graph Based Repository-Level Code Generation May 20, 2025 Code Generation Code Search
— Unverified 0Cheaper, Better, Faster, Stronger: Robust Text-to-SQL without Chain-of-Thought or Fine-Tuning May 20, 2025 Code Generation Text to SQL
— Unverified 0CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation May 20, 2025 Code Generation Language Modeling
Code Code Available 2MLZero: A Multi-Agent System for End-to-end Machine Learning Automation May 20, 2025 AutoML Code Generation
Code Code Available 3CLEVER: A Curated Benchmark for Formally Verified Code Generation May 20, 2025 Code Generation Program Synthesis
Code Code Available 1Krikri: Advancing Open Large Language Models for Greek May 19, 2025 Code Generation Language Modeling
— Unverified 0EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code May 19, 2025 Code Generation
Code Code Available 1On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding May 19, 2025 Code Generation Code Translation
— Unverified 0Selective Code Generation for Functional Guarantees May 19, 2025 Code Generation Hallucination
— Unverified 0AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection May 19, 2025 Anomaly Detection Code Generation
Code Code Available 2RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models May 19, 2025 Code Generation
Code Code Available 0Understanding Complexity in VideoQA via Visual Program Generation May 19, 2025 Code Generation Question Answering
— Unverified 0AGI-Elo: How Far Are We From Mastering A Task? May 19, 2025 Code Generation Image Classification
Code Code Available 1AutoGEEval: A Multimodal and Automated Framework for Geospatial Code Generation on GEE with Large Language Models May 19, 2025 Code Generation Code Translation
— Unverified 0EVALOOP: Assessing LLM Robustness in Programming from a Self-consistency Perspective May 18, 2025 Adversarial Attack Code Generation
— Unverified 0VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation May 17, 2025 Code Generation
Code Code Available 1HALO: Hierarchical Autonomous Logic-Oriented Orchestration for Multi-Agent LLM Systems May 17, 2025 Arithmetic Reasoning Code Generation
Code Code Available 1OMAC: A Broad Optimization Framework for LLM-Based Multi-Agent Collaboration May 17, 2025 Arithmetic Reasoning Code Generation
Code Code Available 0SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation May 17, 2025 Code Generation Language Modeling
— Unverified 0VeriThoughts: Enabling Automated Verilog Code Generation using Reasoning and Formal Verification May 16, 2025 Code Generation
Code Code Available 1Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations May 16, 2025 Code Generation Mathematical Problem-Solving
— Unverified 0CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation May 15, 2025 Code Generation Language Modeling
— Unverified 0Code-Driven Planning in Grid Worlds with Large Language Models May 15, 2025 Code Generation
— Unverified 0MONAQ: Multi-Objective Neural Architecture Querying for Time-Series Analysis on Resource-Constrained Devices May 15, 2025 Activity Recognition Code Generation
Code Code Available 0Rethinking Repetition Problems of LLMs in Code Generation May 15, 2025 Code Generation HumanEval
Code Code Available 1Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models May 15, 2025 Code Generation GSM8K
— Unverified 0ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention May 15, 2025 Code Generation Language Modeling
Code Code Available 0Are Sparse Autoencoders Useful for Java Function Bug Detection? May 15, 2025 Code Generation Vulnerability Detection
Code Code Available 0Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective May 15, 2025 Code Completion Code Generation
Code Code Available 0Qwen3 Technical Report May 14, 2025 Code Generation Mathematical Reasoning
Code Code Available 13Generalizing Large Language Model Usability Across Resource-Constrained May 13, 2025 Code Generation Language Modeling
— Unverified 0Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation May 13, 2025 Code Generation In-Context Learning
— Unverified 0CAD-Coder:Text-Guided CAD Files Code Generation May 13, 2025 Code Generation
— Unverified 0Evaluating LLM Metrics Through Real-World Capabilities May 13, 2025 Code Generation Information Retrieval
— Unverified 0Agent-as-a-Service based on Agent Network May 13, 2025 Code Generation Mathematical Reasoning
— Unverified 0CodePDE: An Inference Framework for LLM-driven PDE Solver Generation May 13, 2025 Code Generation
Code Code Available 2One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models May 12, 2025 Code Generation Safety Alignment
— Unverified 0Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks May 12, 2025 Code Generation
Code Code Available 3Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding May 12, 2025 Code Generation Comment Generation
Code Code Available 0Code Retrieval for MILP Instance Generation May 11, 2025 Code Generation Retrieval
— Unverified 0RTL++: Graph-enhanced LLM for RTL Code Generation May 11, 2025 Code Generation
— Unverified 0CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts May 8, 2025 Code Completion Code Generation
— Unverified 0LLM Code Customization with Visual Results: A Benchmark on TikZ May 7, 2025 Code Generation valid
— Unverified 0A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models May 7, 2025 Chatbot Code Generation
— Unverified 0YABLoCo: Yet Another Benchmark for Long Context Code Generation May 7, 2025 Code Generation
— Unverified 0Scratch Copilot: Supporting Youth Creative Coding with AI May 6, 2025 Code Generation
— Unverified 0MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing May 6, 2025 Code Generation
— Unverified 0