Learning to Reason without External Rewards | May 26, 2025 | Code Generation; reinforcement-learning
ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation | May 24, 2025 | Benchmarking; Chart Understanding
MLZero: A Multi-Agent System for End-to-end Machine Learning Automation | May 20, 2025 | AutoML; Code Generation
Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks | May 12, 2025 | Code Generation
Efficiently Serving LLM Reasoning Programs with Certaindex | Dec 30, 2024 | Code Generation; Mathematical Problem-Solving
Large Language Model-Brained GUI Agents: A Survey | Nov 27, 2024 | Code Generation; Language Modeling
SuffixDecoding: Extreme Speculative Decoding for Emerging AI Applications | Nov 7, 2024 | Code Generation; Language Modeling
AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Nov 4, 2024 | Code Generation; Video Editing
SelfCodeAlign: Self-Alignment for Code Generation | Oct 31, 2024 | Code Generation; HumanEval
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation | Oct 4, 2024 | 16k; Code Generation
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph | Oct 3, 2024 | Code Generation
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale | Sep 9, 2024 | Code Generation; Fault localization
Revisiting VerilogEval: A Year of Improvements in Large-Language Models for Hardware Code Generation | Aug 20, 2024 | Code Completion; Code Generation
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents | Jul 26, 2024 | Benchmarking; Code Generation
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology | Jun 16, 2024 | Code Generation
Advancing LLM Reasoning Generalists with Preference Trees | Apr 2, 2024 | Benchmarking; Code Generation
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Mar 8, 2024 | 1 Image, 2*2 Stitching; Code Generation
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Mar 8, 2024 | Code Generation; Hallucination
SynCode: LLM Generation with Grammar Augmentation | Mar 3, 2024 | Code Generation; valid
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning | Feb 27, 2024 | Code Generation
SpikingJelly: An open-source machine learning infrastructure platform for spike-based intelligence | Oct 25, 2023 | Code Generation
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition | Oct 9, 2023 | Code Generation; Instruction Following
OctoPack: Instruction Tuning Code Large Language Models | Aug 14, 2023 | Code Generation; Code Repair
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation | May 2, 2023 | Code Generation; HumanEval
ViperGPT: Visual Inference via Python Execution for Reasoning | Mar 14, 2023 | Code Generation; Video Question Answering
Prompting Is Programming: A Query Language for Large Language Models | Dec 12, 2022 | Code Generation; Language Modeling
SymForce: Symbolic Computation and Code Generation for Robotics | Apr 17, 2022 | Code Generation; Math
Evaluating Large Language Models Trained on Code | Jul 7, 2021 | Code Generation; HumanEval
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs | Jul 15, 2025 | Code Generation; Safety Alignment
Language Modeling by Language Models | Jun 25, 2025 | Code Generation; Language Modeling
cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree | Jun 18, 2025 | Chunking; Code Generation
Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification | Jun 17, 2025 | Code Generation
Humanity's Last Code Exam: Can Advanced LLMs Conquer Human's Hardest Code Competition? | Jun 15, 2025 | Code Generation
Execution Guided Line-by-Line Code Generation | Jun 12, 2025 | Code Generation
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science | Jun 12, 2025 | Code Generation; Large Language Model
VERINA: Benchmarking Verifiable Code Generation | May 29, 2025 | Benchmarking; Code Generation
dKV-Cache: The Cache for Diffusion Language Models | May 21, 2025 | Code Generation; Denoising
CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | May 20, 2025 | Code Generation; Language Modeling
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection | May 19, 2025 | Anomaly Detection; Code Generation
CodePDE: An Inference Framework for LLM-driven PDE Solver Generation | May 13, 2025 | Code Generation
LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation | Apr 10, 2025 | Code Generation; Continual Learning
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs | Mar 10, 2025 | Code Generation; Instruction Following
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Feb 26, 2025 | Code Generation; HumanEval
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | Benchmarking; Code Generation
DataSciBench: An LLM Agent Benchmark for Data Science | Feb 19, 2025 | Code Generation; Large Language Model
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors | Feb 18, 2025 | Code Generation; Knowledge Tracing
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging | Feb 8, 2025 | Code Generation; HumanEval
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Feb 4, 2025 | Code Generation; Language Modeling
CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance | Feb 4, 2025 | Code Generation; Text Generation
OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Jan 14, 2025 | Code Generation; counterfactual