Toward Neurosymbolic Program Comprehension Feb 3, 2025 Code Generation software testing
— Unverified 0Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities Jan 31, 2025 Code Generation Hallucination
— Unverified 0Towards Adaptive Self-Improvement for Smarter Energy Systems Jan 31, 2025 Code Generation Decision Making
— Unverified 0Analysis of LLMs vs Human Experts in Requirements Engineering Jan 31, 2025 Code Generation
— Unverified 0Cogito, ergo sum: A Neurobiologically-Inspired Cognition-Memory-Growth System for Code Generation Jan 30, 2025 Code Generation Hippocampus
Code Code Available 0Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability Jan 30, 2025 Code Generation Language Modeling
— Unverified 0Statistical multi-metric evaluation and visualization of LLM system predictive performance Jan 30, 2025 Code Generation Decision Making
— Unverified 0o3-mini vs DeepSeek-R1: Which One is Safer? Jan 30, 2025 Code Generation Program Repair
Code Code Available 1GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback Jan 29, 2025 Code Generation RAG
— Unverified 0Using Code Generation to Solve Open Instances of Combinatorial Design Problems Jan 29, 2025 Code Generation valid
Code Code Available 0Towards Making Flowchart Images Machine Interpretable Jan 29, 2025 Code Generation Optical Character Recognition (OCR)
Code Code Available 1CoCoNUT: Structural Code Understanding does not fall out of a tree Jan 27, 2025 Code Generation HumanEval
Code Code Available 0Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction Jan 27, 2025 Code Generation Inductive Bias
— Unverified 0Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Internet of Electric Vehicles Jan 26, 2025 Code Generation energy management
— Unverified 0ENTER: Event Based Interpretable Reasoning for VideoQA Jan 24, 2025 Code Generation EgoSchema
— Unverified 0Chain of Grounded Objectives: Bridging Process and Goal-oriented Prompting for Code Generation Jan 23, 2025 Code Generation
— Unverified 0Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Jan 23, 2025 Code Generation Computational Efficiency
— Unverified 0Correctness Assessment of Code Generated by Large Language Models Using Internal Representations Jan 22, 2025 Code Generation
Code Code Available 0Revisit Self-Debugging with Self-Generated Tests for Code Generation Jan 22, 2025 Code Generation
— Unverified 0QualityFlow: An Agentic Workflow for Program Synthesis Controlled by LLM Quality Checks Jan 20, 2025 Code Generation HumanEval
— Unverified 0Consolidating TinyML Lifecycle with Large Language Models: Reality, Illusion, or Opportunity? Jan 20, 2025 Code Generation Model Optimization
— Unverified 0Towards Advancing Code Generation with Large Language Models: A Research Roadmap Jan 20, 2025 Code Generation
— Unverified 0GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation Jan 19, 2025 Bug fixing Code Completion
Code Code Available 0ChaosEater: Fully Automating Chaos Engineering with Large Language Models Jan 19, 2025 Code Generation
Code Code Available 1SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs Jan 16, 2025 AI Agent Code Generation
— Unverified 0OptiChat: Bridging Optimization Models and Practitioners with Large Language Models Jan 14, 2025 Code Generation counterfactual
Code Code Available 2The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation Jan 14, 2025 Code Generation Dataset Generation
— Unverified 0CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jan 14, 2025 Code Generation
Code Code Available 1Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs Jan 14, 2025 Code Generation HumanEval
— Unverified 0Evaluating Agent-based Program Repair at Google Jan 13, 2025 Code Generation Program Repair
— Unverified 0ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Jan 11, 2025 Chart Understanding Code Generation
Code Code Available 2Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Jan 11, 2025 Code Generation HumanEval
— Unverified 0Dafny as Verification-Aware Intermediate Language for Code Generation Jan 10, 2025 Code Generation HumanEval
— Unverified 0BioAgents: Democratizing Bioinformatics Analysis with Multi-Agent Systems Jan 10, 2025 Code Generation RAG
— Unverified 0FairCoder: Evaluating Social Bias of LLMs in Code Generation Jan 9, 2025 Code Generation Fairness
Code Code Available 0Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning Jan 9, 2025 Code Generation
— Unverified 0Search-o1: Agentic Search-Enhanced Large Reasoning Models Jan 9, 2025 Code Generation
Code Code Available 5The Future of AI: Exploring the Potential of Large Concept Models Jan 8, 2025 Autonomous Vehicles Code Generation
— Unverified 0Do Code LLMs Understand Design Patterns? Jan 8, 2025 Code Generation
— Unverified 0Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation Jan 8, 2025 Code Generation Language Modeling
— Unverified 0EpiCoder: Encompassing Diversity and Complexity in Code Generation Jan 8, 2025 Code Generation Diversity
— Unverified 0Practical Design and Benchmarking of Generative AI Applications for Surgical Billing and Coding Jan 7, 2025 Benchmarking Code Generation
— Unverified 0How to Select Pre-Trained Code Models for Reuse? A Learning Perspective Jan 7, 2025 Code Generation Code Summarization
Code Code Available 0ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono Jan 7, 2025 Code Generation Computational Efficiency
— Unverified 0RTLSquad: Multi-Agent Based Interpretable RTL Design Jan 6, 2025 Code Generation
— Unverified 0CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models Jan 6, 2025 Code Generation Computational Efficiency
— Unverified 0Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text Jan 6, 2025 Code Generation In-Context Learning
Code Code Available 0ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Jan 5, 2025 Code Generation
— Unverified 0Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense Jan 5, 2025 Chatbot Code Generation
Code Code Available 0Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets Jan 5, 2025 Code Generation
— Unverified 0