| Accelerating Large Language Model Reasoning via Speculative Search | May 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Semantic Intelligence: Integrating GPT-4 with A Planning in Low-Cost Robotics | May 3, 2025 | Large Language ModelRobot Navigation | —Unverified | 0 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | May 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | May 2, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | May 2, 2025 | Dataset GenerationLanguage Modeling | —Unverified | 0 |
| PipeSpec: Breaking Stage Dependencies in Hierarchical LLM Decoding | May 2, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students | May 2, 2025 | GSM8KIn-Context Learning | CodeCode Available | 0 |
| LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps | May 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Preserving Privacy and Utility in LLM-Based Product Recommendations | May 2, 2025 | Collaborative FilteringLarge Language Model | —Unverified | 0 |
| Patchwork: A Unified Framework for RAG Serving | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Red Teaming Large Language Models for Healthcare | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models | May 1, 2025 | Data VisualizationLanguage Modeling | —Unverified | 0 |
| UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models | May 1, 2025 | Large Language Model | CodeCode Available | 3 |
| Urban Air Mobility as a System of Systems: An LLM-Enhanced Holonic Approach | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems | May 1, 2025 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks | May 1, 2025 | Deep Reinforcement LearningDrug Design | —Unverified | 0 |
| A Survey on Large Language Model based Human-Agent Systems | May 1, 2025 | Human Agent CollaborationLanguage Modeling | CodeCode Available | 0 |
| UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation | Apr 30, 2025 | DiagnosticLarge Language Model | CodeCode Available | 1 |
| Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Apr 30, 2025 | Large Language ModelMotion Planning | —Unverified | 0 |
| Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges | Apr 30, 2025 | Bayesian InferenceLanguage Model Evaluation | —Unverified | 0 |
| Does the Prompt-based Large Language Model Recognize Students' Demographics and Introduce Bias in Essay Scoring? | Apr 30, 2025 | Automated Essay ScoringFairness | —Unverified | 0 |
| DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Apr 30, 2025 | Automated Theorem ProvingLarge Language Model | CodeCode Available | 5 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Consistency-aware Fake Videos Detection on Short Video Platforms | Apr 30, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 0 |
| Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning | Apr 30, 2025 | Large Language Model | —Unverified | 0 |
| LLM Enhancer: Merged Approach using Vector Embedding for Reducing Large Language Model Hallucinations with External Knowledge | Apr 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation | Apr 29, 2025 | Code GenerationLanguage Model Evaluation | —Unverified | 0 |
| Computational Reasoning of Large Language Models | Apr 29, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| BrAIcht, a theatrical agent that speaks like Bertolt Brecht's characters | Apr 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WenyanGPT: A Large Language Model for Classical Chinese Tasks | Apr 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Framework to Assess the Persuasion Risks Large Language Model Chatbots Pose to Democratic Societies | Apr 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM-Enabled EV Charging Stations Recommendation | Apr 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cognitive maps are generative programs | Apr 29, 2025 | Computational EfficiencyLarge Language Model | —Unverified | 0 |
| GVPO: Group Variance Policy Optimization for Large Language Model Post-Training | Apr 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoJudge: Judge Decoding Without Manual Annotation | Apr 28, 2025 | GSM8KLarge Language Model | —Unverified | 0 |
| Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search | Apr 28, 2025 | Combinatorial OptimizationLanguage Modeling | —Unverified | 0 |
| PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Apr 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Apr 28, 2025 | GPULarge Language Model | —Unverified | 0 |
| Intelligent4DSE: Optimizing High-Level Synthesis Design Space Exploration with Graph Neural Networks and Large Language Models | Apr 28, 2025 | Evolutionary AlgorithmsGraph Neural Network | —Unverified | 0 |
| An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination | Apr 28, 2025 | Code GenerationHallucination | —Unverified | 0 |
| Leveraging LLM to Strengthen ML-Based Cross-Site Scripting Detection | Apr 28, 2025 | Large Language Model | —Unverified | 0 |
| CodeBC: A More Secure Large Language Model for Smart Contract Code Generation in Blockchain | Apr 28, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning | Apr 27, 2025 | Large Language ModelMathematical Reasoning | —Unverified | 0 |
| GenTorrent: Scaling Large Language Model Serving with An Overley Network | Apr 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |