| Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Real Barrier to LLM Agent Usability is Agentic ROI | May 23, 2025 | Large Language Model | —Unverified | 0 |
| ProgRM: Build Better GUI Agents with Progress Rewards | May 23, 2025 | Imitation LearningLarge Language Model | —Unverified | 0 |
| Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence | May 23, 2025 | GPULarge Language Model | —Unverified | 0 |
| Large language model as user daily behavior data generator: balancing population diversity and individual personality | May 23, 2025 | Data AugmentationDiversity | —Unverified | 0 |
| How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | May 23, 2025 | Large Language Model | —Unverified | 0 |
| Simulating Macroeconomic Expectations using LLM Agents | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |