| Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | Oct 25, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | Oct 25, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision MakingDiversity | —Unverified | 0 |
| Hierarchical Upper Confidence Bounds for Constrained Online Learning | Oct 22, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Oct 16, 2024 | Attributecounterfactual | CodeCode Available | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Oct 14, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian InferenceDecision Making | —Unverified | 0 |