| Rethinking Transformers in Solving POMDPs | May 27, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 1 |
| Variational Offline Multi-agent Skill Discovery | May 26, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Inference of Utilities and Time Preference in Sequential Decision-Making | May 24, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Reinforcing Language Agents via Policy Optimization with Action Decomposition | May 23, 2024 | Sequential Decision Making | Unverified | 0 |
| Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality | May 23, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making | May 23, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | May 22, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models | May 22, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |