| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision MakingDiversity | —Unverified | 0 |
| On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models | May 22, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On the Expressivity of Multidimensional Markov Reward | Jul 22, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| On the Modeling Capabilities of Large Language Models for Sequential Decision Making | Oct 8, 2024 | Decision MakingDiversity | —Unverified | 0 |
| On the Performance of Empirical Risk Minimization with Smoothed Data | Feb 22, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes | Jun 12, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Mar 1, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies | Aug 17, 2016 | Decision MakingReinforcement Learning | —Unverified | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | May 27, 2024 | Decision MakingOffline RL | —Unverified | 0 |