| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision Making, Sequential Decision Making | — Unverified | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial Optimization, Decision Making | — Unverified | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | Continuous Control | — Unverified | 0 |
| Likelihood Ratio Confidence Sets for Sequential Decision Making | Nov 8, 2023 | Decision Making, Sequential Decision Making | — Unverified | 0 |
| Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search | Nov 6, 2023 | Decision Making, Graph Generation | — Unverified | 0 |
| Using General Value Functions to Learn Domain-Backed Inventory Management Policies | Nov 3, 2023 | Decision Making, Management | — Unverified | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian Optimization, Change Point Detection | — Unverified | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision Making, Hierarchical Reinforcement Learning | — Unverified | 0 |