| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 |
| Auction-Based Scheduling | Oct 18, 2023 | Decision MakingFairness | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Partially Observable Stochastic Games with Neural Perception Mechanisms | Oct 17, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 |
| Autonomous Tree-search Ability of Large Language Models | Oct 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision MakingNavigate | —Unverified | 0 |
| Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control | Oct 8, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach | Oct 7, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |