| Route Optimization via Environment-Aware Deep Network and Reinforcement Learning | Nov 16, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics | Apr 20, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Safe POMDP Online Planning via Shielding | Sep 19, 2023 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian OptimizationChange Point Detection | —Unverified | 0 | 0 |
| Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Sep 26, 2024 | Bayesian OptimizationChange Detection | —Unverified | 0 | 0 |
| Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 | 0 |