| Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control | May 1, 2020 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Towards Characterizing Divergence in Deep Q-Learning | Mar 21, 2019 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | Sep 25, 2019 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble | Jun 1, 2022 | Imitation Learning, MuJoCo | Unverified | 0 | 0 |
| Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning | Jan 31, 2019 | MuJoCo, reinforcement-learning | Unverified | 0 | 0 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Jul 15, 2025 | counterfactual, Decision Making | Unverified | 0 | 0 |
| Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning | Nov 19, 2021 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Understanding the Asymptotic Performance of Model-Based RL Methods | Sep 27, 2018 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 | 0 |
| Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games | Aug 19, 2022 | MuJoCo, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Universal Successor Features for Transfer Reinforcement Learning | Jan 5, 2020 | MuJoCo, reinforcement-learning | Unverified | 0 | 0 |