| A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment | Jul 26, 2019 | MuJoCoReinforcement Learning | —Unverified | 0 | 0 |
| A Unifying Framework for Causal Imitation Learning with Hidden Confounders | Feb 11, 2025 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | DiagnosticMuJoCo | —Unverified | 0 | 0 |
| Auto-Encoding Inverse Reinforcement Learning | Sep 29, 2021 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization | Apr 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Average-Reward Reinforcement Learning with Trust Region Methods | Jun 7, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| AVG-DICE: Stationary Distribution Correction by Regression | Mar 3, 2025 | AvgMuJoCo | —Unverified | 0 | 0 |
| Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts | Aug 4, 2022 | Generative Adversarial NetworkModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Balancing Constraints and Rewards with Meta-Gradient D4PG | Oct 13, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Bayesian Distributional Policy Gradients | Mar 20, 2021 | Atari GamesContrastive Learning | —Unverified | 0 | 0 |