| Action Advising with Advice Imitation in Deep Reinforcement Learning | Apr 17, 2021 | Atari Games, Behavioural cloning | Code Available | 0 | 5 |
| Policy Improvement using Language Feedback Models | Feb 12, 2024 | Behavioural cloning, Imitation Learning | Code Available | 0 | 5 |
| Improving Trust Estimation in Human-Robot Collaboration Using Beta Reputation at Fine-grained Timescales | Nov 4, 2024 | Bayesian Inference, Behavioural cloning | Code Available | 0 | 5 |
| Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learning | Mar 26, 2023 | Behavioural cloning, Benchmarking | Code Available | 0 | 5 |
| A Pragmatic Look at Deep Imitation Learning | Aug 4, 2021 | Behavioural cloning, D4RL | Code Available | 0 | 5 |
| OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences | Feb 7, 2024 | Anomaly Detection, Behavioural cloning | Code Available | 0 | 5 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | Mar 15, 2025 | Behavioural cloning, Offline RL | —Unverified | 0 | 0 |
| Closing the gap: Optimizing Guidance and Control Networks through Neural ODEs | Apr 25, 2024 | Behavioural cloning | —Unverified | 0 | 0 |
| Autonomous Vehicle Controllers From End-to-End Differentiable Simulation | Sep 12, 2024 | Autonomous Vehicles, Behavioural cloning | —Unverified | 0 | 0 |
| Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning | Nov 21, 2022 | Behavioural cloning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |