A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning Dec 31, 2021 Atari Games Meta Reinforcement Learning
Code Code Available 0On the Design of Safe Continual RL Methods for Control of Nonlinear Systems Feb 21, 2025 Continual Learning MuJoCo
Code Code Available 0On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects Jan 2, 2023 Reinforcement Learning (RL)
Code Code Available 0Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification Mar 23, 2021 General Classification Reinforcement Learning (RL)
Code Code Available 0TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning Jun 4, 2018 Deep Reinforcement Learning Reinforcement Learning
Code Code Available 0TD-Regularized Actor-Critic Methods Dec 19, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0ReInform: Selecting paths with reinforcement learning for contextualized link prediction Nov 19, 2022 Link Prediction Prediction
Code Code Available 0Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning Feb 15, 2019 Deep Reinforcement Learning Imitation Learning
Code Code Available 0On the calibration of compartmental epidemiological models Dec 9, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 0Replication of Impedance Identification Experiments on a Reinforcement-Learning-Controlled Digital Twin of Human Elbows Feb 5, 2024 Reinforcement Learning (RL)
Code Code Available 0Teach Biped Robots to Walk via Gait Principles and Reinforcement Learning with Adversarial Critics Oct 22, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning Nov 29, 2023 Deep Reinforcement Learning Long Form Question Answering
Code Code Available 0Project proposal: A modular reinforcement learning based automated theorem prover Sep 6, 2022 OpenAI Gym reinforcement-learning
Code Code Available 0SFV: Reinforcement Learning of Physical Skills from Videos Oct 8, 2018 Deep Reinforcement Learning Pose Estimation
Code Code Available 0Understanding the Evolution of Linear Regions in Deep Reinforcement Learning Oct 24, 2022 continuous-control Continuous Control
Code Code Available 0Shapechanger: Environments for Transfer Learning Sep 15, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0On Solving the 2-Dimensional Greedy Shooter Problem for UAVs Nov 2, 2019 Q-Learning reinforcement-learning
Code Code Available 0Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences May 23, 2024 Reinforcement Learning (RL)
Code Code Available 0Shaping Advice in Deep Multi-Agent Reinforcement Learning Mar 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Shaping Advice in Deep Reinforcement Learning Feb 19, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 0On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency Mar 3, 2022 Offline RL reinforcement-learning
Code Code Available 0Representation Learning for Grounded Spatial Reasoning Jul 13, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Teaching a Machine to Read Maps with Deep Reinforcement Learning Nov 20, 2017 Deep Reinforcement Learning Navigate
Code Code Available 0Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning Sep 28, 2017 Collision Avoidance Deep Reinforcement Learning
Code Code Available 0Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use Oct 31, 2024 Diversity Informativeness
Code Code Available 0Shapley Machine: A Game-Theoretic Framework for N-Agent Ad Hoc Teamwork Jun 12, 2025 Reinforcement Learning (RL)
Code Code Available 0Shared Autonomy via Deep Reinforcement Learning Feb 6, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Progressive Neural Architecture Search Dec 2, 2017 Evolutionary Algorithms General Classification
Code Code Available 0Understanding the impact of entropy on policy optimization Nov 27, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Reinforcement Learning with Unsupervised Auxiliary Tasks Nov 16, 2016 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Towards optimized actions in critical situations of soccer games with deep reinforcement learning Sep 14, 2021 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control Aug 10, 2017 continuous-control Continuous Control
Code Code Available 0Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout Jan 26, 2023 MuJoCo reinforcement-learning
Code Code Available 0TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control Jan 1, 2021 continuous-control Continuous Control
Code Code Available 0Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations Sep 28, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Probing the Robustness of Trained Metrics for Conversational Dialogue Systems Feb 28, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0On-Policy Trust Region Policy Optimisation with Replay Buffers Jan 18, 2019 Continuous Control Deep Reinforcement Learning
Code Code Available 0TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning Mar 13, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Reinforcement Learning with Success Induced Task Prioritization Dec 30, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning Apr 19, 2021 Deep Reinforcement Learning Mixture-of-Experts
Code Code Available 0Reset-free Trial-and-Error Learning for Robot Damage Recovery Oct 13, 2016 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Value Prediction Network Jul 11, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 0Shortest Edit Path Crossover: A Theory-driven Solution to the Permutation Problem in Evolutionary Neural Architecture Search Oct 25, 2022 Evolutionary Algorithms Neural Architecture Search
Code Code Available 0Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning Jun 12, 2024 D4RL MuJoCo
Code Code Available 0Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version) Jul 10, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Residual Loss Prediction: Reinforcement Learning With No Incremental Feedback Jan 1, 2018 Multi-Armed Bandits Prediction
Code Code Available 0Residual Policy Learning Dec 15, 2018 Deep Reinforcement Learning MuJoCo
Code Code Available 0What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes Nov 10, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective May 29, 2023 Knowledge Distillation Reinforcement Learning (RL)
Code Code Available 0Understanding the Safety Requirements for Learning-based Power Systems Operations Oct 11, 2021 BIG-bench Machine Learning Decision Making
Code Code Available 0