Learning State Abstractions for Transfer in Continuous Control Feb 8, 2020 continuous-control Continuous Control
Code Code Available 0Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function Sep 14, 2022 Decision Making Deep Reinforcement Learning
Code Code Available 0An Empirical Study of Deep Reinforcement Learning in Continuing Tasks Jan 12, 2025 Deep Reinforcement Learning MuJoCo
Code Code Available 0Improving Automatic Source Code Summarization via Deep Reinforcement Learning Nov 17, 2018 Code Summarization Decoder
Code Code Available 0A Simple, Fast Diverse Decoding Algorithm for Neural Generation Nov 25, 2016 Abstractive Text Summarization Diversity
Code Code Available 0Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning Nov 15, 2019 Benchmarking Diversity
Code Code Available 0Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning Jul 15, 2022 Data Augmentation Deep Reinforcement Learning
Code Code Available 0Empowering recommender systems using automatically generated Knowledge Graphs and Reinforcement Learning Jul 11, 2023 Decision Making Knowledge Graphs
Code Code Available 0Empowerment-driven Exploration using Mutual Information Estimation Oct 11, 2018 Deep Reinforcement Learning Montezuma's Revenge
Code Code Available 0A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping Sep 14, 2017 Decision Making Image Cropping
Code Code Available 0Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication Jan 12, 2019 Deep Reinforcement Learning Reinforcement Learning
Code Code Available 0Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity Nov 7, 2024 Diversity Meta Reinforcement Learning
Code Code Available 0A General, Evolution-Inspired Reward Function for Social Robotics Feb 1, 2022 Cultural Vocal Bursts Intensity Prediction Imitation Learning
Code Code Available 0A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning Jun 22, 2017 Action Detection Position
Code Code Available 0Curiosity Killed or Incapacitated the Cat and the Asymptotically Optimal Agent Jun 5, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0On the Expressivity of Neural Networks for Deep Reinforcement Learning Oct 14, 2019 Deep Reinforcement Learning MuJoCo
Code Code Available 0Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn Sep 7, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Curiosity-Driven Multi-Criteria Hindsight Experience Replay Jun 9, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0IRLAS: Inverse Reinforcement Learning for Architecture Search Dec 13, 2018 Neural Architecture Search reinforcement-learning
Code Code Available 0Improving Dialogue Management: Quality Datasets vs Models Oct 2, 2023 Dialog Learning Dialogue Management
Code Code Available 0GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms Feb 14, 2018 Deep Reinforcement Learning Diversity
Code Code Available 0CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning Feb 15, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control Dec 24, 2018 Deep Reinforcement Learning OpenAI Gym
Code Code Available 0Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment May 28, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning Dec 16, 2023 Autonomous Driving Autonomous Racing
Code Code Available 0End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learning Apr 26, 2021 Deep Reinforcement Learning Electromyography (EMG)
Code Code Available 0GFlowNets and variational inference Oct 2, 2022 Diversity Reinforcement Learning (RL)
Code Code Available 0End-to-End Learning of Communications Systems Without a Channel Model Apr 6, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning of feature points without additional supervision improves reinforcement learning from images Jun 15, 2021 Continuous Control reinforcement-learning
Code Code Available 0GFlowNet Training by Policy Gradients Aug 12, 2024 Reinforcement Learning (RL)
Code Code Available 0Improving Experience Replay through Modeling of Similar Transitions' Sets Nov 12, 2021 Atari Games reinforcement-learning
Code Code Available 0End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes May 25, 2023 Bayesian Optimisation Inductive Bias
Code Code Available 0End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances Nov 25, 2019 Autonomous Driving reinforcement-learning
Code Code Available 0Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games Sep 24, 2020 Q-Learning Reinforcement Learning (RL)
Code Code Available 0GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning Mar 2, 2023 Multi-agent Reinforcement Learning Q-Learning
Code Code Available 0Gifting in multi-agent reinforcement learning May 5, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning Oct 13, 2022 Data Augmentation reinforcement-learning
Code Code Available 0Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents Dec 18, 2017 Deep Reinforcement Learning Policy Gradient Methods
Code Code Available 0End-to-End Reinforcement Learning for Automatic Taxonomy Induction May 10, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0End-to-End Reinforcement Learning for Torque Based Variable Height Hopping Jul 31, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies Jun 6, 2019 Deep Reinforcement Learning Reinforcement Learning
Code Code Available 0End-to-End Robotic Reinforcement Learning without Reward Engineering Apr 16, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks Mar 21, 2019 continuous-control Continuous Control
Code Code Available 0CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics May 4, 2024 continuous-control Continuous Control
Code Code Available 0End-to-End Video Captioning with Multitask Reinforcement Learning Mar 21, 2018 GPU reinforcement-learning
Code Code Available 0A general class of surrogate functions for stable and efficient reinforcement learning Aug 12, 2021 MuJoCo Policy Gradient Methods
Code Code Available 0"Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations Jun 5, 2024 Active Learning Reinforcement Learning (RL)
Code Code Available 0Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays Feb 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning-Driven Exploration for Reinforcement Learning Jun 17, 2019 Efficient Exploration FPS Games
Code Code Available 0Energy-Based Hindsight Experience Prioritization Oct 2, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0