Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions Dec 5, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Predicting Real-time Scientific Experiments Using Transformer models and Reinforcement Learning Apr 25, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning Sep 11, 2019 Autonomous Vehicles Multi-Objective Reinforcement Learning
Code Code Available 0On Instrumental Variable Regression for Deep Offline Policy Evaluation May 21, 2021 regression Reinforcement Learning (RL)
Code Code Available 0Revisiting the Softmax Bellman Operator: New Benefits and New Perspective Dec 2, 2018 Atari Games Q-Learning
Code Code Available 0Reinforcement Learning under Threats Sep 5, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0MyCaffe: A Complete C# Re-Write of Caffe with Reinforcement Learning Oct 4, 2018 Deep Learning reinforcement-learning
Code Code Available 0Towards Similarity Graphs Constructed by Deep Reinforcement Learning Nov 27, 2019 Deep Reinforcement Learning graph construction
Code Code Available 0Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach Oct 30, 2017 Deep Reinforcement Learning Position
Code Code Available 0On Improving Deep Reinforcement Learning for POMDPs Apr 26, 2017 Atari Games Decision Making
Code Code Available 0ViZDoom Competitions: Playing Doom from Pixels Sep 10, 2018 Navigate reinforcement-learning
Code Code Available 0Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning Nov 15, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Reward Certification for Policy Smoothed Reinforcement Learning Dec 11, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Reward-Conditioned Policies Dec 31, 2019 Imitation Learning reinforcement-learning
Code Code Available 0Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method Mar 29, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application Mar 2, 2018 Decision Making Learning-To-Rank
Code Code Available 0Reward Delay Attacks on Deep Reinforcement Learning Sep 8, 2022 Deep Reinforcement Learning Q-Learning
Code Code Available 0Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care Aug 15, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Single Episode Policy Transfer in Reinforcement Learning Oct 17, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Reward Design for Reinforcement Learning Agents Mar 27, 2025 Meta-Learning reinforcement-learning
Code Code Available 0On Credit Assignment in Hierarchical Reinforcement Learning Mar 7, 2022 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0Reinforcement learning to learn quantum states for Heisenberg scaling accuracy Dec 3, 2024 Meta-Learning Quantum Machine Learning
Code Code Available 0Single-partition adaptive Q-learning Jul 14, 2020 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Reward Engineering for Generating Semi-structured Explanation Sep 15, 2023 Explanation Generation Reinforcement Learning (RL)
Code Code Available 0Reward Engineering for Object Pick and Place Training Jan 11, 2020 Object reinforcement-learning
Code Code Available 0Reward Estimation for Variance Reduction in Deep Reinforcement Learning May 9, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0On Context Distribution Shift in Task Representation Learning for Offline Meta RL Apr 1, 2023 continuous-control Continuous Control
Code Code Available 0Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective Jun 13, 2023 Learning-To-Rank Offline RL
Code Code Available 0Meta-Reinforcement Learning in Broad and Non-Parametric Environments Aug 8, 2021 Meta Reinforcement Learning reinforcement-learning
Code Code Available 0Towards Solving Text-based Games by Producing Adaptive Action Spaces Dec 3, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch Jun 12, 2020 Decision Making Deep Reinforcement Learning
Code Code Available 0TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization Jun 10, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0Towards Symbolic Reinforcement Learning with Common Sense Apr 23, 2018 Common Sense Reasoning Deep Reinforcement Learning
Code Code Available 0SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning Jun 21, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning? Dec 6, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 0Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization Nov 30, 2023 Policy Gradient Methods reinforcement-learning
Code Code Available 0Unified State Representation Learning under Data Augmentation Sep 12, 2022 Data Augmentation Domain Adaptation
Code Code Available 0Rewarding Coreference Resolvers for Being Consistent with World Knowledge Sep 5, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0On Catastrophic Interference in Atari 2600 Games Feb 28, 2020 Atari Games Deep Reinforcement Learning
Code Code Available 0PPO Dash: Improving Generalization in Deep Reinforcement Learning Jul 15, 2019 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences Feb 7, 2024 Anomaly Detection Behavioural cloning
Code Code Available 0The Arcade Learning Environment: An Evaluation Platform for General Agents Jul 19, 2012 Atari Games Benchmarking
Code Code Available 0PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation Oct 5, 2018 continuous-control Continuous Control
Code Code Available 0Mutation Testing of Deep Reinforcement Learning Based on Real Faults Jan 13, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Skill Decision Transformer Jan 31, 2023 D4RL Descriptive
Code Code Available 0Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation Nov 10, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0MUSE: Modularizing Unsupervised Sense Embeddings Apr 15, 2017 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation Jul 30, 2019 Decision Making Learning-To-Rank
Code Code Available 0Reward learning from human preferences and demonstrations in Atari Nov 15, 2018 Atari Games Deep Reinforcement Learning
Code Code Available 0The Atari Data Scraper Apr 11, 2021 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0