Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior May 21, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Self-Correcting Models for Model-Based Reinforcement Learning Dec 19, 2016 model Model-based Reinforcement Learning
Code Code Available 0MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator Dec 7, 2023 Offline RL reinforcement-learning
Code Code Available 0Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control Mar 10, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning Feb 24, 2021 Autonomous Driving reinforcement-learning
Code Code Available 0Opponent Modeling in Deep Reinforcement Learning Sep 18, 2016 Deep Reinforcement Learning Mixture-of-Experts
Code Code Available 0Pseudo-Rehearsal: Achieving Deep Reinforcement Learning without Catastrophic Forgetting Dec 6, 2018 Atari Games Continual Learning
Code Code Available 0Opponent Aware Reinforcement Learning Aug 22, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Towards Finding Longer Proofs May 30, 2019 Automated Theorem Proving reinforcement-learning
Code Code Available 0MICo: Improved representations via sampling-based state similarity for Markov decision processes Jun 3, 2021 Atari Games Deep Reinforcement Learning
Code Code Available 0Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning Jul 3, 2021 Attribute Inductive Bias
Code Code Available 0Self-Guided Evolution Strategies with Historical Estimated Gradients Apr 20, 2020 Reinforcement Learning (RL)
Code Code Available 0OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching Sep 9, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Zeroth-Order Actor-Critic: An Evolutionary Framework for Sequential Decision Problems Jan 29, 2022 continuous-control Continuous Control
Code Code Available 0Systematic Rectification of Language Models via Dead-end Analysis Feb 27, 2023 Reinforcement Learning (RL)
Code Code Available 0Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards Oct 14, 2020 Imitation Learning MuJoCo
Code Code Available 0Near Optimal Behavior via Approximate State Abstraction Jan 15, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes Oct 28, 2021 Causal Inference Management
Code Code Available 0MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning Dec 30, 2021 Active Learning Ethics
Code Code Available 0Operator World Models for Reinforcement Learning Jun 28, 2024 Decision Making reinforcement-learning
Code Code Available 0Self-Learning Exploration and Mapping for Mobile Robots via Deep Reinforcement Learning Jan 6, 2019 Computational Efficiency Deep Reinforcement Learning
Code Code Available 0Memory-based Deep Reinforcement Learning for Obstacle Avoidance in UAV with Limited Environment Knowledge Nov 8, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 0Tackling Asymmetric and Circular Sequential Social Dilemmas with Reinforcement Learning and Graph-based Tit-for-Tat Jun 26, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Tackling Error Propagation through Reinforcement Learning: A Case of Greedy Dependency Parsing Feb 22, 2017 Dependency Parsing reinforcement-learning
Code Code Available 0Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks Feb 25, 2016 Deep Reinforcement Learning Image Classification
Code Code Available 0VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning Sep 14, 2020 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0Self-Paced Context Evaluation for Contextual Reinforcement Learning Jun 9, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Proximal Distilled Evolutionary Reinforcement Learning Jun 24, 2019 OpenAI Gym reinforcement-learning
Code Code Available 0Proximal Curriculum with Task Correlations for Deep Reinforcement Learning May 3, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Learning Progress Driven Multi-Agent Curriculum May 20, 2022 Multi-agent Reinforcement Learning Open-Ended Question Answering
Code Code Available 0Uncertainty-Aware Reward-Free Exploration with General Function Approximation Jun 24, 2024 Reinforcement Learning (RL)
Code Code Available 0Memory Augmented Self-Play May 28, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Proximal Curriculum for Reinforcement Learning Agents Apr 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning Sep 18, 2018 Model Predictive Control Q-Learning
Code Code Available 0Self Punishment and Reward Backfill for Deep Q-Learning Apr 10, 2020 Atari Games Deep Reinforcement Learning
Code Code Available 0Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces Jun 2, 2023 Attribute reinforcement-learning
Code Code Available 0Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation Sep 29, 2017 Deep Reinforcement Learning Navigate
Code Code Available 0Provably Efficient Reinforcement Learning with Linear Function Approximation Jul 11, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0MOOSS: Mask-Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning Sep 2, 2024 Contrastive Learning graph construction
Code Code Available 0Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning Feb 18, 2025 Combinatorial Optimization Deep Reinforcement Learning
Code Code Available 0VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation Feb 24, 2023 Computational Efficiency Offline RL
Code Code Available 0Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions Nov 1, 2024 Bayesian Inference Offline RL
Code Code Available 0On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks Dec 24, 2021 Clustering reinforcement-learning
Code Code Available 0Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning Oct 26, 2021 Off-policy evaluation Open-Ended Question Answering
Code Code Available 0On the Reuse Bias in Off-Policy Reinforcement Learning Sep 15, 2022 continuous-control Continuous Control
Code Code Available 0Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control Dec 3, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs Jan 31, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Self-Supervised State-Control through Intrinsic Mutual Information Rewards Sep 25, 2019 OpenAI Gym reinforcement-learning
Code Code Available 0Welfare and Fairness in Multi-objective Reinforcement Learning Nov 30, 2022 Fairness Multi-Objective Reinforcement Learning
Code Code Available 0Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning Mar 15, 2020 Efficient Exploration reinforcement-learning
Code Code Available 0