Mollification Effects of Policy Gradient Methods May 28, 2024 continuous-control Continuous Control
— Unverified 0Offline Reinforcement Learning from Datasets with Structured Non-Stationarity May 23, 2024 continuous-control Continuous Control
Code Code Available 0Investigating the Impact of Choice on Deep Reinforcement Learning for Space Controls May 20, 2024 continuous-control Continuous Control
— Unverified 0Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning May 20, 2024 continuous-control Continuous Control
— Unverified 0The Curse of Diversity in Ensemble-Based Exploration May 7, 2024 Attribute continuous-control
Code Code Available 0CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics May 4, 2024 continuous-control Continuous Control
Code Code Available 0Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning May 4, 2024 continuous-control Continuous Control
— Unverified 0AFU: Actor-Free critic Updates in off-policy RL for continuous control Apr 24, 2024 continuous-control Continuous Control
Code Code Available 0Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation Apr 22, 2024 continuous-control Continuous Control
— Unverified 0On the stability of Lipschitz continuous control problems and its application to reinforcement learning Apr 20, 2024 continuous-control Continuous Control
— Unverified 0Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation Apr 19, 2024 continuous-control Continuous Control
Code Code Available 0LTL-Constrained Policy Optimization with Cycle Experience Replay Apr 17, 2024 continuous-control Continuous Control
— Unverified 0Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms Apr 16, 2024 continuous-control Continuous Control
— Unverified 0NoiseNCA: Noisy Seed Improves Spatio-Temporal Continuity of Neural Cellular Automata Apr 9, 2024 continuous-control Continuous Control
— Unverified 0Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution Apr 5, 2024 continuous-control Continuous Control
— Unverified 0Decision Transformer as a Foundation Model for Partially Observable Continuous Control Apr 3, 2024 continuous-control Continuous Control
— Unverified 0Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration Mar 31, 2024 continuous-control Continuous Control
— Unverified 0Demystifying the Physics of Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making Mar 18, 2024 Autonomous Vehicles continuous-control
— Unverified 0Reinforcement Learning from Delayed Observations via World Models Mar 18, 2024 continuous-control Continuous Control
Code Code Available 0Online Policy Learning from Offline Preferences Mar 15, 2024 continuous-control Continuous Control
— Unverified 0Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning Mar 12, 2024 continuous-control Continuous Control
— Unverified 0Sample-Optimal Zero-Violation Safety For Continuous Control Mar 9, 2024 continuous-control Continuous Control
— Unverified 0Noisy Spiking Actor Network for Exploration Mar 7, 2024 continuous-control Continuous Control
— Unverified 0Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning Mar 4, 2024 Atari Games continuous-control
— Unverified 0A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations Feb 29, 2024 continuous-control Continuous Control
Code Code Available 0DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Feb 25, 2024 continuous-control Continuous Control
— Unverified 0ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization Feb 22, 2024 continuous-control Continuous Control
— Unverified 0Dataset Clustering for Improved Offline Policy Learning Feb 14, 2024 Clustering continuous-control
Code Code Available 0Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks Feb 14, 2024 Computational Efficiency continuous-control
— Unverified 0Offline Actor-Critic Reinforcement Learning Scales to Large Models Feb 8, 2024 continuous-control Continuous Control
— Unverified 0Differentially Private Deep Model-Based Reinforcement Learning Feb 8, 2024 continuous-control Continuous Control
— Unverified 0Learning Diverse Policies with Soft Self-Generated Guidance Feb 7, 2024 continuous-control Continuous Control
— Unverified 0FlowPG: Action-constrained Policy Gradient with Normalizing Flows Feb 7, 2024 continuous-control Continuous Control
Code Code Available 0Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents Feb 6, 2024 continuous-control Continuous Control
Code Code Available 0Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence Feb 5, 2024 continuous-control Continuous Control
— Unverified 0Deep Exploration with PAC-Bayes Feb 5, 2024 continuous-control Continuous Control
— Unverified 0Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences Feb 5, 2024 continuous-control Continuous Control
— Unverified 0A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning Jan 29, 2024 continuous-control Continuous Control
— Unverified 0Pulse Width Modulation Method Applied to Nonlinear Model Predictive Control on an Under-actuated Small Satellite Jan 21, 2024 continuous-control Continuous Control
— Unverified 0Reconciling Spatial and Temporal Abstractions for Goal Representation Jan 18, 2024 continuous-control Continuous Control
Code Code Available 0Identifying Policy Gradient Subspaces Jan 12, 2024 continuous-control Continuous Control
— Unverified 0The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards Jan 11, 2024 continuous-control Continuous Control
Code Code Available 0A Minimaximalist Approach to Reinforcement Learning from Human Feedback Jan 8, 2024 continuous-control Continuous Control
— Unverified 0Trajectory-Oriented Policy Optimization with Sparse Rewards Jan 4, 2024 continuous-control Continuous Control
— Unverified 0Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning Jan 1, 2024 continuous-control Continuous Control
— Unverified 0Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms Dec 28, 2023 continuous-control Continuous Control
Code Code Available 0REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback Dec 22, 2023 Bilevel Optimization continuous-control
— Unverified 0OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments Dec 19, 2023 continuous-control Continuous Control
— Unverified 0Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System Dec 16, 2023 continuous-control Continuous Control
Code Code Available 0Risk-Aware Continuous Control with Neural Contextual Bandits Dec 15, 2023 continuous-control Continuous Control
Code Code Available 0