Mollification Effects of Policy Gradient Methods | May 28, 2024 | continuous-control, Continuous Control | Unverified | 0
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control | May 25, 2024 | continuous-control, Continuous Control | Code Available | 2
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-control, Continuous Control | Code Available | 2
OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning | May 24, 2024 | continuous-control, Continuous Control | Code Available | 1
How to Leverage Diverse Demonstrations in Offline Imitation Learning | May 24, 2024 | continuous-control, Continuous Control | Code Available | 1
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-control, Continuous Control | Code Available | 0
Investigating the Impact of Choice on Deep Reinforcement Learning for Space Controls | May 20, 2024 | continuous-control, Continuous Control | Unverified | 0
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning | May 20, 2024 | continuous-control, Continuous Control | Unverified | 0
The Curse of Diversity in Ensemble-Based Exploration | May 7, 2024 | Attribute, continuous-control | Code Available | 0
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics | May 4, 2024 | continuous-control, Continuous Control | Code Available | 0
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning | May 4, 2024 | continuous-control, Continuous Control | Unverified | 0
REBEL: Reinforcement Learning via Regressing Relative Rewards | Apr 25, 2024 | continuous-control, Continuous Control | Code Available | 2
AFU: Actor-Free critic Updates in off-policy RL for continuous control | Apr 24, 2024 | continuous-control, Continuous Control | Code Available | 0
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation | Apr 22, 2024 | continuous-control, Continuous Control | Unverified | 0
On the stability of Lipschitz continuous control problems and its application to reinforcement learning | Apr 20, 2024 | continuous-control, Continuous Control | Unverified | 0
Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation | Apr 19, 2024 | continuous-control, Continuous Control | Code Available | 0
LTL-Constrained Policy Optimization with Cycle Experience Replay | Apr 17, 2024 | continuous-control, Continuous Control | Unverified | 0
Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms | Apr 16, 2024 | continuous-control, Continuous Control | Unverified | 0
NoiseNCA: Noisy Seed Improves Spatio-Temporal Continuity of Neural Cellular Automata | Apr 9, 2024 | continuous-control, Continuous Control | Unverified | 0
Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | Apr 5, 2024 | continuous-control, Continuous Control | Unverified | 0
Decision Transformer as a Foundation Model for Partially Observable Continuous Control | Apr 3, 2024 | continuous-control, Continuous Control | Unverified | 0
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration | Mar 31, 2024 | continuous-control, Continuous Control | Unverified | 0
Reinforcement Learning from Delayed Observations via World Models | Mar 18, 2024 | continuous-control, Continuous Control | Code Available | 0
Demystifying the Physics of Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making | Mar 18, 2024 | Autonomous Vehicles, continuous-control | Unverified | 0
Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics | Mar 15, 2024 | continuous-control, Continuous Control | Code Available | 1
Online Policy Learning from Offline Preferences | Mar 15, 2024 | continuous-control, Continuous Control | Unverified | 0
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-control, Continuous Control | Unverified | 0
Sample-Optimal Zero-Violation Safety For Continuous Control | Mar 9, 2024 | continuous-control, Continuous Control | Unverified | 0
Noisy Spiking Actor Network for Exploration | Mar 7, 2024 | continuous-control, Continuous Control | Unverified | 0
SplAgger: Split Aggregation for Meta-Reinforcement Learning | Mar 5, 2024 | continuous-control, Continuous Control | Code Available | 1
Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning | Mar 4, 2024 | Atari Games, continuous-control | Unverified | 0
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data | Mar 1, 2024 | continuous-control, Continuous Control | Code Available | 2
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations | Feb 29, 2024 | continuous-control, Continuous Control | Code Available | 0
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning | Feb 25, 2024 | continuous-control, Continuous Control | Unverified | 0
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization | Feb 22, 2024 | continuous-control, Continuous Control | Unverified | 0
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Feb 16, 2024 | continuous-control, Continuous Control | Code Available | 1
Dataset Clustering for Improved Offline Policy Learning | Feb 14, 2024 | Clustering, continuous-control | Code Available | 0
Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks | Feb 14, 2024 | Computational Efficiency, continuous-control | Unverified | 0
Hybrid Inverse Reinforcement Learning | Feb 13, 2024 | continuous-control, Continuous Control | Code Available | 1
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Feb 9, 2024 | Computational Efficiency, continuous-control | Code Available | 1
FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning | Feb 8, 2024 | continuous-control, Continuous Control | Code Available | 1
Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-control, Continuous Control | Unverified | 0
Differentially Private Deep Model-Based Reinforcement Learning | Feb 8, 2024 | continuous-control, Continuous Control | Unverified | 0
FlowPG: Action-constrained Policy Gradient with Normalizing Flows | Feb 7, 2024 | continuous-control, Continuous Control | Code Available | 0
Learning Diverse Policies with Soft Self-Generated Guidance | Feb 7, 2024 | continuous-control, Continuous Control | Unverified | 0
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available | 0
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences | Feb 5, 2024 | continuous-control, Continuous Control | Unverified | 0
Deep Exploration with PAC-Bayes | Feb 5, 2024 | continuous-control, Continuous Control | Unverified | 0
Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence | Feb 5, 2024 | continuous-control, Continuous Control | Unverified | 0
A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning | Jan 29, 2024 | continuous-control, Continuous Control | Unverified | 0