NADPEx: An on-policy temporally consistent exploration method for deep reinforcement learning Dec 21, 2018 continuous-control Continuous Control
— Unverified 00 SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning May 26, 2022 continuous-control Continuous Control
— Unverified 00 What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator Sep 28, 2020 continuous-control Continuous Control
— Unverified 00 The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning Jul 26, 2024 continuous-control Continuous Control
— Unverified 00 Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System Nov 20, 2020 continuous-control Continuous Control
— Unverified 00 Neural Architecture Evolution in Deep Reinforcement Learning for Continuous Control Oct 28, 2019 continuous-control Continuous Control
— Unverified 00 Neural Lyapunov Model Predictive Control Sep 28, 2020 continuous-control Continuous Control
— Unverified 00 Neural Simplex Architecture Aug 1, 2019 continuous-control Continuous Control
— Unverified 00 NoiseNCA: Noisy Seed Improves Spatio-Temporal Continuity of Neural Cellular Automata Apr 9, 2024 continuous-control Continuous Control
— Unverified 00 Noisy Spiking Actor Network for Exploration Mar 7, 2024 continuous-control Continuous Control
— Unverified 00 Normality-Guided Distributional Reinforcement Learning for Continuous Control Aug 28, 2022 continuous-control Continuous Control
— Unverified 00 Wasserstein Barycenter Soft Actor-Critic Jun 11, 2025 continuous-control Continuous Control
— Unverified 00 Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning Nov 13, 2021 continuous-control Continuous Control
— Unverified 00 ODE-based Recurrent Model-free Reinforcement Learning for POMDPs Sep 25, 2023 continuous-control Continuous Control
— Unverified 00 Capacity-Limited Reinforcement Learning: Applications in Deep Actor-Critic Methods for Continuous Control Sep 25, 2019 continuous-control Continuous Control
— Unverified 00 Off-Dynamics Inverse Reinforcement Learning from Hetero-Domain Oct 21, 2021 continuous-control Continuous Control
— Unverified 00 Adversarial Imitation Learning from Video using a State Observer Feb 1, 2022 continuous-control Continuous Control
— Unverified 00 Offline Actor-Critic Reinforcement Learning Scales to Large Models Feb 8, 2024 continuous-control Continuous Control
— Unverified 00 Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution Matching Mar 5, 2023 continuous-control Continuous Control
— Unverified 00 Offline Learning from Demonstrations and Unlabeled Experience Nov 27, 2020 continuous-control Continuous Control
— Unverified 00 Offline Multi-agent Reinforcement Learning via Score Decomposition May 9, 2025 continuous-control Continuous Control
— Unverified 00 Offline Policy Optimization in RL with Variance Regularizaton Dec 29, 2022 continuous-control Continuous Control
— Unverified 00 What Matters for Adversarial Imitation Learning? Jun 1, 2021 continuous-control Continuous Control
— Unverified 00 Offline Policy Optimization with Variance Regularization Jan 1, 2021 continuous-control Continuous Control
— Unverified 00 Offline Reinforcement Learning as Anti-Exploration Jun 11, 2021 continuous-control Continuous Control
— Unverified 00 What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Jan 1, 2021 Attribute continuous-control
— Unverified 00 Offline Reinforcement Learning with Soft Behavior Regularization Oct 14, 2021 continuous-control Continuous Control
— Unverified 00 Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? Oct 27, 2020 continuous-control Continuous Control
— Unverified 00 The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint Dec 9, 2018 continuous-control Continuous Control
— Unverified 00 Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay Nov 2, 2021 Computational Efficiency continuous-control
— Unverified 00 CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality Nov 12, 2022 continuous-control Continuous Control
— Unverified 00 The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously Jul 11, 2017 continuous-control Continuous Control
— Unverified 00 Off-policy Maximum Entropy Reinforcement Learning: Soft Actor-Critic with Advantage Weighted Mixture Policy (SAC-AWMP) Feb 7, 2020 continuous-control Continuous Control
— Unverified 00 Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift Nov 16, 2019 continuous-control Continuous Control
— Unverified 00 Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction Oct 22, 2021 continuous-control Continuous Control
— Unverified 00 Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning Jul 24, 2023 continuous-control Continuous Control
— Unverified 00 oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions Feb 20, 2020 continuous-control Continuous Control
— Unverified 00 Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains May 12, 2025 continuous-control Continuous Control
— Unverified 00 Mind the Model, Not the Agent: The Primacy Bias in Model-based RL Oct 23, 2023 continuous-control Continuous Control
— Unverified 00 On the importance of data collection for training general goal-reaching policies Nov 7, 2022 continuous-control Continuous Control
— Unverified 00 On Inductive Biases in Deep Reinforcement Learning Jul 5, 2019 continuous-control Continuous Control
— Unverified 00 On learning history based policies for controlling Markov decision processes Nov 6, 2022 continuous-control Continuous Control
— Unverified 00 Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression Oct 17, 2023 continuous-control Continuous Control
— Unverified 00 Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies Jun 13, 2020 continuous-control Continuous Control
— Unverified 00 Broad Critic Deep Actor Reinforcement Learning for Continuous Control Nov 24, 2024 Computational Efficiency continuous-control
— Unverified 00 Online Policy Learning from Offline Preferences Mar 15, 2024 continuous-control Continuous Control
— Unverified 00 Time-Constrained Robust MDPs Jun 12, 2024 continuous-control Continuous Control
— Unverified 00 Towards Tractable Optimism in Model-Based Reinforcement Learning Jun 21, 2020 continuous-control Continuous Control
— Unverified 00 Policy Optimization Reinforcement Learning with Entropy Regularization Dec 2, 2019 Continuous Control reinforcement-learning
— Unverified 00 On-Policy Robot Imitation Learning from a Converging Supervisor Jul 8, 2019 continuous-control Continuous Control