Catalyst.RL: A Distributed Framework for Reproducible RL Research Feb 28, 2019 continuous-control Continuous Control
Code Code Available 1Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine Feb 25, 2019 continuous-control Continuous Control
Code Code Available 1CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity Feb 14, 2019 continuous-control Continuous Control
Code Code Available 1Off-Policy Deep Reinforcement Learning without Exploration Dec 7, 2018 continuous-control Continuous Control
Code Code Available 1Learning Latent Dynamics for Planning from Pixels Nov 12, 2018 continuous-control Continuous Control
Code Code Available 1Maximum a Posteriori Policy Optimisation Jun 14, 2018 continuous-control Continuous Control
Code Code Available 1Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning Mar 30, 2018 continuous-control Continuous Control
Code Code Available 1Simple random search provides a competitive approach to reinforcement learning Mar 19, 2018 Computational Efficiency continuous-control
Code Code Available 1Addressing Function Approximation Error in Actor-Critic Methods Feb 26, 2018 Continuous Control OpenAI Gym
Code Code Available 1Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Jan 4, 2018 Continuous Control Decision Making
Code Code Available 1DeepMind Control Suite Jan 2, 2018 continuous-control Continuous Control
Code Code Available 1Action Branching Architectures for Deep Reinforcement Learning Nov 24, 2017 continuous-control Continuous Control
Code Code Available 1Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Aug 17, 2017 Atari Games continuous-control
Code Code Available 1Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation Mar 1, 2017 continuous-control Continuous Control
Code Code Available 1#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning Nov 15, 2016 Atari Games continuous-control
Code Code Available 1Sample Efficient Actor-Critic with Experience Replay Nov 3, 2016 continuous-control Continuous Control
Code Code Available 1Continuous Deep Q-Learning with Model-based Acceleration Mar 2, 2016 continuous-control Continuous Control
Code Code Available 1Continuous control with deep reinforcement learning Sep 9, 2015 Action Detection continuous-control
Code Code Available 1Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved) Jul 17, 2025 continuous-control Continuous Control
— Unverified 0rQdia: Regularizing Q-Value Distributions With Image Augmentation Jun 26, 2025 continuous-control Continuous Control
— Unverified 0Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity Jun 20, 2025 continuous-control Continuous Control
Code Code Available 0Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute Jun 18, 2025 continuous-control Continuous Control
— Unverified 0Scaling Algorithm Distillation for Continuous Control with Mamba Jun 16, 2025 continuous-control Continuous Control
— Unverified 0DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty Jun 14, 2025 continuous-control Continuous Control
Code Code Available 0Wasserstein Barycenter Soft Actor-Critic Jun 11, 2025 continuous-control Continuous Control
— Unverified 0Reinforcement Learning via Implicit Imitation Guidance Jun 9, 2025 continuous-control Continuous Control
— Unverified 0BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning Jun 6, 2025 continuous-control Continuous Control
— Unverified 0AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization Jun 5, 2025 continuous-control Continuous Control
— Unverified 0Safe Planning and Policy Optimization via World Model Learning Jun 5, 2025 continuous-control Continuous Control
— Unverified 0Self-Composing Policies for Scalable Continual Reinforcement Learning Jun 4, 2025 continuous-control Continuous Control
— Unverified 0Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning Jun 4, 2025 continuous-control Continuous Control
— Unverified 0Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control May 30, 2025 continuous-control Continuous Control
— Unverified 0DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control May 29, 2025 continuous-control Continuous Control
— Unverified 0Equivalence of stochastic and deterministic policy gradients May 29, 2025 continuous-control Continuous Control
— Unverified 0Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better May 29, 2025 continuous-control Continuous Control
— Unverified 0Improving Value Estimation Critically Enhances Vanilla Policy Gradient May 25, 2025 continuous-control Continuous Control
Code Code Available 0Guided Policy Optimization under Partial Observability May 21, 2025 continuous-control Continuous Control
Code Code Available 0AM-PPO: (Advantage) Alpha-Modulation with Proximal Policy Optimization May 21, 2025 continuous-control Continuous Control
— Unverified 0World Models as Reference Trajectories for Rapid Motor Adaptation May 21, 2025 continuous-control Continuous Control
— Unverified 0Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation May 20, 2025 Computational Efficiency continuous-control
Code Code Available 0KIPPO: Koopman-Inspired Proximal Policy Optimization May 20, 2025 Computational Efficiency continuous-control
— Unverified 0CIE: Controlling Language Model Text Generations Using Continuous Signals May 19, 2025 continuous-control Continuous Control
Code Code Available 0Bi-Level Policy Optimization with Nyström Hypergradients May 16, 2025 Bilevel Optimization continuous-control
— Unverified 0Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control May 13, 2025 Computational Efficiency continuous-control
— Unverified 0Adaptive Diffusion Policy Optimization for Robotic Manipulation May 13, 2025 continuous-control Continuous Control
Code Code Available 0Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains May 12, 2025 continuous-control Continuous Control
— Unverified 0Offline Multi-agent Reinforcement Learning via Score Decomposition May 9, 2025 continuous-control Continuous Control
— Unverified 0Enhanced Robust Tracking Control: An Online Learning Approach May 8, 2025 continuous-control Continuous Control
Code Code Available 0CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations May 8, 2025 continuous-control Continuous Control
— Unverified 0Policy-labeled Preference Learning: Is Preference Enough for RLHF? May 6, 2025 continuous-control Continuous Control
— Unverified 0