Title | Date | Tags | Code status
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations | Jan 24, 2024 | Continuous Control | Code Available 1
Pulse Width Modulation Method Applied to Nonlinear Model Predictive Control on an Under-actuated Small Satellite | Jan 21, 2024 | Continuous Control | Unverified 0
Reconciling Spatial and Temporal Abstractions for Goal Representation | Jan 18, 2024 | Continuous Control | Code Available 0
Identifying Policy Gradient Subspaces | Jan 12, 2024 | Continuous Control | Unverified 0
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards | Jan 11, 2024 | Continuous Control | Code Available 0
A Minimaximalist Approach to Reinforcement Learning from Human Feedback | Jan 8, 2024 | Continuous Control | Unverified 0
Trajectory-Oriented Policy Optimization with Sparse Rewards | Jan 4, 2024 | Continuous Control | Unverified 0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | Continuous Control | Unverified 0
Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms | Dec 28, 2023 | Continuous Control | Code Available 0
REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback | Dec 22, 2023 | Bilevel Optimization, Continuous Control | Unverified 0
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments | Dec 19, 2023 | Continuous Control | Unverified 0
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System | Dec 16, 2023 | Continuous Control | Code Available 0
Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | Continuous Control | Code Available 0
World Models via Policy-Guided Trajectory Diffusion | Dec 13, 2023 | Continuous Control | Code Available 1
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills | Dec 11, 2023 | Continuous Control | Code Available 0
Synergizing Quality-Diversity with Descriptor-Conditioned Reinforcement Learning | Dec 10, 2023 | Continuous Control, Diversity | Code Available 0
A Q-learning approach to the continuous control problem of robot inverted pendulum balancing | Dec 5, 2023 | Continuous Control | Unverified 0
RLIF: Interactive Imitation Learning as Reinforcement Learning | Nov 21, 2023 | Continuous Control | Unverified 0
Visual tracking brain computer interface | Nov 21, 2023 | Brain Computer Interface, Continuous Control | Unverified 0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | Continuous Control | Unverified 0
An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning | Nov 11, 2023 | Continuous Control | Unverified 0
Real-Time Recurrent Reinforcement Learning | Nov 8, 2023 | Continuous Control | Unverified 0
Time-Efficient Reinforcement Learning with Stochastic Stateful Policies | Nov 7, 2023 | Continuous Control | Unverified 0
Imitation Bootstrapped Reinforcement Learning | Nov 3, 2023 | Continuous Control, Imitation Learning | Unverified 0
Mix-ME: Quality-Diversity for Multi-Agent Learning | Nov 3, 2023 | Continuous Control | Unverified 0
Learning to Discover Skills through Guidance | Oct 31, 2023 | Continuous Control | Unverified 0
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization | Oct 30, 2023 | Continuous Control | Code Available 1
TD-MPC2: Scalable, Robust World Models for Continuous Control | Oct 25, 2023 | Continuous Control | Code Available 2
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL | Oct 23, 2023 | Continuous Control | Unverified 0
Absolute Policy Optimization | Oct 20, 2023 | Atari Games, Continuous Control | Code Available 0
Analysis of potential flow networks: Variations in transport time with discrete, continuous, and selfish operation | Oct 17, 2023 | Continuous Control | Unverified 0
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | Continuous Control | Unverified 0
Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression | Oct 17, 2023 | Continuous Control | Unverified 0
Reduced Policy Optimization for Continuous Control with Hard Constraints | Oct 14, 2023 | Continuous Control | Code Available 1
Cross-Episodic Curriculum for Transformer Agents | Oct 12, 2023 | Continuous Control | Unverified 0
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL | Oct 11, 2023 | Continuous Control | Unverified 0
Boosting Continuous Control with Consistency Policy | Oct 10, 2023 | Continuous Control | Code Available 1
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | Oct 9, 2023 | Continuous Control | Unverified 0
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison | Oct 6, 2023 | Continuous Control, Reinforcement Learning | Unverified 0
Imitation Learning from Observation through Optimal Transport | Oct 2, 2023 | Continuous Control | Unverified 0
Improving Emotional Expression and Cohesion in Image-Based Playlist Description and Music Topics: A Continuous Parameterization Approach | Oct 2, 2023 | Continuous Control | Unverified 0
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control | Sep 26, 2023 | Continuous Control | Code Available 0
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs | Sep 25, 2023 | Continuous Control | Unverified 0
Emergent Communication in Multi-Agent Reinforcement Learning for Future Wireless Networks | Sep 12, 2023 | Autonomous Driving, Continuous Control | Unverified 0
Learning Shared Safety Constraints from Multi-task Demonstrations | Sep 1, 2023 | Continuous Control | Code Available 1
Bearing-based Formation with Disturbance Rejection | Aug 29, 2023 | Continuous Control | Unverified 0
Stabilizing Unsupervised Environment Design with a Learned Adversary | Aug 21, 2023 | Car Racing, Continuous Control | Code Available 0
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL | Aug 20, 2023 | Atari Games, Continuous Control | Unverified 0
ACRE: Actor-Critic with Reward-Preserving Exploration | Aug 14, 2023 | Continuous Control | Code Available 0
Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | Continuous Control | Code Available 0