SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1190111950 of 15113 papers

TitleStatusHype
Single Episode Policy Transfer in Reinforcement LearningCode0
Adaptive Discretization for Episodic Reinforcement Learning in Metric SpacesCode0
Adaptive Curriculum Generation from Demonstrations for Sim-to-Real Visuomotor ControlCode0
Adaptive Trade-Offs in Off-Policy Learning0
Conditional Importance Sampling for Off-Policy Learning0
Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use caseCode0
Creativity in Robot Manipulation with Deep Reinforcement Learning0
Soft Actor-Critic for Discrete Action SettingsCode0
Parallel Exploration via Negatively Correlated Search0
Reinforcement Learning for Robotic Manipulation using Simulated Locomotion DemonstrationsCode0
Reinforced Bit Allocation under Task-Driven Semantic Distortion Metrics0
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision ProcessesCode0
On the Reduction of Variance and Overestimation of Deep Q-Learning0
Actor Critic with Differentially Private Critic0
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme0
Coordination of PV Smart Inverters Using Deep Reinforcement Learning for Grid Voltage Regulation0
On the Expressivity of Neural Networks for Deep Reinforcement LearningCode0
Dynamic Graph Configuration with Reinforcement Learning for Connected Autonomous Vehicle Trajectories0
Federated Transfer Reinforcement Learning for Autonomous Driving0
Policy Poisoning in Batch Reinforcement Learning and ControlCode0
Neural Program Synthesis By Self-Learning0
Rethinking Exposure Bias In Language Modeling0
QoS and Jamming-Aware Wireless Networking Using Deep Reinforcement Learning0
Uncertainty Quantification and Exploration for Reinforcement Learning0
Influence-Based Multi-Agent ExplorationCode0
Curiosity-Driven Recommendation Strategy for Adaptive Learning via Deep Reinforcement Learning0
Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes using Transfer LearningCode0
Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation0
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement LearningCode0
Improving Gradient Estimation in Evolutionary Strategies With Past Descent Directions0
Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression and Challenge0
Modeling Cyber-Physical Human Systems via an Interplay Between Reinforcement Learning and Game Theory0
RLCard: A Toolkit for Reinforcement Learning in Card GamesCode0
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary RewardsCode0
Autonomous Driving using Safe Reinforcement Learning by Incorporating a Regret-based Human Lane-Changing Decision Model0
Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning0
Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods0
Defensive Escort Teams via Multi-Agent Deep Reinforcement Learning0
Improving Generalization in Meta Reinforcement Learning using Learned Objectives0
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments0
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models0
Ctrl-Z: Recovering from Instability in Reinforcement Learning0
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Black-box Optimizer with Implicit Natural Gradient0
Multiple-objective Reinforcement Learning for Inverse Design and Identification0
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books0
Model-Based Reinforcement Learning Exploiting State-Action Equivalence0
TorchBeast: A PyTorch Platform for Distributed RLCode0
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals0
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
Show:102550
← PrevPage 239 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified