SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 32763300 of 15113 papers

TitleStatusHype
DDPG based on multi-scale strokes for financial time series trading strategy0
Modified DDPG car-following model with a real-world human driving experience with CARLA simulator0
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning0
A Fast Convergence Theory for Offline Decision Making0
Dealing with Limited Backhaul Capacity in Millimeter Wave Systems: A Deep Reinforcement Learning Approach0
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning0
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum0
Dealing with Sparse Rewards Using Graph Neural Networks0
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning0
DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation0
INTAGS: Interactive Agent-Guided Simulation0
Agent Modeling as Auxiliary Task for Deep Reinforcement Learning0
Death and Suicide in Universal Artificial Intelligence0
A SUMO Framework for Deep Reinforcement Learning Experiments Solving Electric Vehicle Charging Dispatching Problem0
De-Biased Modelling of Search Click Behavior with Reinforcement Learning0
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems0
Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations0
Decentralized Automotive Radar Spectrum Allocation to Avoid Mutual Interference Using Reinforcement Learning0
Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning0
A Succinct Summary of Reinforcement Learning0
Deep Reinforcement Learning for NLP0
A Gentle Lecture Note on Filtrations in Reinforcement Learning0
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning0
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems0
Show:102550
← PrevPage 132 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified